python - Making a pandas DataFrame from a list of lists with different sizes

Reposted by: 太空宇宙, updated: 2023-11-04 02:53:35

I have data like this:

genre_list
Out[7]: 
0                    [Action, Adventure, Fantasy, Sci-Fi]
1                            [Action, Adventure, Fantasy]
2                           [Action, Adventure, Thriller]
3                                      [Action, Thriller]
4                                           [Documentary]
5                             [Action, Adventure, Sci-Fi]
6                            [Action, Adventure, Romance]
7       [Adventure, Animation, Comedy, Family, Fantasy...
8                             [Action, Adventure, Sci-Fi]
9                   [Adventure, Family, Fantasy, Mystery]
10                            [Action, Adventure, Sci-Fi]
11                            [Action, Adventure, Sci-Fi]

I wrote this code to build a DataFrame from lists of different sizes:

genre_df = pd.DataFrame()
for i in range(len(genre_list)):
    genre_df = genre_df.append(pd.DataFrame(genre_list[i]).T)

and got this:

genre_df.head()
Out[9]: 
             0          1         2       3    4    5    6    7
0       Action  Adventure   Fantasy  Sci-Fi  NaN  NaN  NaN  NaN
0       Action  Adventure   Fantasy     NaN  NaN  NaN  NaN  NaN
0       Action  Adventure  Thriller     NaN  NaN  NaN  NaN  NaN
0       Action   Thriller       NaN     NaN  NaN  NaN  NaN  NaN
0  Documentary        NaN       NaN     NaN  NaN  NaN  NaN  NaN

Is there a simpler way to get this DataFrame?

Best answer

You can use the DataFrame constructor: convert the values of genre_list to a NumPy array with .values, then to a list of lists with .tolist():

df1 = pd.DataFrame(genre_list.values.tolist(), index=genre_list.index)
print(df1)

              0          1         2        3        4
0        Action  Adventure   Fantasy   Sci-Fi     None
1        Action  Adventure   Fantasy     None     None
2        Action  Adventure  Thriller     None     None
3        Action   Thriller      None     None     None
4   Documentary       None      None     None     None
5        Action  Adventure    Sci-Fi     None     None
6        Action  Adventure   Romance     None     None
7     Adventure  Animation    Comedy   Family  Fantasy
8        Action  Adventure    Sci-Fi     None     None
9     Adventure     Family   Fantasy  Mystery     None
10       Action  Adventure    Sci-Fi     None     None
11       Action  Adventure    Sci-Fi     None     None
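
To try the constructor approach without the original dataset, here is a minimal, self-contained sketch. The small genre_list Series below is made-up sample data modeled on the question's output, not the asker's actual data.

```python
import pandas as pd

# Hypothetical sample data: a Series whose elements are lists of different lengths.
genre_list = pd.Series([
    ["Action", "Adventure", "Fantasy", "Sci-Fi"],
    ["Action", "Thriller"],
    ["Documentary"],
])

# Each inner list becomes one row; shorter rows are padded with missing values.
df1 = pd.DataFrame(genre_list.values.tolist(), index=genre_list.index)
print(df1)
```

The resulting frame has as many columns as the longest list, four in this sample.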

If you need to replace None with NaN:

import numpy as np
df1 = pd.DataFrame(genre_list.values.tolist(), index=genre_list.index).replace({None: np.nan})
print(df1)
              0          1         2        3        4
0        Action  Adventure   Fantasy   Sci-Fi      NaN
1        Action  Adventure   Fantasy      NaN      NaN
2        Action  Adventure  Thriller      NaN      NaN
3        Action   Thriller       NaN      NaN      NaN
4   Documentary        NaN       NaN      NaN      NaN
5        Action  Adventure    Sci-Fi      NaN      NaN
6        Action  Adventure   Romance      NaN      NaN
7     Adventure  Animation    Comedy   Family  Fantasy
8        Action  Adventure    Sci-Fi      NaN      NaN
9     Adventure     Family   Fantasy  Mystery      NaN
10       Action  Adventure    Sci-Fi      NaN      NaN
11       Action  Adventure    Sci-Fi      NaN      NaN
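
pandas' isna() and notna() treat both None and NaN as missing, so counting the real entries works before or after the replacement. A small sketch, again with made-up sample data rather than the asker's Series, that counts the genres in each row of the padded frame:

```python
import pandas as pd

# Hypothetical sample data, not the asker's actual Series.
genre_list = pd.Series([
    ["Action", "Adventure", "Fantasy", "Sci-Fi"],
    ["Action", "Thriller"],
    ["Documentary"],
])

df1 = pd.DataFrame(genre_list.values.tolist(), index=genre_list.index)

# notna() is True for real genres and False for the padding (None or NaN).
genres_per_row = df1.notna().sum(axis=1)
print(genres_per_row.tolist())
```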

Another, slower solution is to apply pd.Series to each element:

df1 = genre_list.apply(pd.Series)
              0          1         2        3        4
0        Action  Adventure   Fantasy   Sci-Fi      NaN
1        Action  Adventure   Fantasy      NaN      NaN
2        Action  Adventure  Thriller      NaN      NaN
3        Action   Thriller       NaN      NaN      NaN
4   Documentary        NaN       NaN      NaN      NaN
5        Action  Adventure    Sci-Fi      NaN      NaN
6        Action  Adventure   Romance      NaN      NaN
7     Adventure  Animation    Comedy   Family  Fantasy
8        Action  Adventure    Sci-Fi      NaN      NaN
9     Adventure     Family   Fantasy  Mystery      NaN
10       Action  Adventure    Sci-Fi      NaN      NaN
11       Action  Adventure    Sci-Fi      NaN      NaN

Timings:

#[12000 rows]
genre_list = pd.concat([genre_list]*1000).reset_index(drop=True)

In [115]: %timeit pd.DataFrame(genre_list.values.tolist(), index=genre_list.index).replace({None:np.nan})
100 loops, best of 3: 15.7 ms per loop

In [116]: %timeit df1 = genre_list.apply(pd.Series)
1 loop, best of 3: 1.96 s per loop
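
The %timeit magic is only available in IPython and Jupyter. In a plain Python script, a rough comparison can be sketched with the standard timeit module. The sample data, the helper names via_constructor and via_apply, and the repetition counts are assumptions chosen to keep the run short; absolute numbers will vary with the machine and the pandas version.

```python
import timeit

import pandas as pd

# Hypothetical sample data, repeated to get 1200 rows.
base = pd.Series([
    ["Action", "Adventure", "Fantasy", "Sci-Fi"],
    ["Action", "Thriller"],
    ["Documentary"],
])
genre_list = pd.concat([base] * 400).reset_index(drop=True)


def via_constructor():
    # Build the frame in one call from a plain list of lists.
    return pd.DataFrame(genre_list.values.tolist(), index=genre_list.index)


def via_apply():
    # Build one Series per row, then let pandas align them into a frame.
    return genre_list.apply(pd.Series)


# Total time in seconds for three calls of each approach.
t_constructor = timeit.timeit(via_constructor, number=3)
t_apply = timeit.timeit(via_apply, number=3)
print(f"constructor: {t_constructor:.4f} s, apply: {t_apply:.4f} s")
```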

For "python - Making a pandas DataFrame from a list of lists with different sizes", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/43134198/
