python - Pandas DataFrame: How to extract the last two string-type numbers from a column which doesn't always end with the two numbers

Reposted by: 行者123. Updated: 2023-11-28 21:35:08


I apologize for any confusion the title may cause. Here is what I am trying to do:

I am trying to merge my parcel DataFrame with my municipality-code lookup table. The parcel DataFrame:

df1.head()

    PARID           OWNER1
0   B10 2 1 0131    WILSON ROBERT JR
1   B10 2 18B 0131  COMUNALE MICHAEL J & MARY ANN
2   B10 2 18D 0131  COMUNALE MICHAEL J & MARY ANN
3   B10 2 19F 0131  MONROE & JEFFERSON HOLDINGS LLC
4   B10 4 11 0131   NOEL JAMES H

The municipality-code DataFrame:

df_LU.head()
  PARID  Municipality
0   01  Allen Twp.
1   02  Bangor
2   03  Bath
3   04  Bethlehem
4   05  Bethlehem Twp.

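The last two digits in the first column of df1 ("31" in "B10 2 1 0131") are the municipality code I need in order to merge with the municipality-code DataFrame. However, of my 30,000+ records, about 200 end with a letter, as shown below: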

        PARID           OWNER1  
299    D11 10 10 0131F  HOWARD THEODORE P & CLAUDIA S   
1007    F10 4 3 0134F   KNEEBONE JUDY ANN   
1011    F10 5 2 0134F   KNEEBONE JUDY ANN   
1114    F8 18 10 0626F  KNITTER WILBERT D JR & AMY J    
1115    F8 18 8 0626F   KNITTER DONALD

For these rows, the two digits before the final letter are the code I need to extract (e.g. "31" in "D11 10 10 0131F").

If I simply use pd.DataFrame(df1['PARID'].str[-2:]), I get:

PARID
...
299 1F
...

whereas what I need is:

PARID
...
299 31
...

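The code I wrote to accomplish this is very verbose. It essentially involves:

1. Join all rows that end with 2 digits.
2. Find the indices of the rows whose 'PARID' field ends with a letter.
3. Join the result of step 2 with the municipality lookup DataFrame again.

Here is the code: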

import pandas as pd

# Do the extraction and merge for the rows that end with numbers
# (.copy() avoids a SettingWithCopyWarning on the assignment below)
df_2015 = df1[['PARID', 'OWNER1']].copy()
df_2015['PARID'] = df_2015['PARID'].str[-2:]
df_15r = pd.merge(df_2015, df_LU, how='left', on='PARID')
df_15r

#The pivot result for rows generated from above.
Result15_First = df_15r.groupby('Municipality').count()
Result15_First.to_clipboard()

#Check the ID field for rows that end with letters
check15 = df_2015['PARID'].unique()
check15
C = pd.DataFrame({'ID':check15})
NC = C.dropna()
LNC = NC[NC['ID'].str.endswith('F')]
MNC = NC[NC['ID'].str.endswith('A')]
F = [LNC, MNC]
NNC = pd.concat(F, axis = 0)


s = NNC['ID'].tolist()
s

# Identify the records in s

df_p15 = df_2015.loc[df_2015['PARID'].isin(s)]
df_p15

# Separate out a dataframe with just the rows that end with a letter
df15= df1[['PARID','OWNER1']]
# .copy() avoids a SettingWithCopyWarning when PARID1 is added below
df15c = df15[df15.index.isin(df_p15.index)].copy()
df15c

#This step is to create the look up field from the new data frame, the two numbers before the ending letter.
df15c['PARID1'] = df15c['PARID'].str[-3:-1]
df15c

#Then I will join the look up table
df_15t =df15c.merge(df_LU.set_index('PARID'), left_on = 'PARID1', right_index = True)

df_15b = df_15t.groupby('Municipality').count()
df_15b

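Only after finishing did I realize how verbose my code is for such a seemingly simple task. There is surely a better way to do this; if so, please let me know. Thanks.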

Best answer

You can use the pandas string methods to extract the last two digits:

df1['PARID'].str.extract(r'.*(\d{2})', expand=False)

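Because `.*` is greedy, the capture group matches the last pair of consecutive digits in each string, so the result is a Series holding "31" both for "B10 2 1 0131" and for "D11 10 10 0131F".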
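
To show how the extracted code could feed the merge and the per-municipality count from the question, here is a hedged end-to-end sketch. Several things in it are assumptions rather than part of the original post: the `MUNI_CODE` column name, the placeholder municipality names (the question only shows lookup codes 01 to 05), and the anchored pattern `(\d{2})[A-Za-z]*$`, which is a stricter variation on the answer's pattern that only accepts two digits followed by optional trailing letters.

```python
import pandas as pd

# Sample PARID values and owners copied from the question.
df1 = pd.DataFrame({
    "PARID": ["B10 2 1 0131", "B10 2 18B 0131", "D11 10 10 0131F",
              "F10 4 3 0134F", "F8 18 10 0626F"],
    "OWNER1": ["WILSON ROBERT JR", "COMUNALE MICHAEL J & MARY ANN",
               "HOWARD THEODORE P & CLAUDIA S", "KNEEBONE JUDY ANN",
               "KNITTER WILBERT D JR & AMY J"],
})

# Hypothetical lookup rows: the municipality names are placeholders, not real data.
df_LU = pd.DataFrame({
    "PARID": ["26", "31", "34"],
    "Municipality": ["Placeholder A", "Placeholder B", "Placeholder C"],
})

# Capture two digits that are followed only by optional letters at the end.
df1["MUNI_CODE"] = df1["PARID"].str.extract(r"(\d{2})[A-Za-z]*$", expand=False)

# Rename the lookup key so it does not collide with the full PARID in df1.
lookup = df_LU.rename(columns={"PARID": "MUNI_CODE"})
merged = df1.merge(lookup, how="left", on="MUNI_CODE")

# Count parcels per municipality in a single pass.
counts = merged.groupby("Municipality")["PARID"].count()
print(merged[["PARID", "MUNI_CODE", "Municipality"]])
print(counts)
```

Keeping the codes as strings matters here, since lookup values such as "01" would lose their leading zero if converted to integers. With `how="left"`, any parcel whose code is missing from the lookup table stays in `merged` with a missing `Municipality`, which makes unmatched rows easy to spot.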

This question and its answer come from a similar question on Stack Overflow: https://stackoverflow.com/questions/52601707/
