pandas string contains lookup: NaN leads to Value Error -


if filter rows string in column value, possible use data.sample_id.str.contains('hph') (answered before: check if string in pandas dataframe column in list, or check if string in pandas dataframe).

however, lookup column contains emtpy cells. terefore, str.contains() yields nan values , value error upon indexing.

`valueerror: cannot index vector containing na / nan values`` 

what works:

# runs  mask = [index index, item in enumerate(data.sample_id.values) if 'zent' in str(item)] 

is there more elegant , faster method (similar str.contains()) 1 ?

you can set parameter na in str.contains false:

print (df.a.str.contains('hph', na=false)) 

using edchum sample:

df = pd.dataframe({'a':['hph', np.nan, 'sadhphsad', 'hello']}) print (df)            0        hph 1        nan 2  sadhphsad 3      hello  print (df.a.str.contains('hph', na=false)) 0     true 1    false 2     true 3    false name: a, dtype: bool 

Comments

Popular posts from this blog

Spring Boot + JPA + Hibernate: Unable to locate persister -

go - Golang: panic: runtime error: invalid memory address or nil pointer dereference using bufio.Scanner -

c - double free or corruption (fasttop) -