

Common pandas data-analysis methods: a summary (swap in your own dataset as needed)

Published: 2023/12/8

1. Import libraries

# Import libraries
import numpy as np
import pandas as pd

2. Inspect the data

# Read the file
data = pd.read_csv('titanic_data.csv')
data

    | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked
0   | 1 | 0 | 3 | Braund, Mr. Owen Harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S
1   | 2 | 1 | 1 | Cumings, Mrs. John Bradley (Florence Briggs Th... | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C
2   | 3 | 1 | 3 | Heikkinen, Miss. Laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S
3   | 4 | 1 | 1 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S
4   | 5 | 0 | 3 | Allen, Mr. William Henry | male | 35.0 | 0 | 0 | 373450 | 8.0500 | NaN | S
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
886 | 887 | 0 | 2 | Montvila, Rev. Juozas | male | 27.0 | 0 | 0 | 211536 | 13.0000 | NaN | S
887 | 888 | 1 | 1 | Graham, Miss. Margaret Edith | female | 19.0 | 0 | 0 | 112053 | 30.0000 | B42 | S
888 | 889 | 0 | 3 | Johnston, Miss. Catherine Helen "Carrie" | female | NaN | 1 | 2 | W./C. 6607 | 23.4500 | NaN | S
889 | 890 | 1 | 1 | Behr, Mr. Karl Howell | male | 26.0 | 0 | 0 | 111369 | 30.0000 | C148 | C
890 | 891 | 0 | 3 | Dooley, Mr. Patrick | male | 32.0 | 0 | 0 | 370376 | 7.7500 | NaN | Q

891 rows × 12 columns

# Check the dimensions of the data
data.shape
(891, 12)

# Check basic information about the data
data.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 891 entries, 0 to 890
Data columns (total 12 columns):
PassengerId    891 non-null int64
Survived       891 non-null int64
Pclass         891 non-null int64
Name           891 non-null object
Sex            891 non-null object
Age            714 non-null float64
SibSp          891 non-null int64
Parch          891 non-null int64
Ticket         891 non-null object
Fare           891 non-null float64
Cabin          204 non-null object
Embarked       889 non-null object
dtypes: float64(2), int64(5), object(5)
memory usage: 83.6+ KB

# Count null values per column. Run the chained calls one at a time to see what each step does.
data.isnull().sum().sort_values(ascending=False)
Cabin          687
Age            177
Embarked         2
Fare             0
Ticket           0
Parch            0
SibSp            0
Sex              0
Name             0
Pclass           0
Survived         0
PassengerId      0
dtype: int64

# Check the data type of every column
data.dtypes
PassengerId      int64
Survived         int64
Pclass           int64
Name            object
Sex             object
Age            float64
SibSp            int64
Parch            int64
Ticket          object
Fare           float64
Cabin           object
Embarked        object
dtype: object

# Check the data type of one specific column
data['Sex'].dtype
dtype('O')

# Check for null values across the whole table
data.isnull()

    | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked
0   | False | False | False | False | False | False | False | False | False | False | True | False
1   | False | False | False | False | False | False | False | False | False | False | False | False
2   | False | False | False | False | False | False | False | False | False | False | True | False
3   | False | False | False | False | False | False | False | False | False | False | False | False
4   | False | False | False | False | False | False | False | False | False | False | True | False
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
886 | False | False | False | False | False | False | False | False | False | False | True | False
887 | False | False | False | False | False | False | False | False | False | False | False | False
888 | False | False | False | False | False | True | False | False | False | False | True | False
889 | False | False | False | False | False | False | False | False | False | False | False | False
890 | False | False | False | False | False | False | False | False | False | False | True | False

891 rows × 12 columns
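The null-count call above can be extended into a small per-column report. The following is only a sketch on a hypothetical toy frame (`df` and its values are made up for illustration, not taken from the Titanic file); it relies on the fact that the mean of a boolean column is the fraction of True values.

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame standing in for the Titanic data
df = pd.DataFrame({
    "Age": [22.0, np.nan, 26.0, np.nan],
    "Cabin": [np.nan, "C85", np.nan, np.nan],
    "Fare": [7.25, 71.28, 7.92, 8.05],
})

missing_count = df.isnull().sum()   # number of nulls per column
missing_ratio = df.isnull().mean()  # fraction of nulls per column

# Combine both views and put the worst columns first
summary = pd.DataFrame({"missing": missing_count, "ratio": missing_ratio})
summary = summary.sort_values("missing", ascending=False)
print(summary)
```

Seeing the ratio next to the raw count makes it easier to judge whether a column such as Cabin is worth filling at all.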

# Check for null values in one specific column
data['Age'].isnull()
0      False
1      False
2      False
3      False
4      False
5       True
6      False
       ...
888     True
889    False
890    False
Name: Age, Length: 891, dtype: bool

# List the distinct values that occur in a column
data['Parch'].unique()
array([0, 1, 2, 5, 3, 4, 6], dtype=int64)

# Show how often each distinct value occurs in a column
data['Parch'].value_counts(dropna=False)  # dropna=False keeps null values in the count
0    678
1    118
2     80
5      5
3      5
4      4
6      1
Name: Parch, dtype: int64

# View the table's values only, without the header and the index
data.values
array([[1, 0, 3, ..., 7.25, nan, 'S'],
       [2, 1, 1, ..., 71.2833, 'C85', 'C'],
       [3, 1, 3, ..., 7.925, nan, 'S'],
       ...,
       [889, 0, 3, ..., 23.45, nan, 'S'],
       [890, 1, 1, ..., 30.0, 'C148', 'C'],
       [891, 0, 3, ..., 7.75, nan, 'Q']], dtype=object)

# View the first few rows
data.head(4)

  | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked
0 | 1 | 0 | 3 | Braund, Mr. Owen Harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S
1 | 2 | 1 | 1 | Cumings, Mrs. John Bradley (Florence Briggs Th... | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C
2 | 3 | 1 | 3 | Heikkinen, Miss. Laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S
3 | 4 | 1 | 1 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S

# View the last few rows
data.tail(4)

    | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked
887 | 888 | 1 | 1 | Graham, Miss. Margaret Edith | female | 19.0 | 0 | 0 | 112053 | 30.00 | B42 | S
888 | 889 | 0 | 3 | Johnston, Miss. Catherine Helen "Carrie" | female | NaN | 1 | 2 | W./C. 6607 | 23.45 | NaN | S
889 | 890 | 1 | 1 | Behr, Mr. Karl Howell | male | 26.0 | 0 | 0 | 111369 | 30.00 | C148 | C
890 | 891 | 0 | 3 | Dooley, Mr. Patrick | male | 32.0 | 0 | 0 | 370376 | 7.75 | NaN | Q

3. Clean the data

# ----- Everything above inspects the data -----
# ----- Everything below cleans the data -----

# Fill null values with 0
data['Age'].fillna(value=0)
0      22.0
1      38.0
2      26.0
3      35.0
4      35.0
5       0.0
6      54.0
       ...
888     0.0
889    26.0
890    32.0
Name: Age, Length: 891, dtype: float64

# Fill each null with the value before it (the table itself is not changed)
data['Age'].fillna(method='ffill')
0      22.0
1      38.0
2      26.0
3      35.0
4      35.0
5      35.0
6      54.0
       ...
888    19.0
889    26.0
890    32.0
Name: Age, Length: 891, dtype: float64

# Fill each null with the value after it (the table itself is not changed)
data['Age'].fillna(method='bfill')
0      22.0
1      38.0
2      26.0
3      35.0
4      35.0
5      54.0
6      54.0
       ...
888    26.0
889    26.0
890    32.0
Name: Age, Length: 891, dtype: float64

# Look at the original data again
data['Age']
0      22.0
1      38.0
2      26.0
3      35.0
4      35.0
5       NaN
6      54.0
       ...
888     NaN
889    26.0
890    32.0
Name: Age, Length: 891, dtype: float64

# The nulls are still nulls: none of the calls above touched the original data.
# fillna has an inplace parameter that defaults to False, which leaves the original data unchanged.
# To modify the original data, set it to True.

# Fill the nulls in the original data with the column mean
# (on recent pandas versions, assigning the result back is the more reliable form:
#  data['Age'] = data['Age'].fillna(data['Age'].mean()))
data['Age'].fillna(data['Age'].mean(),inplace=True)
data['Age']
0      22.000000
1      38.000000
2      26.000000
3      35.000000
4      35.000000
5      29.699118
6      54.000000
         ...
888    29.699118
889    26.000000
890    32.000000
Name: Age, Length: 891, dtype: float64

# Strip leading and trailing whitespace from a string
' a bb'.strip()
'a bb'

# data['a'] = data['a'].map(str.strip) applies strip to every string in column a.
# Pass the function itself (str.strip), not a call (str.strip()).
data['Name']=data['Name'].map(str.strip)
data['Name']
0                                Braund, Mr. Owen Harris
1      Cumings, Mrs. John Bradley (Florence Briggs Th...
2                                 Heikkinen, Miss. Laina
3           Futrelle, Mrs. Jacques Heath (Lily May Peel)
4                               Allen, Mr. William Henry
                             ...
886                                Montvila, Rev. Juozas
887                         Graham, Miss. Margaret Edith
888             Johnston, Miss. Catherine Helen "Carrie"
889                                Behr, Mr. Karl Howell
890                                  Dooley, Mr. Patrick
Name: Name, Length: 891, dtype: object

# Convert to lower case
data['Name']=data['Name'].str.lower()
data['Name']
0                                braund, mr. owen harris
1      cumings, mrs. john bradley (florence briggs th...
2                                 heikkinen, miss. laina
3           futrelle, mrs. jacques heath (lily may peel)
4                               allen, mr. william henry
                             ...
886                                montvila, rev. juozas
887                         graham, miss. margaret edith
888             johnston, miss. catherine helen "carrie"
889                                behr, mr. karl howell
890                                  dooley, mr. patrick
Name: Name, Length: 891, dtype: object

# Change the data type of a column
data['Survived'].dtype
dtype('int64')

data['Survived'].astype('str')
0      0
1      1
2      1
3      1
4      0
      ..
886    0
887    1
888    0
889    1
890    0
Name: Survived, Length: 891, dtype: object

# Rename columns ('性別' means "sex", '姓名' means "name")
data.rename(columns={'Sex':'性別 ','Name':'姓名'}).head()

  | PassengerId | Survived | Pclass | 姓名 | 性別 | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked
0 | 1 | 0 | 3 | braund, mr. owen harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S
1 | 2 | 1 | 1 | cumings, mrs. john bradley (florence briggs th... | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C
2 | 3 | 1 | 3 | heikkinen, miss. laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S
3 | 4 | 1 | 1 | futrelle, mrs. jacques heath (lily may peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S
4 | 5 | 0 | 3 | allen, mr. william henry | male | 35.0 | 0 | 0 | 373450 | 8.0500 | NaN | S

# Print the column names (rename also returns a new table and leaves the original unchanged)
data.columns
Index(['PassengerId', 'Survived', 'Pclass', 'Name', 'Sex', 'Age', 'SibSp',
       'Parch', 'Ticket', 'Fare', 'Cabin', 'Embarked'],
      dtype='object')

# Drop duplicate values in a column (the first occurrence is kept, later ones are dropped)
data['Sex'].drop_duplicates()
0      male
1    female
Name: Sex, dtype: object

# Drop duplicate values in a column (the last occurrence is kept, earlier ones are dropped)
data['Sex'].drop_duplicates(keep='last')
888    female
890      male
Name: Sex, dtype: object

# Replace values
data['Survived'].replace(0,'died')
0      died
1         1
2         1
3         1
4      died
       ...
886    died
887       1
888    died
889       1
890    died
Name: Survived, Length: 891, dtype: object
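Because most of the cleaning calls above return a new object and leave the table alone, a common alternative to inplace=True is to assign the result back to the column. The sketch below uses a hypothetical five-value Age column (not the Titanic file) and the Series.ffill() and Series.bfill() methods, which, as far as I know, are the spellings recent pandas releases prefer over fillna(method=...).

```python
import numpy as np
import pandas as pd

# Hypothetical toy column with two missing ages
df = pd.DataFrame({"Age": [22.0, np.nan, 26.0, np.nan, 35.0]})

filled_zero = df["Age"].fillna(0)   # nulls replaced by 0
filled_ffill = df["Age"].ffill()    # nulls replaced by the previous value
filled_bfill = df["Age"].bfill()    # nulls replaced by the next value

# None of the calls above changed df itself
nulls_before = int(df["Age"].isnull().sum())

# Assign the result back to change the table; no inplace=True needed
df["Age"] = df["Age"].fillna(df["Age"].mean())
nulls_after = int(df["Age"].isnull().sum())

print(nulls_before, nulls_after)
print(df["Age"].tolist())
```

Assigning back keeps the "what changed" step visible in the code, and it does not depend on whether a column selection is a view or a copy.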

4. Merge data

# ----- Merging data -----
data1 = pd.DataFrame({'PassengerId':[9,19,29,39999],
                      'Sur':[0,1,0,1],
                      'Name':['xiaoming','xiaohong','xiaozhang','xiaozhao'],
                      'time':[10.0,22.0,30.0,40.0]})
data1

  | PassengerId | Sur | Name | time
0 | 9 | 0 | xiaoming | 10.0
1 | 19 | 1 | xiaohong | 22.0
2 | 29 | 0 | xiaozhang | 30.0
3 | 39999 | 1 | xiaozhao | 40.0
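# Merge two tables with merge
# Join on the 'PassengerId' field and keep only the keys present in both tables (inner join)
data_inner = pd.merge(data,data1,on='PassengerId',how='inner')
data_inner

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0
1 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0
2 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0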
# Left join: keep every row of data; columns from data1 are NaN where there is no match
data_left = pd.merge(data,data1,on='PassengerId',how='left')
data_left.head(30)

    | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0   | 1 | 0 | 3 | braund, mr. owen harris | male | 22.000000 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S | NaN | NaN | NaN
1   | 2 | 1 | 1 | cumings, mrs. john bradley (florence briggs th... | female | 38.000000 | 1 | 0 | PC 17599 | 71.2833 | C85 | C | NaN | NaN | NaN
2   | 3 | 1 | 3 | heikkinen, miss. laina | female | 26.000000 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S | NaN | NaN | NaN
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
8   | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0.0 | xiaoming | 10.0
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
18  | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1.0 | xiaohong | 22.0
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
28  | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0.0 | xiaozhang | 30.0
29  | 30 | 0 | 3 | todoroff, mr. lalio | male | 29.699118 | 0 | 0 | 349216 | 7.8958 | NaN | S | NaN | NaN | NaN
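# Right join: keep every row of data1; columns from data are NaN where there is no match
data_right = pd.merge(data,data1,on='PassengerId',how='right')
data_right

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0 | 9 | 1.0 | 3.0 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0.0 | 2.0 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0
1 | 19 | 0.0 | 3.0 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1.0 | 0.0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0
2 | 29 | 1.0 | 3.0 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0.0 | 0.0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0
3 | 39999 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | xiaozhao | 40.0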
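To see the join types side by side, here is a minimal sketch that counts the rows each how value produces. The two frames (`left`, `right`) are hypothetical three-row stand-ins, not the Titanic table and data1 themselves.

```python
import pandas as pd

# Hypothetical miniature tables sharing the key column 'PassengerId'
left = pd.DataFrame({"PassengerId": [1, 9, 19], "Fare": [7.25, 11.13, 18.0]})
right = pd.DataFrame({"PassengerId": [9, 19, 39999], "time": [10.0, 22.0, 40.0]})

# Row count of the merged result for each join type
rows = {
    how: len(pd.merge(left, right, on="PassengerId", how=how))
    for how in ["inner", "left", "right", "outer"]
}
print(rows)

# Unmatched keys get NaN in the columns that come from the other table
right_joined = pd.merge(left, right, on="PassengerId", how="right")
print(right_joined)
```

Keys 9 and 19 appear in both tables, key 1 only on the left and key 39999 only on the right, which is why the inner join has the fewest rows and the outer join the most.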
# Full (outer) join: keep the rows of both tables
data_outer = pd.merge(data,data1,on='PassengerId',how='outer')
data_outer.head(20)  # print the first 20 rows

    | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0   | 1 | 0.0 | 3.0 | braund, mr. owen harris | male | 22.000000 | 1.0 | 0.0 | A/5 21171 | 7.2500 | NaN | S | NaN | NaN | NaN
1   | 2 | 1.0 | 1.0 | cumings, mrs. john bradley (florence briggs th... | female | 38.000000 | 1.0 | 0.0 | PC 17599 | 71.2833 | C85 | C | NaN | NaN | NaN
2   | 3 | 1.0 | 3.0 | heikkinen, miss. laina | female | 26.000000 | 0.0 | 0.0 | STON/O2. 3101282 | 7.9250 | NaN | S | NaN | NaN | NaN
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
8   | 9 | 1.0 | 3.0 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0.0 | 2.0 | 347742 | 11.1333 | NaN | S | 0.0 | xiaoming | 10.0
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
18  | 19 | 0.0 | 3.0 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1.0 | 0.0 | 345763 | 18.0000 | NaN | S | 1.0 | xiaohong | 22.0
19  | 20 | 1.0 | 3.0 | masselmani, mrs. fatima | female | 29.699118 | 0.0 | 0.0 | 2649 | 7.2250 | NaN | C | NaN | NaN | NaN

data_outer.tail(20)  # print the last 20 rows

    | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
872 | 873 | 0.0 | 1.0 | carlsson, mr. frans olof | male | 33.000000 | 0.0 | 0.0 | 695 | 5.0000 | B51 B53 B55 | S | NaN | NaN | NaN
873 | 874 | 0.0 | 3.0 | vander cruyssen, mr. victor | male | 47.000000 | 0.0 | 0.0 | 345765 | 9.0000 | NaN | S | NaN | NaN | NaN
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ...
889 | 890 | 1.0 | 1.0 | behr, mr. karl howell | male | 26.000000 | 0.0 | 0.0 | 111369 | 30.0000 | C148 | C | NaN | NaN | NaN
890 | 891 | 0.0 | 3.0 | dooley, mr. patrick | male | 32.000000 | 0.0 | 0.0 | 370376 | 7.7500 | NaN | Q | NaN | NaN | NaN
891 | 39999 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | xiaozhao | 40.0
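In an outer join it can be hard to tell which table each row came from, and the automatic Name_x / Name_y suffixes are not very descriptive. As a sketch on hypothetical toy frames, the indicator and suffixes parameters of pd.merge address both points; the suffix strings chosen here are arbitrary.

```python
import pandas as pd

# Hypothetical miniature tables; both have a 'Name' column, so suffixes apply
left = pd.DataFrame({"PassengerId": [1, 9, 19],
                     "Name": ["braund", "johnson", "vander planke"]})
right = pd.DataFrame({"PassengerId": [9, 19, 39999],
                      "Name": ["xiaoming", "xiaohong", "xiaozhao"]})

merged = pd.merge(
    left, right,
    on="PassengerId",
    how="outer",
    suffixes=("_titanic", "_extra"),  # replaces the default _x / _y
    indicator=True,                   # adds a '_merge' column
)

# '_merge' is one of 'left_only', 'right_only', 'both'
only_right = merged[merged["_merge"] == "right_only"]
print(merged)
print(only_right["PassengerId"].tolist())
```

Filtering on the _merge column is a quick way to list keys that failed to match, such as the made-up PassengerId 39999.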
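# Append rows
# Note: DataFrame.append was removed in pandas 2.0; pd.concat([data1, data2]) is its replacement.
data2 = pd.DataFrame({'id':[1,2,3],'Sex':['female','male','male']})
result = data1.append(data2)
result

  | Name | PassengerId | Sex | Sur | id | time
0 | xiaoming | 9.0 | NaN | 0.0 | NaN | 10.0
1 | xiaohong | 19.0 | NaN | 1.0 | NaN | 22.0
2 | xiaozhang | 29.0 | NaN | 0.0 | NaN | 30.0
3 | xiaozhao | 39999.0 | NaN | 1.0 | NaN | 40.0
0 | NaN | NaN | female | NaN | 1.0 | NaN
1 | NaN | NaN | male | NaN | 2.0 | NaN
2 | NaN | NaN | male | NaN | 3.0 | NaN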
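On pandas 2.0 and later, the same row-wise append can be written with pd.concat. This is a sketch with shortened, hypothetical versions of the two frames; ignore_index=True is an optional choice made here to avoid the repeated 0, 1, 2 index labels seen in the output above.

```python
import pandas as pd

# Shortened, hypothetical versions of the two tables
data1 = pd.DataFrame({"PassengerId": [9, 19], "Sur": [0, 1]})
data2 = pd.DataFrame({"id": [1, 2, 3], "Sex": ["female", "male", "male"]})

# Stack the rows; columns missing from one table are filled with NaN
result = pd.concat([data1, data2], ignore_index=True, sort=False)
print(result)
```

Since the two tables share no columns, every row has NaN in the columns that belong to the other table, just as in the append output.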
# Use a column as the index
data_inner.set_index('time')

time | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y
10.0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming
22.0 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong
30.0 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang

# Reset the index
data_inner.reset_index(drop=True)

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0
1 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0
2 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0

# Sort by the values of a given column
data_inner.sort_values(by=['Fare'])

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
2 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0
0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0
1 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0

# Go back to sorting by index
data_inner.sort_index()

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time
0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0
1 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0
2 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0

# Create a new column that labels each row by comparing a value against a threshold
data_inner['level']=np.where(data_inner['Fare']>10.0,'good','oh')
data_inner

  | PassengerId | Survived | Pclass | Name_x | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Sur | Name_y | time | level
0 | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | 0 | xiaoming | 10.0 | good
1 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | 1 | xiaohong | 22.0 | good
2 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | 0 | xiaozhang | 30.0 | oh
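np.where handles a two-way label. When more than two labels are needed, np.select is one option. The sketch below uses three fares taken from the rows above; the second threshold (20.0) and the band names are assumptions made purely for illustration.

```python
import numpy as np
import pandas as pd

# Three fares taken from the merged rows above
df = pd.DataFrame({"Fare": [21.075, 16.7, 8.05]})

# Two-way label, as in the example above
df["level"] = np.where(df["Fare"] > 10.0, "good", "oh")

# Three-way label: conditions are checked in order, the first match wins
conditions = [df["Fare"] > 20.0, df["Fare"] > 10.0]
choices = ["high", "mid"]
df["band"] = np.select(conditions, choices, default="low")

print(df)
```

Because the conditions are evaluated in order, the stricter test (Fare > 20.0) has to come first; otherwise every fare above 10.0 would be labelled "mid".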
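# Create a new column that flags the rows meeting several conditions at once
data.loc[(data['Sex']=='male') & (data['Age']<=10.0),'tip']=1
data.head(50)

   | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | tip
0  | 1 | 0 | 3 | braund, mr. owen harris | male | 22.000000 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S | NaN
1  | 2 | 1 | 1 | cumings, mrs. john bradley (florence briggs th... | female | 38.000000 | 1 | 0 | PC 17599 | 71.2833 | C85 | C | NaN
2  | 3 | 1 | 3 | heikkinen, miss. laina | female | 26.000000 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S | NaN
3  | 4 | 1 | 1 | futrelle, mrs. jacques heath (lily may peel) | female | 35.000000 | 1 | 0 | 113803 | 53.1000 | C123 | S | NaN
4  | 5 | 0 | 3 | allen, mr. william henry | male | 35.000000 | 0 | 0 | 373450 | 8.0500 | NaN | S | NaN
5  | 6 | 0 | 3 | moran, mr. james | male | 29.699118 | 0 | 0 | 330877 | 8.4583 | NaN | Q | NaN
6  | 7 | 0 | 1 | mccarthy, mr. timothy j | male | 54.000000 | 0 | 0 | 17463 | 51.8625 | E46 | S | NaN
7  | 8 | 0 | 3 | palsson, master. gosta leonard | male | 2.000000 | 3 | 1 | 349909 | 21.0750 | NaN | S | 1.0
8  | 9 | 1 | 3 | johnson, mrs. oscar w (elisabeth vilhelmina berg) | female | 27.000000 | 0 | 2 | 347742 | 11.1333 | NaN | S | NaN
9  | 10 | 1 | 2 | nasser, mrs. nicholas (adele achem) | female | 14.000000 | 1 | 0 | 237736 | 30.0708 | NaN | C | NaN
10 | 11 | 1 | 3 | sandstrom, miss. marguerite rut | female | 4.000000 | 1 | 1 | PP 9549 | 16.7000 | G6 | S | NaN
11 | 12 | 1 | 1 | bonnell, miss. elizabeth | female | 58.000000 | 0 | 0 | 113783 | 26.5500 | C103 | S | NaN
12 | 13 | 0 | 3 | saundercock, mr. william henry | male | 20.000000 | 0 | 0 | A/5. 2151 | 8.0500 | NaN | S | NaN
13 | 14 | 0 | 3 | andersson, mr. anders johan | male | 39.000000 | 1 | 5 | 347082 | 31.2750 | NaN | S | NaN
14 | 15 | 0 | 3 | vestrom, miss. hulda amanda adolfina | female | 14.000000 | 0 | 0 | 350406 | 7.8542 | NaN | S | NaN
15 | 16 | 1 | 2 | hewlett, mrs. (mary d kingcome) | female | 55.000000 | 0 | 0 | 248706 | 16.0000 | NaN | S | NaN
16 | 17 | 0 | 3 | rice, master. eugene | male | 2.000000 | 4 | 1 | 382652 | 29.1250 | NaN | Q | 1.0
17 | 18 | 1 | 2 | williams, mr. charles eugene | male | 29.699118 | 0 | 0 | 244373 | 13.0000 | NaN | S | NaN
18 | 19 | 0 | 3 | vander planke, mrs. julius (emelia maria vande... | female | 31.000000 | 1 | 0 | 345763 | 18.0000 | NaN | S | NaN
19 | 20 | 1 | 3 | masselmani, mrs. fatima | female | 29.699118 | 0 | 0 | 2649 | 7.2250 | NaN | C | NaN
20 | 21 | 0 | 2 | fynney, mr. joseph j | male | 35.000000 | 0 | 0 | 239865 | 26.0000 | NaN | S | NaN
21 | 22 | 1 | 2 | beesley, mr. lawrence | male | 34.000000 | 0 | 0 | 248698 | 13.0000 | D56 | S | NaN
22 | 23 | 1 | 3 | mcgowan, miss. anna "annie" | female | 15.000000 | 0 | 0 | 330923 | 8.0292 | NaN | Q | NaN
23 | 24 | 1 | 1 | sloper, mr. william thompson | male | 28.000000 | 0 | 0 | 113788 | 35.5000 | A6 | S | NaN
24 | 25 | 0 | 3 | palsson, miss. torborg danira | female | 8.000000 | 3 | 1 | 349909 | 21.0750 | NaN | S | NaN
25 | 26 | 1 | 3 | asplund, mrs. carl oscar (selma augusta emilia... | female | 38.000000 | 1 | 5 | 347077 | 31.3875 | NaN | S | NaN
26 | 27 | 0 | 3 | emir, mr. farred chehab | male | 29.699118 | 0 | 0 | 2631 | 7.2250 | NaN | C | NaN
27 | 28 | 0 | 1 | fortune, mr. charles alexander | male | 19.000000 | 3 | 2 | 19950 | 263.0000 | C23 C25 C27 | S | NaN
28 | 29 | 1 | 3 | o'dwyer, miss. ellen "nellie" | female | 29.699118 | 0 | 0 | 330959 | 7.8792 | NaN | Q | NaN
29 | 30 | 0 | 3 | todoroff, mr. lalio | male | 29.699118 | 0 | 0 | 349216 | 7.8958 | NaN | S | NaN
30 | 31 | 0 | 1 | uruchurtu, don. manuel e | male | 40.000000 | 0 | 0 | PC 17601 | 27.7208 | NaN | C | NaN
31 | 32 | 1 | 1 | spencer, mrs. william augustus (marie eugenie) | female | 29.699118 | 1 | 0 | PC 17569 | 146.5208 | B78 | C | NaN
32 | 33 | 1 | 3 | glynn, miss. mary agatha | female | 29.699118 | 0 | 0 | 335677 | 7.7500 | NaN | Q | NaN
33 | 34 | 0 | 2 | wheadon, mr. edward h | male | 66.000000 | 0 | 0 | C.A. 24579 | 10.5000 | NaN | S | NaN
34 | 35 | 0 | 1 | meyer, mr. edgar joseph | male | 28.000000 | 1 | 0 | PC 17604 | 82.1708 | NaN | C | NaN
35 | 36 | 0 | 1 | holverson, mr. alexander oskar | male | 42.000000 | 1 | 0 | 113789 | 52.0000 | NaN | S | NaN
36 | 37 | 1 | 3 | mamee, mr. hanna | male | 29.699118 | 0 | 0 | 2677 | 7.2292 | NaN | C | NaN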
3803cann, mr. ernest charlesmale21.00000000A./5. 21528.0500NaNSNaN
3903vander planke, miss. augusta mariafemale18.0000002034576418.0000NaNSNaN
4013nicola-yarred, miss. jamilafemale14.00000010265111.2417NaNCNaN
4103ahlin, mrs. johan (johanna persdotter larsson)female40.0000001075469.4750NaNSNaN
4202turpin, mrs. william john robert (dorothy ann ...female27.000000101166821.0000NaNSNaN
4303kraeff, mr. theodormale29.699118003492537.8958NaNCNaN
4412laroche, miss. simonne marie anne andreefemale3.00000012SC/Paris 212341.5792NaNCNaN
4513devaney, miss. margaret deliafemale19.000000003309587.8792NaNQNaN
4603rogers, mr. william johnmale29.69911800S.C./A.4. 235678.0500NaNSNaN
4703lennon, mr. denismale29.6991181037037115.5000NaNQNaN
4813o'driscoll, miss. bridgetfemale29.69911800143117.7500NaNQNaN
4903samaan, mr. youssefmale29.69911820266221.6792NaNCNaN
5003arnold-franchi, mrs. josef (josefine franchi)female18.0000001034923717.8000NaNSNaN
# Split one column into several columns
# Split the Titanic Name column on '.': the text before the first period ("surname, title") goes into the first new column,
# the text after it (the given names) goes into the second, and the third column 's' holds anything after a second period
data_namesplit = pd.DataFrame((x.split('.') for x in data['Name']),index=data.index,columns=['名','姓','s'])
data_namesplit

名  姓  s
braund, mrowen harrisNone
cumings, mrsjohn bradley (florence briggs thayer)None
heikkinen, misslainaNone
futrelle, mrsjacques heath (lily may peel)None
allen, mrwilliam henryNone
moran, mrjamesNone
mccarthy, mrtimothy jNone
palsson, mastergosta leonardNone
johnson, mrsoscar w (elisabeth vilhelmina berg)None
nasser, mrsnicholas (adele achem)None
sandstrom, missmarguerite rutNone
bonnell, misselizabethNone
saundercock, mrwilliam henryNone
andersson, mranders johanNone
vestrom, misshulda amanda adolfinaNone
hewlett, mrs(mary d kingcome)None
rice, mastereugeneNone
williams, mrcharles eugeneNone
vander planke, mrsjulius (emelia maria vandemoortele)None
masselmani, mrsfatimaNone
fynney, mrjoseph jNone
beesley, mrlawrenceNone
mcgowan, missanna "annie"None
sloper, mrwilliam thompsonNone
palsson, misstorborg daniraNone
asplund, mrscarl oscar (selma augusta emilia johansson)None
emir, mrfarred chehabNone
fortune, mrcharles alexanderNone
o'dwyer, missellen "nellie"None
todoroff, mrlalioNone
.........
giles, mrfrederick edwardNone
swift, mrsfrederick joel (margaret welles barron)None
sage, missdorothy edith "dolly"None
gill, mrjohn williamNone
bystrom, mrs(karolina)None
duran y more, missasuncionNone
roebling, mrwashington augustus iiNone
van melkebeke, mrphilemonNone
johnson, masterharold theodorNone
balkic, mrcerinNone
beckwith, mrsrichard leonard (sallie monypeny)None
carlsson, mrfrans olofNone
vander cruyssen, mrvictorNone
abelson, mrssamuel (hannah wizosky)None
najib, missadele kiamie "jane"None
gustafsson, mralfred ossianNone
petroff, mrnedelioNone
laleff, mrkristoNone
potter, mrsthomas jr (lily alexenia wilson)None
shelley, mrswilliam (imanita parrish hall)None
markun, mrjohannNone
dahlberg, missgerda ulrikaNone
banfield, mrfrederick jamesNone
sutehall, mrhenry jrNone
rice, mrswilliam (margaret norton)None
montvila, revjuozasNone
graham, missmargaret edithNone
johnston, misscatherine helen "carrie"None
behr, mrkarl howellNone
dooley, mrpatrickNone

891 rows × 3 columns
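The split above cuts each name at every period. As a hedged alternative sketch (not the author's method), the vectorized Series.str.split with n=1 and expand=True can separate surname, title, and given names in two passes. The sample names are taken from the table above; the column names surname, title, and given are assumptions.

```python
import pandas as pd

# Three names copied from the table above, used as a stand-in for data['Name'].
names = pd.Series([
    "braund, mr. owen harris",
    "heikkinen, miss. laina",
    "o'dwyer, miss. ellen \"nellie\"",
])

# First pass: split once on the comma, surname on the left.
parts = names.str.split(",", n=1, expand=True)

# Second pass: split the remainder once on the first period to isolate the title.
rest = parts[1].str.split(".", n=1, expand=True)

# Hypothetical column names chosen for this illustration.
result = pd.DataFrame({
    "surname": parts[0].str.strip(),
    "title": rest[0].str.strip(),
    "given": rest[1].str.strip(),
})
print(result)
```

Limiting each split to one cut (n=1) keeps any later periods inside the given-names column, so no overflow column is needed.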

# Merge the newly split columns back into the original table; axis=1 concatenates column-wise (side by side), aligning rows on the index
data = pd.concat([data_namesplit,data],axis = 1)
data

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
braund, mrowen harrisNone103braund, mr. owen harrismale22.00000010A/5 211717.2500NaNSNaN
cumings, mrsjohn bradley (florence briggs thayer)None211cumings, mrs. john bradley (florence briggs th...female38.00000010PC 1759971.2833C85CNaN
heikkinen, misslainaNone313heikkinen, miss. lainafemale26.00000000STON/O2. 31012827.9250NaNSNaN
futrelle, mrsjacques heath (lily may peel)None411futrelle, mrs. jacques heath (lily may peel)female35.0000001011380353.1000C123SNaN
allen, mrwilliam henryNone503allen, mr. william henrymale35.000000003734508.0500NaNSNaN
moran, mrjamesNone603moran, mr. jamesmale29.699118003308778.4583NaNQNaN
mccarthy, mrtimothy jNone701mccarthy, mr. timothy jmale54.000000001746351.8625E46SNaN
palsson, mastergosta leonardNone803palsson, master. gosta leonardmale2.0000003134990921.0750NaNS1.0
johnson, mrsoscar w (elisabeth vilhelmina berg)None913johnson, mrs. oscar w (elisabeth vilhelmina berg)female27.0000000234774211.1333NaNSNaN
nasser, mrsnicholas (adele achem)None1012nasser, mrs. nicholas (adele achem)female14.0000001023773630.0708NaNCNaN
sandstrom, missmarguerite rutNone1113sandstrom, miss. marguerite rutfemale4.00000011PP 954916.7000G6SNaN
bonnell, misselizabethNone1211bonnell, miss. elizabethfemale58.0000000011378326.5500C103SNaN
saundercock, mrwilliam henryNone1303saundercock, mr. william henrymale20.00000000A/5. 21518.0500NaNSNaN
andersson, mranders johanNone1403andersson, mr. anders johanmale39.0000001534708231.2750NaNSNaN
vestrom, misshulda amanda adolfinaNone1503vestrom, miss. hulda amanda adolfinafemale14.000000003504067.8542NaNSNaN
hewlett, mrs(mary d kingcome)None1612hewlett, mrs. (mary d kingcome)female55.0000000024870616.0000NaNSNaN
rice, mastereugeneNone1703rice, master. eugenemale2.0000004138265229.1250NaNQ1.0
williams, mrcharles eugeneNone1812williams, mr. charles eugenemale29.6991180024437313.0000NaNSNaN
vander planke, mrsjulius (emelia maria vandemoortele)None1903vander planke, mrs. julius (emelia maria vande...female31.0000001034576318.0000NaNSNaN
masselmani, mrsfatimaNone2013masselmani, mrs. fatimafemale29.6991180026497.2250NaNCNaN
fynney, mrjoseph jNone2102fynney, mr. joseph jmale35.0000000023986526.0000NaNSNaN
beesley, mrlawrenceNone2212beesley, mr. lawrencemale34.0000000024869813.0000D56SNaN
mcgowan, missanna "annie"None2313mcgowan, miss. anna "annie"female15.000000003309238.0292NaNQNaN
sloper, mrwilliam thompsonNone2411sloper, mr. william thompsonmale28.0000000011378835.5000A6SNaN
palsson, misstorborg daniraNone2503palsson, miss. torborg danirafemale8.0000003134990921.0750NaNSNaN
asplund, mrscarl oscar (selma augusta emilia johansson)None2613asplund, mrs. carl oscar (selma augusta emilia...female38.0000001534707731.3875NaNSNaN
emir, mrfarred chehabNone2703emir, mr. farred chehabmale29.6991180026317.2250NaNCNaN
fortune, mrcharles alexanderNone2801fortune, mr. charles alexandermale19.0000003219950263.0000C23 C25 C27SNaN
o'dwyer, missellen "nellie"None2913o'dwyer, miss. ellen "nellie"female29.699118003309597.8792NaNQNaN
todoroff, mrlalioNone3003todoroff, mr. laliomale29.699118003492167.8958NaNSNaN
................................................
giles, mrfrederick edwardNone86202giles, mr. frederick edwardmale21.000000102813411.5000NaNSNaN
swift, mrsfrederick joel (margaret welles barron)None86311swift, mrs. frederick joel (margaret welles ba...female48.000000001746625.9292D17SNaN
sage, missdorothy edith "dolly"None86403sage, miss. dorothy edith "dolly"female29.69911882CA. 234369.5500NaNSNaN
gill, mrjohn williamNone86502gill, mr. john williammale24.0000000023386613.0000NaNSNaN
bystrom, mrs(karolina)None86612bystrom, mrs. (karolina)female42.0000000023685213.0000NaNSNaN
duran y more, missasuncionNone86712duran y more, miss. asuncionfemale27.00000010SC/PARIS 214913.8583NaNCNaN
roebling, mrwashington augustus iiNone86801roebling, mr. washington augustus iimale31.00000000PC 1759050.4958A24SNaN
van melkebeke, mrphilemonNone86903van melkebeke, mr. philemonmale29.699118003457779.5000NaNSNaN
johnson, masterharold theodorNone87013johnson, master. harold theodormale4.0000001134774211.1333NaNS1.0
balkic, mrcerinNone87103balkic, mr. cerinmale26.000000003492487.8958NaNSNaN
beckwith, mrsrichard leonard (sallie monypeny)None87211beckwith, mrs. richard leonard (sallie monypeny)female47.000000111175152.5542D35SNaN
carlsson, mrfrans olofNone87301carlsson, mr. frans olofmale33.000000006955.0000B51 B53 B55SNaN
vander cruyssen, mrvictorNone87403vander cruyssen, mr. victormale47.000000003457659.0000NaNSNaN
abelson, mrssamuel (hannah wizosky)None87512abelson, mrs. samuel (hannah wizosky)female28.00000010P/PP 338124.0000NaNCNaN
najib, missadele kiamie "jane"None87613najib, miss. adele kiamie "jane"female15.0000000026677.2250NaNCNaN
gustafsson, mralfred ossianNone87703gustafsson, mr. alfred ossianmale20.0000000075349.8458NaNSNaN
petroff, mrnedelioNone87803petroff, mr. nedeliomale19.000000003492127.8958NaNSNaN
laleff, mrkristoNone87903laleff, mr. kristomale29.699118003492177.8958NaNSNaN
potter, mrsthomas jr (lily alexenia wilson)None88011potter, mrs. thomas jr (lily alexenia wilson)female56.000000011176783.1583C50CNaN
shelley, mrswilliam (imanita parrish hall)None88112shelley, mrs. william (imanita parrish hall)female25.0000000123043326.0000NaNSNaN
markun, mrjohannNone88203markun, mr. johannmale33.000000003492577.8958NaNSNaN
dahlberg, missgerda ulrikaNone88303dahlberg, miss. gerda ulrikafemale22.00000000755210.5167NaNSNaN
banfield, mrfrederick jamesNone88402banfield, mr. frederick jamesmale28.00000000C.A./SOTON 3406810.5000NaNSNaN
sutehall, mrhenry jrNone88503sutehall, mr. henry jrmale25.00000000SOTON/OQ 3920767.0500NaNSNaN
rice, mrswilliam (margaret norton)None88603rice, mrs. william (margaret norton)female39.0000000538265229.1250NaNQNaN
montvila, revjuozasNone88702montvila, rev. juozasmale27.0000000021153613.0000NaNSNaN
graham, missmargaret edithNone88811graham, miss. margaret edithfemale19.0000000011205330.0000B42SNaN
johnston, misscatherine helen "carrie"None88903johnston, miss. catherine helen "carrie"female29.69911812W./C. 660723.4500NaNSNaN
behr, mrkarl howellNone89011behr, mr. karl howellmale26.0000000011136930.0000C148CNaN
dooley, mrpatrickNone89103dooley, mr. patrickmale32.000000003703767.7500NaNQNaN

891 rows × 16 columns
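To make the effect of the axis argument concrete, here is a small sketch with two hypothetical frames (left and right, not from the original). With axis=1 the frames are placed side by side and matched on the index; with axis=0 they are stacked, and cells with no source value become NaN.

```python
import pandas as pd

# Two hypothetical frames that share the same index.
left = pd.DataFrame({"a": [1, 2]}, index=[0, 1])
right = pd.DataFrame({"b": [10, 20]}, index=[0, 1])

# axis=1: columns are added, rows are aligned on the index.
side_by_side = pd.concat([left, right], axis=1)

# axis=0: rows are appended; columns missing from a frame are filled with NaN.
stacked = pd.concat([left, right], axis=0)

print(side_by_side.shape)  # 2 rows, 2 columns
print(stacked.shape)       # 4 rows, 2 columns
```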

5. Extracting data

#---------------------------------------------------------- Data extraction
# Extract a single row by its index label
data.loc[3]

名              futrelle, mrs
姓              jacques heath (lily may peel)
s              None
PassengerId    4
Survived       1
Pclass         1
Name           futrelle, mrs. jacques heath (lily may peel)
Sex            female
Age            35
SibSp          1
Parch          0
Ticket         113803
Fare           53.1
Cabin          C123
Embarked       S
tip            NaN
Name: 3, dtype: object

# Extract several rows by position; the slice is half-open (start included, stop excluded)
data.iloc[0:3]

   名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
0  braund, mr  owen harris  None  1  0  3  braund, mr. owen harris  male  22.0  1  0  A/5 21171  7.2500  NaN  S  NaN
1  cumings, mrs  john bradley (florence briggs thayer)  None  2  1  1  cumings, mrs. john bradley (florence briggs th...  female  38.0  1  0  PC 17599  71.2833  C85  C  NaN
2  heikkinen, miss  laina  None  3  1  3  heikkinen, miss. laina  female  26.0  0  0  STON/O2. 3101282  7.9250  NaN  S  NaN
# Extract the rows that satisfy a condition
data.loc[data['Age']<3]

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
palsson, mastergosta leonardNone803palsson, master. gosta leonardmale2.003134990921.0750NaNS1.0
rice, mastereugeneNone1703rice, master. eugenemale2.004138265229.1250NaNQ1.0
caldwell, masteralden gatesNone7912caldwell, master. alden gatesmale0.830224873829.0000NaNS1.0
andersson, missellis anna mariaNone12003andersson, miss. ellis anna mariafemale2.004234708231.2750NaNSNaN
panula, mastereino viljamiNone16503panula, master. eino viljamimale1.0041310129539.6875NaNS1.0
johnson, misseleanor ileenNone17313johnson, miss. eleanor ileenfemale1.001134774211.1333NaNSNaN
becker, masterrichard fNone18412becker, master. richard fmale1.002123013639.0000F4S1.0
strom, misstelma matildaNone20603strom, miss. telma matildafemale2.000134705410.4625G6SNaN
allison, misshelen loraineNone29801allison, miss. helen lorainefemale2.0012113781151.5500C22 C26SNaN
allison, masterhudson trevorNone30611allison, master. hudson trevormale0.9212113781151.5500C22 C26S1.0
navratil, masteredmond rogerNone34112navratil, master. edmond rogermale2.001123008026.0000F2S1.0
nakid, missmaria ("mary")None38213nakid, miss. maria ("mary")female1.0002265315.7417NaNCNaN
goodwin, mastersidney leonardNone38703goodwin, master. sidney leonardmale1.0052CA 214446.9000NaNS1.0
baclini, misshelene barbaraNone47013baclini, miss. helene barbarafemale0.7521266619.2583NaNCNaN
hirvonen, misshildur eNone48013hirvonen, miss. hildur efemale2.0001310129812.2875NaNSNaN
quick, missphyllis mayNone53112quick, miss. phyllis mayfemale2.00112636026.0000NaNSNaN
skoog, missmargit elizabethNone64303skoog, miss. margit elizabethfemale2.003234708827.9000NaNSNaN
baclini, misseugenieNone64513baclini, miss. eugeniefemale0.7521266619.2583NaNCNaN
hamalainen, masterviljoNone75612hamalainen, master. viljomale0.671125064914.5000NaNS1.0
dean, masterbertram vereNone78913dean, master. bertram veremale1.0012C.A. 231520.5750NaNS1.0
thomas, masterassad alexanderNone80413thomas, master. assad alexandermale0.420126258.5167NaNC1.0
panula, masterurho abrahamNone82503panula, master. urho abrahammale2.0041310129539.6875NaNS1.0
mallet, masterandreNone82812mallet, master. andremale1.0002S.C./PARIS 207937.0042NaNC1.0
richards, mastergeorge sibleyNone83212richards, master. george sibleymale0.83112910618.7500NaNS1.0
# Extract data by position (first three rows, first four columns)
data.iloc[:3,:4]

   名  姓  s  PassengerId
0  braund, mr  owen harris  None  1
1  cumings, mrs  john bradley (florence briggs thayer)  None  2
2  heikkinen, miss  laina  None  3
data.iloc[1:3,3:4]

   PassengerId
1  2
2  3
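The difference between label-based loc and position-based iloc is easy to miss when the index happens to be 0, 1, 2, and so on. The sketch below uses a hypothetical four-row frame (df, with values borrowed from the first passengers) to show that a loc slice includes its end label, an iloc slice excludes its end position, and the two diverge once the rows are reordered.

```python
import pandas as pd

# Hypothetical frame with the default integer index 0..3.
df = pd.DataFrame({
    "PassengerId": [1, 2, 3, 4],
    "Age": [22.0, 38.0, 26.0, 35.0],
})

by_label = df.loc[0:2]     # labels 0, 1 and 2: the end label is included
by_position = df.iloc[0:2]  # positions 0 and 1: the end position is excluded

# After sorting, labels travel with their rows but positions do not.
shuffled = df.sort_values("Age", ascending=False)
first_position = shuffled.iloc[0]  # the oldest passenger, whatever its label
label_zero = shuffled.loc[0]       # the row that was originally first

print(len(by_label), len(by_position))
print(first_position["PassengerId"], label_zero["PassengerId"])
```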
# Test whether each value in a column is one of the given values
data['SibSp'].isin([1,2])

0 True
1 True
2 False
3 True
4 False
5 False
6 False
7 False
8 False
9 True
10 True
11 False
12 False
13 True
14 False
15 False
16 False
17 False
18 True
19 False
20 False
21 False
22 False
23 False
24 False
25 True
26 False
27 False
28 False
29 False
...
861 True
862 False
863 False
864 False
865 False
866 True
867 False
868 False
869 True
870 False
871 True
872 False
873 False
874 True
875 False
876 False
877 False
878 False
879 False
880 False
881 False
882 False
883 False
884 False
885 False
886 False
887 False
888 True
889 False
890 False
Name: SibSp, Length: 891, dtype: bool

# Extract the rows that satisfy the condition
data.loc[data['SibSp'].isin([1,2])]

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
braund, mrowen harrisNone103braund, mr. owen harrismale22.00000010A/5 211717.2500NaNSNaN
cumings, mrsjohn bradley (florence briggs thayer)None211cumings, mrs. john bradley (florence briggs th...female38.00000010PC 1759971.2833C85CNaN
futrelle, mrsjacques heath (lily may peel)None411futrelle, mrs. jacques heath (lily may peel)female35.0000001011380353.1000C123SNaN
nasser, mrsnicholas (adele achem)None1012nasser, mrs. nicholas (adele achem)female14.0000001023773630.0708NaNCNaN
sandstrom, missmarguerite rutNone1113sandstrom, miss. marguerite rutfemale4.00000011PP 954916.7000G6SNaN
andersson, mranders johanNone1403andersson, mr. anders johanmale39.0000001534708231.2750NaNSNaN
vander planke, mrsjulius (emelia maria vandemoortele)None1903vander planke, mrs. julius (emelia maria vande...female31.0000001034576318.0000NaNSNaN
asplund, mrscarl oscar (selma augusta emilia johansson)None2613asplund, mrs. carl oscar (selma augusta emilia...female38.0000001534707731.3875NaNSNaN
spencer, mrswilliam augustus (marie eugenie)None3211spencer, mrs. william augustus (marie eugenie)female29.69911810PC 17569146.5208B78CNaN
meyer, mredgar josephNone3501meyer, mr. edgar josephmale28.00000010PC 1760482.1708NaNCNaN
holverson, mralexander oskarNone3601holverson, mr. alexander oskarmale42.0000001011378952.0000NaNSNaN
vander planke, missaugusta mariaNone3903vander planke, miss. augusta mariafemale18.0000002034576418.0000NaNSNaN
nicola-yarred, missjamilaNone4013nicola-yarred, miss. jamilafemale14.00000010265111.2417NaNCNaN
ahlin, mrsjohan (johanna persdotter larsson)None4103ahlin, mrs. johan (johanna persdotter larsson)female40.0000001075469.4750NaNSNaN
turpin, mrswilliam john robert (dorothy ann wonnacott)None4202turpin, mrs. william john robert (dorothy ann ...female27.000000101166821.0000NaNSNaN
laroche, misssimonne marie anne andreeNone4412laroche, miss. simonne marie anne andreefemale3.00000012SC/Paris 212341.5792NaNCNaN
lennon, mrdenisNone4703lennon, mr. denismale29.6991181037037115.5000NaNQNaN
samaan, mryoussefNone4903samaan, mr. youssefmale29.69911820266221.6792NaNCNaN
arnold-franchi, mrsjosef (josefine franchi)None5003arnold-franchi, mrs. josef (josefine franchi)female18.0000001034923717.8000NaNSNaN
harper, mrshenry sleeper (myna haxtun)None5311harper, mrs. henry sleeper (myna haxtun)female49.00000010PC 1757276.7292D33CNaN
faunthorpe, mrslizzie (elizabeth anne wilkinson)None5412faunthorpe, mrs. lizzie (elizabeth anne wilkin...female29.00000010292626.0000NaNSNaN
west, missconstance miriumNone5912west, miss. constance miriumfemale5.00000012C.A. 3465127.7500NaNSNaN
harris, mrhenry birkhardtNone6301harris, mr. henry birkhardtmale45.000000103697383.4750C83SNaN
moubarek, mastergeriosNone6613moubarek, master. geriosmale29.69911811266115.2458NaNCNaN
kink, mrvincenzNone7003kink, mr. vincenzmale26.000000203151518.6625NaNSNaN
chronopoulos, mrapostolosNone7403chronopoulos, mr. apostolosmale26.00000010268014.4542NaNCNaN
ford, mrwilliam nealNone8703ford, mr. william nealmale16.00000013W./C. 660834.3750NaNSNaN
chaffee, mrherbert fullerNone9301chaffee, mr. herbert fullermale46.00000010W.E.P. 573461.1750E31SNaN
dean, mrbertram frankNone9403dean, mr. bertram frankmale26.00000012C.A. 231520.5750NaNSNaN
kantor, mrsinaiNone10002kantor, mr. sinaimale34.0000001024436726.0000NaNSNaN
................................................
wells, missjoanNone75112wells, miss. joanfemale4.000000112910323.0000NaNSNaN
herman, mrssamuel (jane laver)None75512herman, mrs. samuel (jane laver)female48.0000001222084565.0000NaNSNaN
hamalainen, masterviljoNone75612hamalainen, master. viljomale0.6700001125064914.5000NaNS1.0
carter, mrswilliam ernest (lucile polk)None76411carter, mrs. william ernest (lucile polk)female36.00000012113760120.0000B96 B98SNaN
hogeboom, mrsjohn c (anna andrews)None76611hogeboom, mrs. john c (anna andrews)female51.000000101350277.9583D11SNaN
moran, mrdaniel jNone76903moran, mr. daniel jmale29.6991181037111024.1500NaNQNaN
hocking, mrselizabeth (eliza needs)None77512hocking, mrs. elizabeth (eliza needs)female54.000000132910523.0000NaNSNaN
dick, mrsalbert adrian (vera gillespie)None78211dick, mrs. albert adrian (vera gillespie)female17.000000101747457.0000B20SNaN
johnston, mrandrew gNone78403johnston, mr. andrew gmale29.69911812W./C. 660723.4500NaNSNaN
dean, masterbertram vereNone78913dean, master. bertram veremale1.00000012C.A. 231520.5750NaNS1.0
van impe, mrsjean baptiste (rosalie paula govaert)None80003van impe, mrs. jean baptiste (rosalie paula go...female30.0000001134577324.1500NaNSNaN
collyer, mrsharvey (charlotte annie tate)None80212collyer, mrs. harvey (charlotte annie tate)female31.00000011C.A. 3192126.2500NaNSNaN
carter, masterwilliam thornton iiNone80311carter, master. william thornton iimale11.00000012113760120.0000B96 B98SNaN
chambers, mrsnorman campbell (bertha griggs)None81011chambers, mrs. norman campbell (bertha griggs)female33.0000001011380653.1000E8SNaN
mallet, mralbertNone81802mallet, mr. albertmale31.00000011S.C./PARIS 207937.0042NaNCNaN
hays, mrscharles melville (clara jennings gregg)None82111hays, mrs. charles melville (clara jennings gr...female52.000000111274993.5000B69SNaN
yasbeck, mrsantoni (selini alexander)None83113yasbeck, mrs. antoni (selini alexander)female15.00000010265914.4542NaNCNaN
richards, mastergeorge sibleyNone83212richards, master. george sibleymale0.830000112910618.7500NaNS1.0
compton, misssara rebeccaNone83611compton, miss. sara rebeccafemale39.00000011PC 1775683.1583E49CNaN
goldenberg, mrssamuel l (edwiga grabowska)None85011goldenberg, mrs. samuel l (edwiga grabowska)female29.699118101745389.1042C92CNaN
boulos, missnourelainNone85303boulos, miss. nourelainfemale9.00000011267815.2458NaNCNaN
carter, mrsernest courtenay (lilian hughes)None85502carter, mrs. ernest courtenay (lilian hughes)female44.0000001024425226.0000NaNSNaN
wick, mrsgeorge dennick (mary hitchcock)None85711wick, mrs. george dennick (mary hitchcock)female45.0000001136928164.8667NaNSNaN
hansen, mrclaus peterNone86103hansen, mr. claus petermale41.0000002035002614.1083NaNSNaN
giles, mrfrederick edwardNone86202giles, mr. frederick edwardmale21.000000102813411.5000NaNSNaN
duran y more, missasuncionNone86712duran y more, miss. asuncionfemale27.00000010SC/PARIS 214913.8583NaNCNaN
johnson, masterharold theodorNone87013johnson, master. harold theodormale4.0000001134774211.1333NaNS1.0
beckwith, mrsrichard leonard (sallie monypeny)None87211beckwith, mrs. richard leonard (sallie monypeny)female47.000000111175152.5542D35SNaN
abelson, mrssamuel (hannah wizosky)None87512abelson, mrs. samuel (hannah wizosky)female28.00000010P/PP 338124.0000NaNCNaN
johnston, misscatherine helen "carrie"None88903johnston, miss. catherine helen "carrie"female29.69911812W./C. 660723.4500NaNSNaN

237 rows × 16 columns
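A short sketch of how the isin mask can be reused: the boolean Series selects matching rows, and its negation with ~ selects the rest. The sibsp Series below is a hypothetical stand-in for data['SibSp'], with made-up values.

```python
import pandas as pd

# Hypothetical stand-in for data['SibSp'].
sibsp = pd.Series([1, 1, 0, 1, 0, 3, 2], name="SibSp")

mask = sibsp.isin([1, 2])   # True where the value is 1 or 2

selected = sibsp[mask]      # rows whose value is in the list
excluded = sibsp[~mask]     # ~ inverts the mask: rows whose value is not in the list

# Summing a boolean mask counts the matching rows.
print(mask.sum())
```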

6. Filtering data

#-------------------------------------------------------- Data filtering
# Filter with a logical AND
data.loc[(data['Age']<5) & (data['Sex']=='male')]

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
palsson, mastergosta leonardNone803palsson, master. gosta leonardmale2.003134990921.0750NaNS1.0
rice, mastereugeneNone1703rice, master. eugenemale2.004138265229.1250NaNQ1.0
skoog, masterharaldNone6403skoog, master. haraldmale4.003234708827.9000NaNS1.0
caldwell, masteralden gatesNone7912caldwell, master. alden gatesmale0.830224873829.0000NaNS1.0
panula, mastereino viljamiNone16503panula, master. eino viljamimale1.0041310129539.6875NaNS1.0
rice, masterarthurNone17203rice, master. arthurmale4.004138265229.1250NaNQ1.0
becker, masterrichard fNone18412becker, master. richard fmale1.002123013639.0000F4S1.0
navratil, mastermichel mNone19412navratil, master. michel mmale3.001123008026.0000F2S1.0
asplund, masteredvin rojj felixNone26213asplund, master. edvin rojj felixmale3.004234707731.3875NaNS1.0
allison, masterhudson trevorNone30611allison, master. hudson trevormale0.9212113781151.5500C22 C26S1.0
navratil, masteredmond rogerNone34112navratil, master. edmond rogermale2.001123008026.0000F2S1.0
coutts, masterwilliam loch "william"None34913coutts, master. william loch "william"male3.0011C.A. 3767115.9000NaNS1.0
goodwin, mastersidney leonardNone38703goodwin, master. sidney leonardmale1.0052CA 214446.9000NaNS1.0
richards, masterwilliam roweNone40812richards, master. william rowemale3.00112910618.7500NaNS1.0
dodge, masterwashingtonNone44611dodge, master. washingtonmale4.00023363881.8583A34S1.0
hamalainen, masterviljoNone75612hamalainen, master. viljomale0.671125064914.5000NaNS1.0
dean, masterbertram vereNone78913dean, master. bertram veremale1.0012C.A. 231520.5750NaNS1.0
thomas, masterassad alexanderNone80413thomas, master. assad alexandermale0.420126258.5167NaNC1.0
panula, masterurho abrahamNone82503panula, master. urho abrahammale2.0041310129539.6875NaNS1.0
mallet, masterandreNone82812mallet, master. andremale1.0002S.C./PARIS 207937.0042NaNC1.0
richards, mastergeorge sibleyNone83212richards, master. george sibleymale0.83112910618.7500NaNS1.0
andersson, mastersigvard harald eliasNone85103andersson, master. sigvard harald eliasmale4.004234708231.2750NaNS1.0
johnson, masterharold theodorNone87013johnson, master. harold theodormale4.001134774211.1333NaNS1.0
# Filter with a logical OR
data.loc[(data['Age']==5) | (data['Age']==2)]

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
palsson, mastergosta leonardNone803palsson, master. gosta leonardmale2.03134990921.0750NaNS1.0
rice, mastereugeneNone1703rice, master. eugenemale2.04138265229.1250NaNQ1.0
west, missconstance miriumNone5912west, miss. constance miriumfemale5.012C.A. 3465127.7500NaNSNaN
andersson, missellis anna mariaNone12003andersson, miss. ellis anna mariafemale2.04234708231.2750NaNSNaN
strom, misstelma matildaNone20603strom, miss. telma matildafemale2.00134705410.4625G6SNaN
asplund, misslillian gertrudNone23413asplund, miss. lillian gertrudfemale5.04234707731.3875NaNSNaN
allison, misshelen loraineNone29801allison, miss. helen lorainefemale2.012113781151.5500C22 C26SNaN
navratil, masteredmond rogerNone34112navratil, master. edmond rogermale2.01123008026.0000F2S1.0
baclini, missmarie catherineNone44913baclini, miss. marie catherinefemale5.021266619.2583NaNCNaN
hirvonen, misshildur eNone48013hirvonen, miss. hildur efemale2.001310129812.2875NaNSNaN
quick, missphyllis mayNone53112quick, miss. phyllis mayfemale2.0112636026.0000NaNSNaN
skoog, missmargit elizabethNone64303skoog, miss. margit elizabethfemale2.03234708827.9000NaNSNaN
emanuel, missvirginia ethelNone77813emanuel, miss. virginia ethelfemale5.00036451612.4750NaNSNaN
panula, masterurho abrahamNone82503panula, master. urho abrahammale2.041310129539.6875NaNS1.0
# Count the values in one column of the filtered data
data.loc[(data['Age']==5) | (data['Age']==2)].Parch.count()  # number of non-null Parch values among passengers aged 5 or 2

14

# Filter with the query function
data.query('Age==[5,2]')

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
palsson, mastergosta leonardNone803palsson, master. gosta leonardmale2.03134990921.0750NaNS1.0
rice, mastereugeneNone1703rice, master. eugenemale2.04138265229.1250NaNQ1.0
west, missconstance miriumNone5912west, miss. constance miriumfemale5.012C.A. 3465127.7500NaNSNaN
andersson, missellis anna mariaNone12003andersson, miss. ellis anna mariafemale2.04234708231.2750NaNSNaN
strom, misstelma matildaNone20603strom, miss. telma matildafemale2.00134705410.4625G6SNaN
asplund, misslillian gertrudNone23413asplund, miss. lillian gertrudfemale5.04234707731.3875NaNSNaN
allison, misshelen loraineNone29801allison, miss. helen lorainefemale2.012113781151.5500C22 C26SNaN
navratil, masteredmond rogerNone34112navratil, master. edmond rogermale2.01123008026.0000F2S1.0
baclini, missmarie catherineNone44913baclini, miss. marie catherinefemale5.021266619.2583NaNCNaN
hirvonen, misshildur eNone48013hirvonen, miss. hildur efemale2.001310129812.2875NaNSNaN
quick, missphyllis mayNone53112quick, miss. phyllis mayfemale2.0112636026.0000NaNSNaN
skoog, missmargit elizabethNone64303skoog, miss. margit elizabethfemale2.03234708827.9000NaNSNaN
emanuel, missvirginia ethelNone77813emanuel, miss. virginia ethelfemale5.00036451612.4750NaNSNaN
panula, masterurho abrahamNone82503panula, master. urho abrahammale2.041310129539.6875NaNS1.0
# Sum a column of the filtered data
data.query('Age==[5,2]').SibSp.sum()

28
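The boolean-mask filters and the query calls in this section express the same selections. As a sketch under assumed data (the five-row frame df and the list ages are hypothetical), the example below checks that both forms return the same rows and shows that query can reference a local Python variable with the @ prefix.

```python
import pandas as pd

# Hypothetical frame; one Age is missing on purpose.
df = pd.DataFrame({
    "Age": [2.0, 5.0, 30.0, 2.0, float("nan")],
    "Sex": ["male", "female", "male", "female", "male"],
    "SibSp": [3, 1, 0, 4, 0],
})

ages = [5, 2]

# The same filter written three ways.
by_mask = df.loc[(df["Age"] == 5) | (df["Age"] == 2)]
by_isin = df.loc[df["Age"].isin(ages)]
by_query = df.query("Age in @ages")   # @ages refers to the local variable

# 'and' / 'or' inside query replace & and | plus their parentheses.
young_males = df.query("Age < 5 and Sex == 'male'")

total_sibsp = by_query["SibSp"].sum()
print(total_sibsp)
```

Comparisons against a missing Age evaluate to False, so the row with NaN is left out of every filtered result.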

7. Summarizing data

#------------------------------------------------------------- Data summary
# Group by the categories of one column and count the non-null values in every other column
data.groupby('Pclass').count()

Pclass  名   姓   s  PassengerId  Survived  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
1       216  216  1  216          216       216   216  216  216    216    216     216   176    214       2
2       184  184  0  184          184       184   184  184  184    184    184     184   16     184       9
3       491  491  0  491          491       491   491  491  491    491    491     491   12     491       22
# Group by the categories of one column and count a single column only
data.groupby('Pclass')['Sex'].count()

Pclass
1    216
2    184
3    491
Name: Sex, dtype: int64

# Summarize by two fields
data.groupby(['Pclass','Sex'])['PassengerId'].count()

Pclass  Sex
1       female     94
        male      122
2       female     76
        male      108
3       female    144
        male      347
Name: PassengerId, dtype: int64

# Summarize one column and compute several statistics within each group (agg stands for aggregate)
data.groupby(['Pclass','Sex'])['Age'].agg([len,np.sum,np.mean])

Pclass  Sex     len    sum          mean
1       female  94.0   3209.292059  34.141405
        male    122.0  4793.101471  39.287717
2       female  76.0   2184.898235  28.748661
        male    108.0  3310.622059  30.653908
3       female  144.0  3465.862941  24.068493
        male    347.0  9498.137059  27.372153
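As an alternative sketch of the same kind of aggregation, the example below passes string function names through pandas named aggregation, which lets each output column be given a readable name. The frame df is hypothetical, and the output names rows, non_null, total, and average are assumptions. It also illustrates that "size" counts every row in a group while "count" skips missing values.

```python
import pandas as pd

# Hypothetical frame with one missing Age in the (2, female) group.
df = pd.DataFrame({
    "Pclass": [1, 1, 2, 2, 3],
    "Sex": ["female", "male", "female", "female", "male"],
    "Age": [38.0, 54.0, 14.0, float("nan"), 22.0],
})

# Named aggregation: output_column="function name".
summary = df.groupby(["Pclass", "Sex"])["Age"].agg(
    rows="size",        # all rows in the group, missing values included
    non_null="count",   # rows with a non-missing Age
    total="sum",
    average="mean",
)
print(summary)
```

String names such as "sum" and "mean" dispatch to the built-in grouped implementations, which is one reason they are often preferred over passing NumPy functions.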

8. Statistics

#-------------------------------------------------------- Data statistics
# Sampling: draw a random sample of rows
data.sample(n=10)

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
jenkin, mrstephen curnowNone7102jenkin, mr. stephen curnowmale32.00000000C.A. 3311110.5000NaNSNaN
chaffee, mrherbert fullerNone9301chaffee, mr. herbert fullermale46.00000010W.E.P. 573461.1750E31SNaN
astor, mrsjohn jacob (madeleine talmadge force)None70111astor, mrs. john jacob (madeleine talmadge force)female18.00000010PC 17757227.5250C62 C64CNaN
kirkland, revcharles leonardNone62702kirkland, rev. charles leonardmale57.0000000021953312.3500NaNQNaN
gillespie, mrwilliam henryNone72302gillespie, mr. william henrymale34.000000001223313.0000NaNSNaN
wiklund, mrjakob alfredNone37203wiklund, mr. jakob alfredmale18.0000001031012676.4958NaNSNaN
beckwith, mrrichard leonardNone24911beckwith, mr. richard leonardmale37.000000111175152.5542D35SNaN
otter, mrrichardNone79602otter, mr. richardmale39.000000002821313.0000NaNSNaN
palsson, mrsnils (alma cornelia berglund)None56803palsson, mrs. nils (alma cornelia berglund)female29.0000000434990921.0750NaNSNaN
sirota, mrmauriceNone83803sirota, mr. mauricemale29.699118003920928.0500NaNSNaN
# Sample without replacement (replace=False is the default, so this behaves like the call above)
data.sample(n=10,replace=False)

名  姓  s  PassengerId  Survived  Pclass  Name  Sex  Age  SibSp  Parch  Ticket  Fare  Cabin  Embarked  tip
brown, missamelia "mildred"None34612brown, miss. amelia "mildred"female24.0000000024873313.0000F33SNaN
williams, mrhoward hugh "harry"None30503williams, mr. howard hugh "harry"male29.69911800A/5 24668.0500NaNSNaN
mullens, misskatherine "katie"None69813mullens, miss. katherine "katie"female29.69911800358527.7333NaNQNaN
artagaveytia, mrramonNone49401artagaveytia, mr. ramonmale71.00000000PC 1760949.5042NaNCNaN
mellors, mrwilliam johnNone22712mellors, mr. william johnmale19.00000000SW/PP 75110.5000NaNSNaN
quick, mrsfrederick charles (jane richards)None50712quick, mrs. frederick charles (jane richards)female33.000000022636026.0000NaNSNaN
andrews, mrthomas jrNone80701andrews, mr. thomas jrmale39.000000001120500.0000A36SNaN
denkoff, mrmittoNone33603denkoff, mr. mittomale29.699118003492257.8958NaNSNaN
morrow, mrthomas rowanNone56103morrow, mr. thomas rowanmale29.699118003726227.7500NaNQNaN
molson, mrharry marklandNone49301molson, mr. harry marklandmale55.0000000011378730.5000C30SNaN
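# Sample with replacement
data.sample(n=10, replace=True)
# With this much data the difference is hard to see: sampling with replacement can draw the same row more than once, while sampling without replacement cannot

| | 名 | 姓 | s | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | tip |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 703 | gallagher, mr | martin | None | 704 | 0 | 3 | gallagher, mr. martin | male | 25.000000 | 0 | 0 | 36864 | 7.7417 | NaN | Q | NaN |
| 398 | pain, dr | alfred | None | 399 | 0 | 2 | pain, dr. alfred | male | 23.000000 | 0 | 0 | 244278 | 10.5000 | NaN | S | NaN |
| 721 | jensen, mr | svend lauritz | None | 722 | 0 | 3 | jensen, mr. svend lauritz | male | 17.000000 | 1 | 0 | 350048 | 7.0542 | NaN | S | NaN |
| 708 | cleaver, miss | alice | None | 709 | 1 | 1 | cleaver, miss. alice | female | 22.000000 | 0 | 0 | 113781 | 151.5500 | NaN | S | NaN |
| 805 | johansson, mr | karl johan | None | 806 | 0 | 3 | johansson, mr. karl johan | male | 31.000000 | 0 | 0 | 347063 | 7.7750 | NaN | S | NaN |
| 262 | taussig, mr | emil | None | 263 | 0 | 1 | taussig, mr. emil | male | 52.000000 | 1 | 1 | 110413 | 79.6500 | E67 | S | NaN |
| 52 | harper, mrs | henry sleeper (myna haxtun) | None | 53 | 1 | 1 | harper, mrs. henry sleeper (myna haxtun) | female | 49.000000 | 1 | 0 | PC 17572 | 76.7292 | D33 | C | NaN |
| 659 | newell, mr | arthur webster | None | 660 | 0 | 1 | newell, mr. arthur webster | male | 58.000000 | 0 | 2 | 35273 | 113.2750 | D48 | C | NaN |
| 692 | lam, mr | ali | None | 693 | 1 | 3 | lam, mr. ali | male | 29.699118 | 0 | 0 | 1601 | 56.4958 | NaN | S | NaN |
| 597 | johnson, mr | alfred | None | 598 | 0 | 3 | johnson, mr. alfred | male | 49.000000 | 0 | 0 | LINE | 0.0000 | NaN | S | NaN |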
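
Since 10 draws from 891 rows rarely collide, the difference between the two modes shows up more clearly on a tiny table. This is a rough sketch, assuming a hypothetical 5-row frame named `toy`: drawing 10 rows from 5 must repeat something when `replace=True`, and is rejected outright when `replace=False`.

```python
import pandas as pd

# Hypothetical 5-row table, small enough that repeats are unavoidable
toy = pd.DataFrame({"PassengerId": [1, 2, 3, 4, 5]})

# With replacement: 10 draws from 5 rows must repeat at least one row
with_repl = toy.sample(n=10, replace=True, random_state=0)
print(with_repl.index.duplicated().any())

# Without replacement: asking for more rows than exist raises ValueError
raised = False
try:
    toy.sample(n=10, replace=False)
except ValueError as err:
    raised = True
    print("ValueError:", err)
```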
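# Descriptive statistics for the table
data.describe().round(2).T
# This can be run one step at a time: get the descriptive statistics, round them to two decimal places, then transpose

| | count | mean | std | min | 25% | 50% | 75% | max |
|---|---|---|---|---|---|---|---|---|
| PassengerId | 891.0 | 446.00 | 257.35 | 1.00 | 223.50 | 446.00 | 668.5 | 891.00 |
| Survived | 891.0 | 0.38 | 0.49 | 0.00 | 0.00 | 0.00 | 1.0 | 1.00 |
| Pclass | 891.0 | 2.31 | 0.84 | 1.00 | 2.00 | 3.00 | 3.0 | 3.00 |
| Age | 891.0 | 29.70 | 13.00 | 0.42 | 22.00 | 29.70 | 35.0 | 80.00 |
| SibSp | 891.0 | 0.52 | 1.10 | 0.00 | 0.00 | 0.00 | 1.0 | 8.00 |
| Parch | 891.0 | 0.38 | 0.81 | 0.00 | 0.00 | 0.00 | 0.0 | 6.00 |
| Fare | 891.0 | 32.20 | 49.69 | 0.00 | 7.91 | 14.45 | 31.0 | 512.33 |
| tip | 33.0 | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 | 1.0 | 1.00 |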
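
The chained call can be unpacked into its three steps. Here is a minimal sketch, assuming a small frame named `toy` that reuses the `Age` and `Fare` values of the first four passengers shown at the top of the article (the variable names are illustrative):

```python
import pandas as pd

# Age and Fare of the first four passengers shown earlier in the article
toy = pd.DataFrame({
    "Age": [22.0, 38.0, 26.0, 35.0],
    "Fare": [7.25, 71.2833, 7.925, 53.1],
})

stats = toy.describe()      # step 1: statistics as rows, columns as columns
rounded = stats.round(2)    # step 2: round every value to two decimals
transposed = rounded.T      # step 3: one row per column, one column per statistic

print(transposed)

# The one-line form gives the same table
same = transposed.equals(toy.describe().round(2).T)
print(same)
```

Transposing last keeps one row per field, which tends to be easier to read when the table has many columns.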
# Standard deviation of a column
data['Age'].std()
13.002015226002884

# Covariance between two fields
data['Age'].cov(data['SibSp'])
-3.3353450099140782

# Covariance across all fields
data.cov()
# Note that only the numeric columns are included

| | PassengerId | Survived | Pclass | Age | SibSp | Parch | Fare | tip |
|---|---|---|---|---|---|---|---|---|
| PassengerId | 66231.000000 | -0.626966 | -7.561798 | 111.113042 | -16.325843 | -0.342697 | 161.883369 | 0.0 |
| Survived | -0.626966 | 0.236772 | -0.137703 | -0.441656 | -0.018954 | 0.032017 | 6.221787 | 0.0 |
| Pclass | -7.561798 | -0.137703 | 0.699015 | -3.601855 | 0.076599 | 0.012429 | -22.830196 | 0.0 |
| Age | 111.113042 | -0.441656 | -3.601855 | 169.052400 | -3.335345 | -1.877987 | 59.162200 | 0.0 |
| SibSp | -16.325843 | -0.018954 | 0.076599 | -3.335345 | 1.216043 | 0.368739 | 8.748734 | 0.0 |
| Parch | -0.342697 | 0.032017 | 0.012429 | -1.877987 | 0.368739 | 0.649728 | 8.661052 | 0.0 |
| Fare | 161.883369 | 6.221787 | -22.830196 | 59.162200 | 8.748734 | 8.661052 | 2469.436846 | 0.0 |
| tip | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.0 |
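
Two relationships in the output above are worth checking: the `Age`/`Age` entry (169.0524) is the square of the `Age` standard deviation (13.002...), and the `Age`/`SibSp` entry matches the two-field result. Also, pandas 2.0 and later raise an error instead of silently skipping text columns in `cov()`, so selecting the numeric columns explicitly is the safer habit. A minimal sketch, assuming a small frame `toy` built from the first four passengers shown at the top of the article:

```python
import pandas as pd

# First four passengers from the article's table (Sex is a text column)
toy = pd.DataFrame({
    "Sex": ["male", "female", "female", "female"],
    "Age": [22.0, 38.0, 26.0, 35.0],
    "SibSp": [1, 1, 0, 1],
})

# Keep only numeric columns before computing the covariance matrix
numeric = toy.select_dtypes(include="number")
cov = numeric.cov()
print(cov)

# Diagonal entries are variances, i.e. the squared standard deviation
diag_gap = abs(cov.loc["Age", "Age"] - toy["Age"].std() ** 2)

# Off-diagonal entries equal the pairwise Series.cov result
pair_gap = abs(cov.loc["Age", "SibSp"] - toy["Age"].cov(toy["SibSp"]))
print(diag_gap < 1e-9, pair_gap < 1e-9)
```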
# Correlation between two fields
data['Age'].corr(data['Survived'])
-0.06980851528714313

# Correlation across all fields
data.corr()

| | PassengerId | Survived | Pclass | Age | SibSp | Parch | Fare | tip |
|---|---|---|---|---|---|---|---|---|
| PassengerId | 1.000000 | -0.005007 | -0.035144 | 0.033207 | -0.057527 | -0.001652 | 0.012658 | NaN |
| Survived | -0.005007 | 1.000000 | -0.338481 | -0.069809 | -0.035322 | 0.081629 | 0.257307 | NaN |
| Pclass | -0.035144 | -0.338481 | 1.000000 | -0.331339 | 0.083081 | 0.018443 | -0.549500 | NaN |
| Age | 0.033207 | -0.069809 | -0.331339 | 1.000000 | -0.232625 | -0.179191 | 0.091566 | NaN |
| SibSp | -0.057527 | -0.035322 | 0.083081 | -0.232625 | 1.000000 | 0.414838 | 0.159651 | NaN |
| Parch | -0.001652 | 0.081629 | 0.018443 | -0.179191 | 0.414838 | 1.000000 | 0.216225 | NaN |
| Fare | 0.012658 | 0.257307 | -0.549500 | 0.091566 | 0.159651 | 0.216225 | 1.000000 | NaN |
| tip | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
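
The `tip` row and column are all NaN because every non-missing `tip` value is 1.0 (its standard deviation in the `describe()` output is 0.00), and the Pearson coefficient divides the covariance by the product of the two standard deviations. The sketch below illustrates both points on a hypothetical frame `toy`, where `tip` is a constant column added for the demonstration:

```python
import pandas as pd

# Age and Fare of the first four passengers, plus a constant column
toy = pd.DataFrame({
    "Age": [22.0, 38.0, 26.0, 35.0],
    "Fare": [7.25, 71.2833, 7.925, 53.1],
    "tip": [1.0, 1.0, 1.0, 1.0],  # zero variance, like tip in the article
})

# Pearson correlation is covariance scaled by both standard deviations
r = toy["Age"].corr(toy["Fare"])
manual = toy["Age"].cov(toy["Fare"]) / (toy["Age"].std() * toy["Fare"].std())
print(abs(r - manual) < 1e-9)

# A zero-variance column makes the denominator zero, so its entries are NaN
corr = toy.corr()
print(corr["tip"].isna().all())
```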
