Difference between revisions of "Biopython/C2/Parsing-Data/Khasi"
From Script | Spoken-Tutorial
Hezekiah2016 (Talk | contribs) (Created page with "{| Border=1 ! <center>Time</center> ! <center>Narration</center> |- | 00:01 | Hello everyone.Welcome to this tutorial on '''Parsing Data.''' (Khublei baroh. Ngi pdiang sngewb...") |
Hezekiah2016 (Talk | contribs) |
||
Line 5: | Line 5: | ||
|- | |- | ||
| 00:01 | | 00:01 | ||
− | | | + | | Khublei baroh. Ngi pdiang sngewbha ia phi sha kane ka tutorial halor ka '''Parsing Data.''' |
− | + | ||
|- | |- | ||
| 00:06 | | 00:06 | ||
− | | | + | | Ha kane ka tutorial, ngin pule kumno ban download ia ki '''FASTA''' bad '''GenBank''' files na ka '''NCBI''' database website. |
− | + | ||
|- | |- | ||
| 00:14 | | 00:14 | ||
− | | | + | | Bad ban '''Parse'''ia ki data files da kaba pyndonkam ia ki '''function'''s ha ka '''Sequence Input/Output''' module. |
− | + | ||
|- | |- | ||
| 00:19 | | 00:19 | ||
− | | | + | | Ban bud ia kane ka tutorial, phi dei ban tip bha ia ka undergraduate biochemistry lane bioinformatics. |
− | + | ||
|- | |- | ||
| 00:26 | | 00:26 | ||
− | | | + | | bad ka basic '''Python''' programming. |
− | + | ||
|- | |- | ||
| 00:30 | | 00:30 | ||
− | | | + | | Peit ia ka '''Python''' tutorials ha ka link ba la ai. |
− | + | ||
|- | |- | ||
| 00:34 | | 00:34 | ||
− | | | + | | Ban record ia kane ka tutorial, nga pyndonkam da ka : * '''Ubuntu OS''' version 14.10 |
− | + | ||
|- | |- | ||
| 00:40 | | 00:40 | ||
| '''Python''' version 2.7.8 | | '''Python''' version 2.7.8 | ||
− | |||
|- | |- | ||
| 00:44 | | 00:44 | ||
− | | | + | | '''Ipython interpretor''' version 2.3.0 |
− | + | ||
|- | |- | ||
| 00:48 | | 00:48 | ||
− | | | + | | '''Biopython''' version 1.64 bad * '''Mozilla Firefox '''browser 35.0. |
− | + | ||
|- | |- | ||
| 00:56 | | 00:56 | ||
− | | | + | | Ki scientific data jong ka biology ju store barabor ia ki ha ka text file kum '''FASTA''', '''GenBank''', '''EMBL''', '''Swiss-Prot''' kumta ter ter. |
− | + | ||
|- | |- | ||
| 01:07 | | 01:07 | ||
− | | | + | | Ia ki data files lah ban download na ka database websites. |
− | + | ||
|- | |- | ||
| 01:12 | | 01:12 | ||
− | | | + | | Plie ia ka website link ba lah ai harum, da uno uno u web browser. |
− | + | ||
|- | |- | ||
| 01:17 | | 01:17 | ||
− | | | + | | Ka web-page ka plie. |
− | + | ||
|- | |- | ||
| 01:19 | | 01:19 | ||
− | | | + | | To ngin download iaka '''FASTA''' bad '''GenBank''' files na ka bynta ka human '''insulin gene'''. |
− | + | ||
|- | |- | ||
| 01:25 | | 01:25 | ||
− | | | + | | Ha ka search box, type: "human insulin", click ha '''Search''' button. |
− | + | ||
|- | |- | ||
| 01:31 | | 01:31 | ||
− | | | + | | Ka web-page ka pyni shibun tylli ki flies na ka bynta ka human '''insulin gene'''. |
− | + | ||
|- | |- | ||
| 01:35 | | 01:35 | ||
− | | | + | | Ban pyni nuksa, ngan jied 4 tylli ki files kiba kyrteng “Homo sapiens Insulin mRNA”. |
− | + | ||
|- | |- | ||
| 01:43 | | 01:43 | ||
− | | | + | | Ngan jied ia ki files ba duna ia ka 500 '''base''' pairs. |
− | + | ||
|- | |- | ||
| 01:48 | | 01:48 | ||
− | | | + | | Click ha ka check-box ban jied ia ka file ban download. |
− | + | ||
|- | |- | ||
| 01:56 | | 01:56 | ||
− | | | + | | Wanrah ia u cursor sha “'''Send to'''” option, kaba don ha kyndong khlieh ka mon jong ka page. |
− | + | ||
|- | |- | ||
| 02:02 | | 02:02 | ||
− | | | + | | Click ha i selection button barit ba don u khnam ba kdew shapoh, ba don hajan ka “'''Send to'''” button. |
− | + | ||
|- | |- | ||
| 02:09 | | 02:09 | ||
− | | | + | | Hapoh ka heading “'''Choose destination'''”, click ha '''File''' option. |
− | + | ||
|- | |- | ||
| 02:13 | | 02:13 | ||
− | | | + | | Phi lah ban '''save''' ia kane ka file ha kano kano ka format, kiba don hapoh '''format''' drop-down list box. |
− | + | ||
|- | |- | ||
| 02:21 | | 02:21 | ||
− | | | + | | Jied '''FASTA''' na ki options ba ai hapoh. |
− | + | ||
|- | |- | ||
| 02:25 | | 02:25 | ||
− | | | + | | Nangta sa click ha '''Create file''' option. |
− | + | ||
|- | |- | ||
| 02:29 | | 02:29 | ||
− | | | + | | Ka dialog-box kan sa mih ha ka screen. |
− | + | ||
|- | |- | ||
|02:32 | |02:32 | ||
− | | | + | | Jied '''Open with''', click ha '''OK.''' |
− | + | ||
|- | |- | ||
| 02:36 | | 02:36 | ||
− | | | + | | Ka file ka plie ha ka '''text editor'''. |
− | + | ||
|- | |- | ||
| 02:39 | | 02:39 | ||
− | | | + | | Kane ka file ka pyni 4 tylli ki records, namar ngi la jied 4 tylli ki files ban download. |
− | + | ||
|- | |- | ||
| 02:46 | | 02:46 | ||
− | | | + | | U line banyngkong ha kawei pa kawei ka record u dei u '''identifier''' line. |
− | + | ||
|- | |- | ||
| 02:50 | | 02:50 | ||
− | | | + | | U sdang da u “greater than >)” symbol. |
− | + | ||
|- | |- | ||
| 02:53 | | 02:53 | ||
− | | | + | | Nangta la pynbud da u'''sequence'''. |
− | + | ||
|- | |- | ||
| 02:56 | | 02:56 | ||
− | | | + | | '''Save''' ia ka file ha '''home'''folder jong phi kum ka“sequence.fasta'”. |
− | + | ||
|- | |- | ||
| 03:01 | | 03:01 | ||
− | | | + | | Khang ia ka text editor. |
− | + | ||
|- | |- | ||
| 03:03 | | 03:03 | ||
− | | | + | | Pynbud ki juh ki syn jam kum haneng ban download ia ki files ha '''GenBank''' format |
− | + | ||
− | + | ||
|- | |- | ||
| 03:08 | | 03:08 | ||
− | | | + | | na ka bynta ki files ba la jied hashwa. |
− | + | ||
|- | |- | ||
| 03:12 | | 03:12 | ||
− | | | + | | Jied ia ka '''file format''' kum '''GenBank.''' |
− | + | ||
|- | |- | ||
| 03:16 | | 03:16 | ||
− | | | + | | Shna ia ka file. Plie da u text editor. |
− | + | ||
|- | |- | ||
| 03:21 | | 03:21 | ||
− | | | + | | Peit thuh ba ka sequence file ha '''GenBank''' format ka kham bun features ban ia ka '''FASTA''' file. |
− | + | ||
|- | |- | ||
| 03:27 | | 03:27 | ||
− | | | + | | '''Save''' ia ka file kum "sequence.gb" ha ka '''home''' folder. Khang noh u text editor. |
− | + | ||
|- | |- | ||
| 03:34 | | 03:34 | ||
− | | | + | | Ban peit nuksa, ngi donkam ia ka FASTA file ba don '''record''' tang iwei. |
− | + | ||
|- | |- | ||
| 03:39 | | 03:39 | ||
− | | | + | | Na ka bynta kane, pynkhuid ia ki jingjied ba hashwa da kaba click biang ha ki check box. |
− | + | ||
|- | |- | ||
| 03:48 | | 03:48 | ||
− | | | + | | Mynta, jied ia ka file “'''Human insulin gene complete cds'''”. |
− | + | ||
− | + | ||
|- | |- | ||
| 03:54 | | 03:54 | ||
− | | | + | | Click ha ka check-box. |
− | + | ||
|- | |- | ||
| 03:57 | | 03:57 | ||
− | | | + | | Bad sa bud ia ki rukom kumba la pyni mynne ban '''save''' ia ka file ha ka '''home''' folder. |
− | + | ||
|- | |- | ||
| 04:01 | | 04:01 | ||
− | | | + | | '''Save''' ia ka file kum "insulin.fasta". |
− | + | ||
|- | |- | ||
| 04:08 | | 04:08 | ||
− | | | + | | Ia ki Biological data ba lah store ha kine ki files lah ban sei bad pynkylla da kaba pyndonkam da ka '''Biopython''' libraries. |
− | + | ||
|- | |- | ||
| 04:16 | | 04:16 | ||
− | | | + | | Khang ia u text-editor. |
− | + | ||
|- | |- | ||
| 04:19 | | 04:19 | ||
− | | | + | | Ban sei ia ki data na data files la khot '''Parsing'''. |
− | + | ||
|- | |- | ||
| 04:23 | | 04:23 | ||
− | | | + | | Jan ia baroh ki file formats lah ban parsed da kaba pyndonkam '''function'''s kiba don ha '''SeqIO''' module. |
− | + | ||
|- | |- | ||
| 04:30 | | 04:30 | ||
− | | | + | | Ki function jong ka '''SeqIO''' module kiba ju kham pyndonkam bha baroh ki long: '''parse, read, write''' bad '''convert'''. |
− | + | ||
|- | |- | ||
| 04:38 | | 04:38 | ||
− | | | + | | Plie ia ka terminal da kaba nion lang ia '''Ctrl, Alt''' bad '''t''' keys. |
− | + | ||
|- | |- | ||
| 04:44 | | 04:44 | ||
− | | | + | | Plie ia ka '''Ipython''' da kaba type "ipython" ha ka prompt. Nion '''Enter'''. |
− | + | ||
|- | |- | ||
| 04:51 | | 04:51 | ||
− | | | + | | Nangta, '''import''' "SeqIO" module na '''Bio''' package. |
− | + | ||
|- | |- | ||
| 04:56 | | 04:56 | ||
− | | | + | | Ha ka prompt, type: '''from Bio import SeqIO'''. Nion '''Enter'''. |
− | + | ||
|- | |- | ||
| 05:04 | | 05:04 | ||
− | | | + | | Ngin ia sdang da ka function kaba kham donkam ka “'''parse'''”. |
− | + | ||
|- | |- | ||
|05:07 | |05:07 | ||
− | | | + | | Na ka bynta ka nuksa, ngan pyndonkam ka '''FASTA''' file ka ba don bun '''record'''s kaba ngi download hashwa na ka database. |
− | + | ||
|- | |- | ||
| 05:17 | | 05:17 | ||
− | | | + | | Na ba bynta ka '''FASTA''' parsing ba kham suk, type kumne harum ha ka prompt. |
− | + | ||
|- | |- | ||
| 05:22 | | 05:22 | ||
− | | | + | | Hangne, ngi pyndonkam ia ka '''parse''' function ban read ia ki contents jong ka '''sequence.fasta''' file. |
− | + | ||
|- | |- | ||
| 05:30 | | 05:30 | ||
− | | | + | | Na ka bynta ka output, print '''record id''', sequence kaba don ha ka record bad ruh ia ka jingjrong jong ka sequence. |
− | + | ||
|- | |- | ||
| 05:41 | | 05:41 | ||
− | | | + | | Peit bha ruh ba ia ka '''parse''' function la pyndonkam ban read sequence data kum '''Sequence record objects'''. |
− | + | ||
|- | |- | ||
| 05:48 | | 05:48 | ||
− | | | + | | La ju pyndonkam barabor bad ka '''for''' loop. |
− | + | ||
|- | |- | ||
| 05:52 | | 05:52 | ||
− | | | + | | Ka pdiang ar tylli ki '''arguments''', kaba nyngkong dei ka file name ban read ia ka data. |
− | + | ||
|- | |- | ||
| 05:59 | | 05:59 | ||
− | | | + | | Ka ba ar ka batai shai ia ka format jong ka file. |
− | + | ||
|- | |- | ||
| 06:02 | | 06:02 | ||
− | | | + | | Nion ia u '''Enter''' key ar sien ban ioh ia ka output. |
− | + | ||
|- | |- | ||
| 06:07 | | 06:07 | ||
− | | | + | | Ka output ka pyni ia u ''' identifier line, nangta bud sa u sequence uba don ha ka file, bad ruh ka jingjrong jong ka sequence na ka bynta baroh ki records ha ka file. |
− | + | ||
|- | |- | ||
| 06:21 | | 06:21 | ||
− | | | + | | Peit thuh ba ka '''FASTA''' format kam batai shai ia u alphabet. |
− | + | ||
|- | |- | ||
| 06:26 | | 06:26 | ||
− | | | + | | Te, ka output kan nym pyni ia ka kum ka '''DNA sequence'''. |
− | + | ||
|- | |- | ||
| 06:31 | | 06:31 | ||
− | | | + | | Ki juh ki synjam lah ban leh biang ban parse iaka '''GenBank''' file. |
− | + | ||
|- | |- | ||
| 06:36 | | 06:36 | ||
− | | | + | | Ban pyni nuksa ngin pyndonkam ia ka '''GenBank''' file kaba ngi lah download mynne na ka database. |
− | + | ||
|- | |- | ||
| 06:43 | | 06:43 | ||
− | | | + | | Nion ia u up-arrow key ban ioh ia ki lines jong ki code kiba ngi lah pyndonkam hashwa. |
− | + | ||
|- | |- | ||
| 06:49 | | 06:49 | ||
− | | | + | | Pynkylla ia ka kyrteng jong ka file sha ka '''sequence.gb '''. |
− | + | ||
|- | |- | ||
| 06:53 | | 06:53 | ||
− | | | + | | Pynkylla ia ka file format sha ka '''genbank.''' |
− | + | ||
|- | |- | ||
| 06:56 | | 06:56 | ||
− | | | + | | Ki code ba sah kin neh kumjuh. |
− | + | ||
|- | |- | ||
| 06:58 | | 06:58 | ||
− | | | + | | Nion ia u '''Enter''' key arsien ban ioh ia ka output. |
− | + | ||
|- | |- | ||
| 07:03 | | 07:03 | ||
− | | | + | | Hangne ruh ka output ka pyni ia ka '''record id''', '''sequence''' bad ka jingjrong jong ka sequence na ka bynta baroh ki records ha ka file. |
− | + | ||
|- | |- | ||
| 07:12 | | 07:12 | ||
− | | | + | | Peit thuh ba ka '''GenBank''' format ka pyntikna ia ka sequence kum ka DNA sequence. |
− | + | ||
|- | |- | ||
| 07:19 | | 07:19 | ||
− | | | + | | Kumjuh ruh, ia ki '''Swiss-prot''' bad ki file '''EMBL''' lah ban parse da kaba pyndonkam ia u juh u code kum haneng. |
− | + | ||
− | + | ||
|- | |- | ||
| 07:27 | | 07:27 | ||
− | | | + | | Lada ka file jong phi ka don uwei u record phi hap type ia ki line harum ban leh '''parsing'''. |
− | + | ||
|- | |- | ||
| 07:34 | | 07:34 | ||
− | | | + | | Hangne, ngin pyndonkam iaka '''FASTA''' file ba lah save mynne, ba tang uwei u record, uta u dei '''insulin.fasta ''' kum ka nuksa. |
− | + | ||
|- | |- | ||
| 07:43 | | 07:43 | ||
− | | | + | | Peit thuh ba ngi lah dep pyndonkam ia ka '''read''' function ha jaka jong '''parse''' function. Nion '''Enter'''. |
− | + | ||
|- | |- | ||
| 07:50 | | 07:50 | ||
− | | | + | | Ka output ka pyni ki contents na ka bynta ka file '''insulin.fasta'''. |
− | + | ||
|- | |- | ||
| 07:55 | | 07:55 | ||
− | | | + | | Ka pyni ia ka sequence kum '''sequence record object'''. |
− | + | ||
|- | |- | ||
| 07:59 | | 07:59 | ||
− | | | + | | Bad kiwei ki jinglong kum '''GI, accession number '''bad '''description'''. |
− | + | ||
|- | |- | ||
| 08:06 | | 08:06 | ||
− | | | + | | Ngi lah ruh ban peit ia ki jinglong ba shimet jong kane ka record kumne harum. |
− | + | ||
|- | |- | ||
| 08:11 | | 08:11 | ||
− | | | + | | Ha ka prompt, type: '''record dot seq'''. Nion '''Enter'''. |
− | + | ||
|- | |- | ||
| 08:18 | | 08:18 | ||
− | | | + | | Ka output ka pyni ia ka sequence ba don ha ka file. |
− | + | ||
|- | |- | ||
| 08:22 | | 08:22 | ||
− | | | + | | Ban peit ia ki identifiers jong kane ka record, type: '''record dot id. Nion '''Enter'''. |
− | + | ||
|- | |- | ||
| 08:29 | | 08:29 | ||
− | | | + | | Ka output ka pyni ia u '''GI''' number bad accession number bad kumta ter ter. |
− | + | ||
|- | |- | ||
| 08:34 | | 08:34 | ||
− | | | + | | Phi lah ban pyndonkam ia u function ba lah ong haneng ban '''parse''' ia ki data files kiba phi kwah. |
− | + | ||
|- | |- | ||
| 08:40 | | 08:40 | ||
− | | | + | | Mynta to ngin batai lyngkot). |
− | + | ||
|- | |- | ||
| 08:42 | | 08:42 | ||
− | | | + | | Ha kane ka tutorial, ngi lah pule ban: download'''FASTA''' bad '''GenBank''' files na ka '''NCBI''' database website bad pyndonkam iaka '''parse''' bad '''read''' functions na ka '''SeqIO''' module. |
− | + | ||
− | ka '''NCBI''' database website bad pyndonkam | + | |
|- | |- | ||
| 08:55 | | 08:55 | ||
− | | | + | | Ban sei ia ki data kum ki '''record id'''s, description bad sequences na '''FASTA''' bad '''GenBank''' files. |
− | + | ||
|- | |- | ||
| 09:03 | | 09:03 | ||
− | | | + | | Mynta, na ka bynta ka assignment- |
− | + | ||
|- | |- | ||
| 09:06 | | 09:06 | ||
− | | | + | | Download ia ki '''FASTA''' files na ka bynta ka nucleotide sequence haka jingjied jong phi na '''NCBI''' database. |
− | + | ||
|- | |- | ||
| 09:13 | | 09:13 | ||
− | | | + | | Pynkylla ia ki file jong ki sequences sha ki '''reverse complement'''s jong ki. |
− | + | ||
|- | |- | ||
| 09:17 | | 09:17 | ||
− | | | + | | Ka assignment ba lah dep jong phi ka dei ban don ki lines of code kumne harum. |
− | + | ||
|- | |- | ||
| 09:22 | | 09:22 | ||
− | | | + | | Pyndonkam ka '''parse''' function ban '''load''' ia nucleotide sequences na ka '''FASTA''' file. |
− | + | ||
|- | |- | ||
| 09:28 | | 09:28 | ||
− | | | + | | Nangta, print ia ka reverse complements da kaba pyndonkam ia ka Sequence object’s built in '''reverse complement''' method. |
− | + | ||
|- | |- | ||
| 09:37 | | 09:37 | ||
− | | | + | | Ka video ha ka link harum ka batai lyngkot ia kane ka spoken-tutorial project. |
− | + | ||
|- | |- | ||
| 09:42 | | 09:42 | ||
− | | | + | | Sngewbha download bad peit ia ka. |
− | + | ||
|- | |- | ||
| 09:44 | | 09:44 | ||
− | | | + | | Ka Spoken Tutorial Project team ka ju pynlong ia ki workshops bad ai ruh ia ki certificates sha kito ba pass ia ka on-line test. |
− | + | ||
|- | |- | ||
| 09:51 | | 09:51 | ||
− | | | + | | Ban tip kham bniah, sngewbha thoh sha ngi. |
− | + | ||
|- | |- | ||
| 09:55 | | 09:55 | ||
− | | | + | | Ia ka Spoken Tutorial Project la bei tyngka da ka NMEICT, MHRD, Sorkar India. |
− | + | ||
|- | |- | ||
| 10:01 | | 10:01 | ||
− | | | + | | Khambun ki jingtip halor kane ka mission ka don ka ha link harum. |
− | + | ||
− | + | ||
|- | |- | ||
| 10:06 | | 10:06 | ||
− | | | + | |ïa kane ka script la pynkylla sha ka Ktien Khasi da u Yuwanki Kharlukhi na Shillong,bad ma nga u Hezekiah Lyngdoh ngan pynkut noh. khublei ba phi la ïasnoh lang. |
− | + | ||
− | + | ||
|} | |} |
Latest revision as of 15:18, 31 May 2018
|
|
---|---|
00:01 | Khublei baroh. Ngi pdiang sngewbha ia phi sha kane ka tutorial halor ka Parsing Data. |
00:06 | Ha kane ka tutorial, ngin pule kumno ban download ia ki FASTA bad GenBank files na ka NCBI database website. |
00:14 | Bad ban Parseia ki data files da kaba pyndonkam ia ki functions ha ka Sequence Input/Output module. |
00:19 | Ban bud ia kane ka tutorial, phi dei ban tip bha ia ka undergraduate biochemistry lane bioinformatics. |
00:26 | bad ka basic Python programming. |
00:30 | Peit ia ka Python tutorials ha ka link ba la ai. |
00:34 | Ban record ia kane ka tutorial, nga pyndonkam da ka : * Ubuntu OS version 14.10 |
00:40 | Python version 2.7.8 |
00:44 | Ipython interpretor version 2.3.0 |
00:48 | Biopython version 1.64 bad * Mozilla Firefox browser 35.0. |
00:56 | Ki scientific data jong ka biology ju store barabor ia ki ha ka text file kum FASTA, GenBank, EMBL, Swiss-Prot kumta ter ter. |
01:07 | Ia ki data files lah ban download na ka database websites. |
01:12 | Plie ia ka website link ba lah ai harum, da uno uno u web browser. |
01:17 | Ka web-page ka plie. |
01:19 | To ngin download iaka FASTA bad GenBank files na ka bynta ka human insulin gene. |
01:25 | Ha ka search box, type: "human insulin", click ha Search button. |
01:31 | Ka web-page ka pyni shibun tylli ki flies na ka bynta ka human insulin gene. |
01:35 | Ban pyni nuksa, ngan jied 4 tylli ki files kiba kyrteng “Homo sapiens Insulin mRNA”. |
01:43 | Ngan jied ia ki files ba duna ia ka 500 base pairs. |
01:48 | Click ha ka check-box ban jied ia ka file ban download. |
01:56 | Wanrah ia u cursor sha “Send to” option, kaba don ha kyndong khlieh ka mon jong ka page. |
02:02 | Click ha i selection button barit ba don u khnam ba kdew shapoh, ba don hajan ka “Send to” button. |
02:09 | Hapoh ka heading “Choose destination”, click ha File option. |
02:13 | Phi lah ban save ia kane ka file ha kano kano ka format, kiba don hapoh format drop-down list box. |
02:21 | Jied FASTA na ki options ba ai hapoh. |
02:25 | Nangta sa click ha Create file option. |
02:29 | Ka dialog-box kan sa mih ha ka screen. |
02:32 | Jied Open with, click ha OK. |
02:36 | Ka file ka plie ha ka text editor. |
02:39 | Kane ka file ka pyni 4 tylli ki records, namar ngi la jied 4 tylli ki files ban download. |
02:46 | U line banyngkong ha kawei pa kawei ka record u dei u identifier line. |
02:50 | U sdang da u “greater than >)” symbol. |
02:53 | Nangta la pynbud da usequence. |
02:56 | Save ia ka file ha homefolder jong phi kum ka“sequence.fasta'”. |
03:01 | Khang ia ka text editor. |
03:03 | Pynbud ki juh ki syn jam kum haneng ban download ia ki files ha GenBank format |
03:08 | na ka bynta ki files ba la jied hashwa. |
03:12 | Jied ia ka file format kum GenBank. |
03:16 | Shna ia ka file. Plie da u text editor. |
03:21 | Peit thuh ba ka sequence file ha GenBank format ka kham bun features ban ia ka FASTA file. |
03:27 | Save ia ka file kum "sequence.gb" ha ka home folder. Khang noh u text editor. |
03:34 | Ban peit nuksa, ngi donkam ia ka FASTA file ba don record tang iwei. |
03:39 | Na ka bynta kane, pynkhuid ia ki jingjied ba hashwa da kaba click biang ha ki check box. |
03:48 | Mynta, jied ia ka file “Human insulin gene complete cds”. |
03:54 | Click ha ka check-box. |
03:57 | Bad sa bud ia ki rukom kumba la pyni mynne ban save ia ka file ha ka home folder. |
04:01 | Save ia ka file kum "insulin.fasta". |
04:08 | Ia ki Biological data ba lah store ha kine ki files lah ban sei bad pynkylla da kaba pyndonkam da ka Biopython libraries. |
04:16 | Khang ia u text-editor. |
04:19 | Ban sei ia ki data na data files la khot Parsing. |
04:23 | Jan ia baroh ki file formats lah ban parsed da kaba pyndonkam functions kiba don ha SeqIO module. |
04:30 | Ki function jong ka SeqIO module kiba ju kham pyndonkam bha baroh ki long: parse, read, write bad convert. |
04:38 | Plie ia ka terminal da kaba nion lang ia Ctrl, Alt bad t keys. |
04:44 | Plie ia ka Ipython da kaba type "ipython" ha ka prompt. Nion Enter. |
04:51 | Nangta, import "SeqIO" module na Bio package. |
04:56 | Ha ka prompt, type: from Bio import SeqIO. Nion Enter. |
05:04 | Ngin ia sdang da ka function kaba kham donkam ka “parse”. |
05:07 | Na ka bynta ka nuksa, ngan pyndonkam ka FASTA file ka ba don bun records kaba ngi download hashwa na ka database. |
05:17 | Na ba bynta ka FASTA parsing ba kham suk, type kumne harum ha ka prompt. |
05:22 | Hangne, ngi pyndonkam ia ka parse function ban read ia ki contents jong ka sequence.fasta file. |
05:30 | Na ka bynta ka output, print record id, sequence kaba don ha ka record bad ruh ia ka jingjrong jong ka sequence. |
05:41 | Peit bha ruh ba ia ka parse function la pyndonkam ban read sequence data kum Sequence record objects. |
05:48 | La ju pyndonkam barabor bad ka for loop. |
05:52 | Ka pdiang ar tylli ki arguments, kaba nyngkong dei ka file name ban read ia ka data. |
05:59 | Ka ba ar ka batai shai ia ka format jong ka file. |
06:02 | Nion ia u Enter key ar sien ban ioh ia ka output. |
06:07 | Ka output ka pyni ia u identifier line, nangta bud sa u sequence uba don ha ka file, bad ruh ka jingjrong jong ka sequence na ka bynta baroh ki records ha ka file. |
06:21 | Peit thuh ba ka FASTA format kam batai shai ia u alphabet. |
06:26 | Te, ka output kan nym pyni ia ka kum ka DNA sequence. |
06:31 | Ki juh ki synjam lah ban leh biang ban parse iaka GenBank file. |
06:36 | Ban pyni nuksa ngin pyndonkam ia ka GenBank file kaba ngi lah download mynne na ka database. |
06:43 | Nion ia u up-arrow key ban ioh ia ki lines jong ki code kiba ngi lah pyndonkam hashwa. |
06:49 | Pynkylla ia ka kyrteng jong ka file sha ka sequence.gb . |
06:53 | Pynkylla ia ka file format sha ka genbank. |
06:56 | Ki code ba sah kin neh kumjuh. |
06:58 | Nion ia u Enter key arsien ban ioh ia ka output. |
07:03 | Hangne ruh ka output ka pyni ia ka record id, sequence bad ka jingjrong jong ka sequence na ka bynta baroh ki records ha ka file. |
07:12 | Peit thuh ba ka GenBank format ka pyntikna ia ka sequence kum ka DNA sequence. |
07:19 | Kumjuh ruh, ia ki Swiss-prot bad ki file EMBL lah ban parse da kaba pyndonkam ia u juh u code kum haneng. |
07:27 | Lada ka file jong phi ka don uwei u record phi hap type ia ki line harum ban leh parsing. |
07:34 | Hangne, ngin pyndonkam iaka FASTA file ba lah save mynne, ba tang uwei u record, uta u dei insulin.fasta kum ka nuksa. |
07:43 | Peit thuh ba ngi lah dep pyndonkam ia ka read function ha jaka jong parse function. Nion Enter. |
07:50 | Ka output ka pyni ki contents na ka bynta ka file insulin.fasta. |
07:55 | Ka pyni ia ka sequence kum sequence record object. |
07:59 | Bad kiwei ki jinglong kum GI, accession number bad description. |
08:06 | Ngi lah ruh ban peit ia ki jinglong ba shimet jong kane ka record kumne harum. |
08:11 | Ha ka prompt, type: record dot seq. Nion Enter. |
08:18 | Ka output ka pyni ia ka sequence ba don ha ka file. |
08:22 | Ban peit ia ki identifiers jong kane ka record, type: record dot id. Nion Enter. |
08:29 | Ka output ka pyni ia u GI number bad accession number bad kumta ter ter. |
08:34 | Phi lah ban pyndonkam ia u function ba lah ong haneng ban parse ia ki data files kiba phi kwah. |
08:40 | Mynta to ngin batai lyngkot). |
08:42 | Ha kane ka tutorial, ngi lah pule ban: downloadFASTA bad GenBank files na ka NCBI database website bad pyndonkam iaka parse bad read functions na ka SeqIO module. |
08:55 | Ban sei ia ki data kum ki record ids, description bad sequences na FASTA bad GenBank files. |
09:03 | Mynta, na ka bynta ka assignment- |
09:06 | Download ia ki FASTA files na ka bynta ka nucleotide sequence haka jingjied jong phi na NCBI database. |
09:13 | Pynkylla ia ki file jong ki sequences sha ki reverse complements jong ki. |
09:17 | Ka assignment ba lah dep jong phi ka dei ban don ki lines of code kumne harum. |
09:22 | Pyndonkam ka parse function ban load ia nucleotide sequences na ka FASTA file. |
09:28 | Nangta, print ia ka reverse complements da kaba pyndonkam ia ka Sequence object’s built in reverse complement method. |
09:37 | Ka video ha ka link harum ka batai lyngkot ia kane ka spoken-tutorial project. |
09:42 | Sngewbha download bad peit ia ka. |
09:44 | Ka Spoken Tutorial Project team ka ju pynlong ia ki workshops bad ai ruh ia ki certificates sha kito ba pass ia ka on-line test. |
09:51 | Ban tip kham bniah, sngewbha thoh sha ngi. |
09:55 | Ia ka Spoken Tutorial Project la bei tyngka da ka NMEICT, MHRD, Sorkar India. |
10:01 | Khambun ki jingtip halor kane ka mission ka don ka ha link harum. |
10:06 | ïa kane ka script la pynkylla sha ka Ktien Khasi da u Yuwanki Kharlukhi na Shillong,bad ma nga u Hezekiah Lyngdoh ngan pynkut noh. khublei ba phi la ïasnoh lang. |