shell-script-pt
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [shell-script] usando shell em sequencias de DNA


From: Marcio Gil
Subject: RE: [shell-script] usando shell em sequencias de DNA
Date: Sun, 14 Mar 2010 19:31:15 -0300

Resolvendo em uma linha com sed:

$ sed -n '/>.*/{N;s/.*\n\(...\).*/\1/p}' dna.txt 
gtg
atg
atg
atg 

ou, se você quiser manter o identificador da sequência:

$ sed -n '/>.*/{N;s/\(.*\)\n\(...\).*/\1: \2/p}' dna.txt 
>fig|674.13.peg.56: gtg
>fig|674.13.peg.57: atg
>fig|674.13.peg.58: atg
>fig|674.13.peg.59: atg

Nota: Eu criei o arquivo dna.txt com o exemplo que você passou.

> -----Original Message-----
> From: Graciela
> 
> Trabalho com bioinformática e queria tirar uma dúvida com vocês
> Tou com essas sequências de DNA, cada '>' representa uma sequência

> Quero verificar apenas as três primeira letras de cada 
> sequência, por shel script tem como?
> 
> >fig|674.13.peg.56
> gtgacgttaggtgacaatgttgttgttgcctcgggagccgttgtgacaaagagttttggt
> agcaatgttgtcattggtggcaacccagcacgagtgttaaaagaaatcgagtaa
> >fig|674.13.peg.57
> atgtcaatatacccaactgcaccccaaggtgtgtcagttgatgccaatatgaaaatgata
> aagcaaaagtttggccgcattgttggcccaactgatttctattggttttatgaacaggtt
> agaaataaaggcccgtgggattataaacaaagtggcccgtttgaaaatttcggtaatttt
> cattatggggcagttggttatgcaggtggcattccagaagaagttttgttaagagctgct
> ggttgtgctcaaatgaaggcgggtacttcagctgatagctggggtagttgttggagcgat
> caaccatttggagatgatccaaacgatcaacgttatatagcactaggtattgaatatgca
> aaaagtaaaggctattag
> >fig|674.13.peg.58
> atgaacgtggataaagcgaaaaaacgcatattgaaacgagtacaaagaggtttcaaaggt
> tatccgcaaatatctttagagtatttcgggaaaacgacagattttgctaccgaagtagtc
> atcacttttgttcctgaagaaaacgctgaagcgcaaattcagagatttacgagtgataag
> gatgttcgtgaagatgaagcaattcaatcggtattgttgaaaattattgagcgagccgac
> gctgctacggttttagaaagccgagaagtatcagtttgttaa
> >fig|674.13.peg.59
> atgccagttgttggacaccttcaagtggtgacaaattgggatggtataccgatttgtatt
> atagaaatgacgtctgtaaccaagcaaaaatacagcgaagtaacaccggagttcgcggct
> ttagaaggtgagggtgacaagactttagcttggtggcgggaggcgcattggaacttcttc
> tcaaaagagtgcgtagagttagaaatatcaccgtcggaagatatgttgctggttctagag
> cagttcaaagtggtgtacaaataa
> 
> 



reply via email to

[Prev in Thread] Current Thread [Next in Thread]