CuGenDBv2

CcUC10G193460 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G193460
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: NADH-ubiquinone oxidoreductase chain 5
Genome location: CicolChr10:12378114..12390624
RNA-Seq Expression: CcUC10G193460
Synteny: CcUC10G193460
Gene Ontology terms:
GO:0015990 - electron transport coupled proton transport (biological process)
GO:0005747 - mitochondrial respiratory chain complex I (cellular component)
GO:0003954 - NADH dehydrogenase activity (molecular function)
InterPro domains: NA
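The genome location above follows a `sequence:start..end` pattern. A minimal sketch of parsing it and computing the span is shown below; it assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CicolChr10:12378114..12390624")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 12,511 bp on CicolChr10.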


Homology
GenBank top hits (hit | e-value | % identity | alignment)
AXU98945.1 NADH dehydrogenase subunit 5 [Brassica juncea] | 3.0e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

YP_002608190.2 NADH dehydrogenase subunit 5 [Carica papaya] | 3.0e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

YP_009045740.1 NADH dehydrogenase subunit 5 [Batis maritima] | 3.0e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

YP_009466021.1 NADH dehydrogenase subunit 5 [Arabis alpina] | 3.0e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

YP_009659111.1 NADH dehydrogenase subunit 5 [Capsella rubella] | 3.0e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

TrEMBL top hits (hit | e-value | % identity | alignment)
A0A224AT72 NADH-ubiquinone oxidoreductase chain 5 | 1.5e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

A0A3B1EW93 NADH-ubiquinone oxidoreductase chain 5 | 1.5e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

A0A7U3TGP8 NADH dehydrogenase subunit 5 | 1.5e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

A7KNH7 NADH-ubiquinone oxidoreductase chain 5 (Fragment) | 1.5e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

G4XYW5 NADH-ubiquinone oxidoreductase chain 5 | 1.5e-21 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

SwissProt top hits (hit | e-value | % identity | alignment)
P10330 NADH-ubiquinone oxidoreductase chain 5 | 2.9e-22 | 61.62
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAF+AYNVN VADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

P26849 NADH-ubiquinone oxidoreductase chain 5 | 3.7e-14 | 46.08
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLF------------VIDSIASSINAGSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQL
        P+     G+FVAY+VN V +      K   F              D + +   A    +SFLRFGYEVSF+ALDKGAIEILGPYGISYT R++A++IS++
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLF------------VIDSIASSINAGSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQL

Query:  QN
        Q+
Subjt:  QN

P29388 NADH-ubiquinone oxidoreductase chain 5 | 8.9e-24 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

Q36284 NADH-ubiquinone oxidoreductase chain 5 (Fragment) | 4.1e-21 | 59.6
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GA +AYNVN VADQFQ  F+   F  + + S  N             ++SFLRFGY VSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

Q37680 NADH-ubiquinone oxidoreductase chain 5 | 4.1e-21 | 59.6
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GA +AYNVN VADQFQ  F+   F  + + S  N             ++SFLRFGY VSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

Arabidopsis top hits (hit | e-value | % identity | alignment)
ATMG00030.1 hypothetical protein | 1.3e-09 | 65.85
Query:  SRFNGDKNFVRESELGYGFPIRDPWITDGISPWLFASSSVL
        ++F+  K     +ELGYGFPI DPWITDGISPW FAS SVL
Subjt:  SRFNGDKNFVRESELGYGFPIRDPWITDGISPWLFASSSVL

ATMG00060.1 NADH dehydrogenase subunit 5C | 2.8e-25 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

ATMG00513.1 NADH dehydrogenase 5A | 2.8e-25 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN

ATMG00665.1 NADH dehydrogenase 5B | 2.8e-25 | 63.64
Query:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
        P+     GAFVAYNVNPVADQFQ  F+   F  + + S  N             ++SFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQ+
Subjt:  PVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINA---------GSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQLQN
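The % identity column can be related to the alignment midline, the row between Query and Subjt. A minimal sketch follows, assuming the usual BLAST-style convention that the midline shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise, and that identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is a hypothetical helper. The example uses the ATMG00030.1 hit listed above.

```python
def percent_identity(query, midline):
    """Percent identity from a BLAST-style midline.

    ASSUMPTION: letters in the midline mark identical residues, while
    '+' and spaces mark non-identical columns. The alignment length is
    taken from the aligned query row (gaps included).
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)


# Query row and midline of the ATMG00030.1 hit, copied from the record.
query = "SRFNGDKNFVRESELGYGFPIRDPWITDGISPWLFASSSVL"
midline = "++F+  K     +ELGYGFPI DPWITDGISPW FAS SVL"

print(round(percent_identity(query, midline), 2))
```

For this hit the midline contains 27 identical residues over 41 aligned columns, which agrees with the 65.85 reported in the table.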


Sequences
CDS sequence
ATGGGCTCTCTCTCTGCCGAGCCTCTTGGAGTGCTTGCTAACGATATCTTTTTCTTCCTCATTCTGATTTTTGTACTTCTCTCTCGCTTTAACGGCGACAAAAACTTTGT
CCGAGAGTCGGAGCTTGGATACGGTTTCCCAATCAGAGATCCATGGATCACAGACGGTATCTCCCCATGGCTTTTCGCCTCTTCAAGCGTCCTGCACAGGGACTTAGGCA
AGGAAAGAGAGTTTTGCTATCGGAAGAGACTAGAGGCTGATGGGCAAGACCCCTCCAGCCGCAATGGTGTGGGGTCCCTTGCTTCTTCTTTGGGCCATTGGCCTGTTATG
ATCTCTTGGCCTGTTGCTTCTGCTTCACAACATGAAGATCCAATAGTGTACAAGAGGGATAGTACCAAGGAATACATCATGGCTTACTTCAAAAGGGTAGGTGGTAACAC
CAAATCAGCTATAGGATTGGCAAAAACGAAGGGGGTTGACACAGCGATCAAGACAAACAAACAAATAGGGCCGGTTGAGAGTACGAGATTACTGAATGAAGACTCATCAA
TTGAGATGGGCGGAAGCCCTGCTAGCGAAGGAATGCATCCAATAGTGTTCTGTCTCATTGGTCCTCTCATGCTAGAATTGGCTCAACAATCCACTTTGTTCTCCCGGAAA
TCACTTCAATTAGAATACATGACTTATGCAGCAGTGAGAATGAGGATCACTTCGGAACTCGTTGAGAATGGGAGTTTTAGGGCTGGATCTAAAAGCTCTCCAACGTGTTG
CGAATCCTTCGGCCGTGAGAGGCCACGTGATGATTCGACGTTCGTGGTAGGCGTAGGAGATGGGCAAAGCAGTAGCGAAGCTCTGGTGGTGATGCAAGTGCGCGGTGAGC
GTAGTAGCCCCCTCGGGCATGCTCTTTGTACAACCAAGCGAAGAGCTAAGTCGGATCTGAGGCTCTTTACTAGGATGAACAAGCGAGGAACGAGTGACCACAAGCTTCGC
TTAAGCGCTCGACATGAAGTTAGCTCAAAACATGTTCATCATGTGGCCGATCGAAGCGCACCTGCACCTGTCTTTGTGAAGGGTGAAGGTGCTTTTGTTGCGTATAATGT
AAATCCCGTAGCGGATCAATTCCAAGAGCCTTTTAAACTAGTACTTTTTGTAATCGACTCTATAGCTTCTTCAATAAACGCTGGTTCTTCGATCAAATCGTTCCTGCGTT
TTGGATATGAAGTCTCATTTGAAGCTTTAGACAAAGGTGCTATTGAGATATTGGGCCCTTATGGTATCTCGTACACATTCCGACGATTGGCCGAGCGAATAAGTCAACTT
CAAAACACAATACAAGAGAGAGATGGCAATAGAGGACATGAACTCGCAGCTTCACCAGAAGCAACTAGTCACCTCCGCCGAAAGATGCTTAGCCAGAAGCGACGAAAGGG
GAAGCTCGGTGGTATAGTAGTCTCAATAGCAAAGCCGCCTTCGGCCCTTCGTTTACAAAGTGCATTACCGCTACGGAAGTTGACCCAATACCAAGACGTTAGAGAGAGAA
GGAAAACAATAAACCCCGGATGTATTCTCCTGAGCGACGAACCAGATTACCAAAGTCACGCTCTTGCCCTAAGAAGTCGGCCTGGTTGCATACAGACCTTAAAAGCTCGA
GAGACCTACTTGCCACGTGGCAAGAGGCCATTAGCCCCTGAAAACAGTTCCTTCTTCGCATCTCTCAACTCAAACAAAGAGGAGTCTGGTCTAACTTCTTGCCCGGCCTG
CCCGTCTGCTGCTAGAGTCGCTCCAGGGCCTGGAAAGCGTGAGCAAGTTCTCTTCTCTCGCTCTCGGTACAAACTTGGGACACCAAACAACTGGAATCGGTCGGGAGTTG
CTGATCCTGAGGCTAAGAAAGGGATGGAGTATGCAGCCGAACAACCGCAGGGGCTTCCAGCAGTGATAGCTGCCCCAGCCTATCCATCAGAACATAATATAGGAAAGTCT
TCCCTCTCCCTCATAAACTCGGTCGAGCGCATTCGCGCTTTCAGACAACGGTCGAGGATTAGGAGGGGAGGCTGCAAGCACTTCGGGTGGAGCAGCAAGTACTCATAG
mRNA sequence
ATGGGCTCTCTCTCTGCCGAGCCTCTTGGAGTGCTTGCTAACGATATCTTTTTCTTCCTCATTCTGATTTTTGTACTTCTCTCTCGCTTTAACGGCGACAAAAACTTTGT
CCGAGAGTCGGAGCTTGGATACGGTTTCCCAATCAGAGATCCATGGATCACAGACGGTATCTCCCCATGGCTTTTCGCCTCTTCAAGCGTCCTGCACAGGGACTTAGGCA
AGGAAAGAGAGTTTTGCTATCGGAAGAGACTAGAGGCTGATGGGCAAGACCCCTCCAGCCGCAATGGTGTGGGGTCCCTTGCTTCTTCTTTGGGCCATTGGCCTGTTATG
ATCTCTTGGCCTGTTGCTTCTGCTTCACAACATGAAGATCCAATAGTGTACAAGAGGGATAGTACCAAGGAATACATCATGGCTTACTTCAAAAGGGTAGGTGGTAACAC
CAAATCAGCTATAGGATTGGCAAAAACGAAGGGGGTTGACACAGCGATCAAGACAAACAAACAAATAGGGCCGGTTGAGAGTACGAGATTACTGAATGAAGACTCATCAA
TTGAGATGGGCGGAAGCCCTGCTAGCGAAGGAATGCATCCAATAGTGTTCTGTCTCATTGGTCCTCTCATGCTAGAATTGGCTCAACAATCCACTTTGTTCTCCCGGAAA
TCACTTCAATTAGAATACATGACTTATGCAGCAGTGAGAATGAGGATCACTTCGGAACTCGTTGAGAATGGGAGTTTTAGGGCTGGATCTAAAAGCTCTCCAACGTGTTG
CGAATCCTTCGGCCGTGAGAGGCCACGTGATGATTCGACGTTCGTGGTAGGCGTAGGAGATGGGCAAAGCAGTAGCGAAGCTCTGGTGGTGATGCAAGTGCGCGGTGAGC
GTAGTAGCCCCCTCGGGCATGCTCTTTGTACAACCAAGCGAAGAGCTAAGTCGGATCTGAGGCTCTTTACTAGGATGAACAAGCGAGGAACGAGTGACCACAAGCTTCGC
TTAAGCGCTCGACATGAAGTTAGCTCAAAACATGTTCATCATGTGGCCGATCGAAGCGCACCTGCACCTGTCTTTGTGAAGGGTGAAGGTGCTTTTGTTGCGTATAATGT
AAATCCCGTAGCGGATCAATTCCAAGAGCCTTTTAAACTAGTACTTTTTGTAATCGACTCTATAGCTTCTTCAATAAACGCTGGTTCTTCGATCAAATCGTTCCTGCGTT
TTGGATATGAAGTCTCATTTGAAGCTTTAGACAAAGGTGCTATTGAGATATTGGGCCCTTATGGTATCTCGTACACATTCCGACGATTGGCCGAGCGAATAAGTCAACTT
CAAAACACAATACAAGAGAGAGATGGCAATAGAGGACATGAACTCGCAGCTTCACCAGAAGCAACTAGTCACCTCCGCCGAAAGATGCTTAGCCAGAAGCGACGAAAGGG
GAAGCTCGGTGGTATAGTAGTCTCAATAGCAAAGCCGCCTTCGGCCCTTCGTTTACAAAGTGCATTACCGCTACGGAAGTTGACCCAATACCAAGACGTTAGAGAGAGAA
GGAAAACAATAAACCCCGGATGTATTCTCCTGAGCGACGAACCAGATTACCAAAGTCACGCTCTTGCCCTAAGAAGTCGGCCTGGTTGCATACAGACCTTAAAAGCTCGA
GAGACCTACTTGCCACGTGGCAAGAGGCCATTAGCCCCTGAAAACAGTTCCTTCTTCGCATCTCTCAACTCAAACAAAGAGGAGTCTGGTCTAACTTCTTGCCCGGCCTG
CCCGTCTGCTGCTAGAGTCGCTCCAGGGCCTGGAAAGCGTGAGCAAGTTCTCTTCTCTCGCTCTCGGTACAAACTTGGGACACCAAACAACTGGAATCGGTCGGGAGTTG
CTGATCCTGAGGCTAAGAAAGGGATGGAGTATGCAGCCGAACAACCGCAGGGGCTTCCAGCAGTGATAGCTGCCCCAGCCTATCCATCAGAACATAATATAGGAAAGTCT
TCCCTCTCCCTCATAAACTCGGTCGAGCGCATTCGCGCTTTCAGACAACGGTCGAGGATTAGGAGGGGAGGCTGCAAGCACTTCGGGTGGAGCAGCAAGTACTCATAG
Protein sequence
MGSLSAEPLGVLANDIFFFLILIFVLLSRFNGDKNFVRESELGYGFPIRDPWITDGISPWLFASSSVLHRDLGKEREFCYRKRLEADGQDPSSRNGVGSLASSLGHWPVM
ISWPVASASQHEDPIVYKRDSTKEYIMAYFKRVGGNTKSAIGLAKTKGVDTAIKTNKQIGPVESTRLLNEDSSIEMGGSPASEGMHPIVFCLIGPLMLELAQQSTLFSRK
SLQLEYMTYAAVRMRITSELVENGSFRAGSKSSPTCCESFGRERPRDDSTFVVGVGDGQSSSEALVVMQVRGERSSPLGHALCTTKRRAKSDLRLFTRMNKRGTSDHKLR
LSARHEVSSKHVHHVADRSAPAPVFVKGEGAFVAYNVNPVADQFQEPFKLVLFVIDSIASSINAGSSIKSFLRFGYEVSFEALDKGAIEILGPYGISYTFRRLAERISQL
QNTIQERDGNRGHELAASPEATSHLRRKMLSQKRRKGKLGGIVVSIAKPPSALRLQSALPLRKLTQYQDVRERRKTINPGCILLSDEPDYQSHALALRSRPGCIQTLKAR
ETYLPRGKRPLAPENSSFFASLNSNKEESGLTSCPACPSAARVAPGPGKREQVLFSRSRYKLGTPNNWNRSGVADPEAKKGMEYAAEQPQGLPAVIAAPAYPSEHNIGKS
SLSLINSVERIRAFRQRSRIRRGGCKHFGWSSKYS
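The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code; the page does not say which translation table was used, and the function name `translate` is a hypothetical helper. Only the first 30 nucleotides of the CDS are used here to keep the example small.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGGGCTCTCTCTCTGCCGAGCCTCTTGGA"
print(translate(cds_start))
```

Under this assumption the first ten codons translate to MGSLSAEPLG, the same residues that open the protein sequence listed above.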