CuGenDBv2

CmoCh11G013060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh11G013060
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr11:8955020..8956039
RNA-Seq Expression: CmoCh11G013060
Synteny: CmoCh11G013060
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0110165 - cellular anatomical structure (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR005162 - Retrotransposon gag domain
    IPR036875 - Zinc finger, CCHC-type superfamily
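
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses that string and computes the span length, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Cmo_Chr11:8955020..8956039")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span comes out at 1020 bp.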


Homology
GenBank top hits | e value | % identity
XP_022926217.1 uncharacterized protein LOC111433397 [Cucurbita moschata] | 5.9e-79 | 98.06
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKET IKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLA TEEKQTEK VLGLNPKTRRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata] | 3.4e-167 | 95.81
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKE RIKKQQEFN LTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEK VLGLNPKTRRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV
        EAFNPKTYEEALRTAKALEEPPEEKKTEPTVA GRKRPVEVDT EFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGH 
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV

Query:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP
        GHVARTCPTKSPGIPREPLRGPVIREPTLQT PQTKAYVTTS EAGTSGTVVTGTLSILGHFALTLFDSGS HSFVA PF+KQAGFVIEPLMHAL VGTP
Subjt:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP

Query:  AGVDLVTKDK
        AGVDLVTKD+
Subjt:  AGVDLVTKDK
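
The percent-identity column can be related to the alignment blocks by counting midline characters. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides identities by the aligned length. The function name `midline_stats` is hypothetical, and the input is the short final block shown just above.

```python
def midline_stats(blocks: list[tuple[str, str]]) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) over alignment blocks.

    Each block is a (query_line, midline) pair. The midline is padded to the
    query length in case trailing spaces were stripped during extraction.
    """
    identities = positives = length = 0
    for query, midline in blocks:
        midline = midline.ljust(len(query))
        block_identities = sum(ch.isalpha() for ch in midline)
        identities += block_identities
        positives += block_identities + midline.count("+")
        length += len(query)
    return identities, positives, length


# The last alignment block of the hit above: query line and midline.
blocks = [("AGVDLVTKDK", "AGVDLVTKD+")]
identities, positives, length = midline_stats(blocks)
percent_identity = 100.0 * identities / length
print(identities, positives, length, f"{percent_identity:.2f}")
```

For this single block the count is 9 identities and 10 positives over 10 positions. Applied to the two blocks of the first GenBank hit, the same count gives 152 identities over 155 positions, which rounds to the 98.06 listed for that hit.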

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata] | 7.6e-127 | 91.05
Query:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        GDRTVDQYDQDFMRLRRFAPSL DT+EKQTEK VLGLN KTRR+LEAFNPKTYEEALRTAKALE+PPEEKKTEPTV T RKRPVEVDTTEFQPP QRPRY
Subjt:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

Query:  QSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGT
        Q R P+PPPI RYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGH GHVARTCPTKSPGIPREPLRGPVIREPTLQT PQTKAYVTTSKEAGTSGTVVTGT
Subjt:  QSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGT

Query:  LSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTPAGVDLVTKDKSK
        LSILGHFALTLF+S S HSFVALPFVKQAGFV+EPLMHAL VGTPAGVDLVTK++ K
Subjt:  LSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTPAGVDLVTKDKSK

XP_022973318.1 uncharacterized protein LOC111471874 [Cucurbita maxima] | 1.7e-81 | 61.86
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSIN G GITTWE FKEAFLKYYYPKETRIKKQQEFNHLTQGDR VDQYDQ+FMRLRRFA SLADTEEKQ    +LGLNPK+ RML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV
        EAFNPKTYE+ALRTAKALEEP EEKKTEPTV TGRKRPVEV+TT+    S+    +   P                           LA  G   I G+ 
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV

Query:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP
               P +       P + P  R   +Q RP     V   +     GT    TLSILGHFALTLFDSGS HSFV+LPFVKQAGFV+EPL+H L VGTP
Subjt:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP

Query:  AGVDLVTKDKSK
        AGVDLVTK++ K
Subjt:  AGVDLVTKDKSK

XP_023522446.1 uncharacterized protein LOC111786377 [Cucurbita pepo subsp. pepo] | 1.8e-83 | 53.07
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAE+WW DNK  +NP GG   WE FKEAFLK YYPK  R+K+QQEF HL QG  TV++Y+++F +L+RFAPS+ DTEEK TEK VLGL P+ RRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQR--------PRYQSRPPA------PPPIGRYLAMEKPLCRNCGKQHVGR
        EAFNPKTYEEALRTAKALE+P +EK+ E  V  G+K P E   ++  PP  R        PR+  R P       P          +  C  CG+ H GR
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQR--------PRYQSRPPA------PPPIGRYLAMEKPLCRNCGKQHVGR

Query:  CLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGF
        C+AGS  CY CG  GH+A  C   +      P R     +     R Q +AYV+TSK+ G S  VVTGTLSILGHF  TLFDS S HSF+++PFV QAGF
Subjt:  CLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGF

Query:  VIEPLMHALLVGTPAGVDLVTKDKSK
         +EPL+H + V TP GVDLV++ + K
Subjt:  VIEPLMHALLVGTPAGVDLVTKDKSK

TrEMBL top hits | e value | % identity
A0A6J1EDX9 uncharacterized protein LOC111433397 | 2.9e-79 | 98.06
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKET IKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLA TEEKQTEK VLGLNPKTRRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

A0A6J1EYH9 Reverse transcriptase | 1.6e-167 | 95.81
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKE RIKKQQEFN LTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEK VLGLNPKTRRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV
        EAFNPKTYEEALRTAKALEEPPEEKKTEPTVA GRKRPVEVDT EFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGH 
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV

Query:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP
        GHVARTCPTKSPGIPREPLRGPVIREPTLQT PQTKAYVTTS EAGTSGTVVTGTLSILGHFALTLFDSGS HSFVA PF+KQAGFVIEPLMHAL VGTP
Subjt:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP

Query:  AGVDLVTKDK
        AGVDLVTKD+
Subjt:  AGVDLVTKDK

A0A6J1FL46 uncharacterized protein LOC111445192 | 2.1e-77 | 95.48
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSINP GGITTWETFKEAFLKYYYPK+TRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRF PSLADTEEKQTEK VLGLNPKTRRML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        EAFNPKT EEALRTAKALEEPPEEKKTEPTV TGRKRPVEVDTTEFQPP QRPRY
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

A0A6J1GK52 Reverse transcriptase | 3.7e-127 | 91.05
Query:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        GDRTVDQYDQDFMRLRRFAPSL DT+EKQTEK VLGLN KTRR+LEAFNPKTYEEALRTAKALE+PPEEKKTEPTV T RKRPVEVDTTEFQPP QRPRY
Subjt:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

Query:  QSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGT
        Q R P+PPPI RYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGH GHVARTCPTKSPGIPREPLRGPVIREPTLQT PQTKAYVTTSKEAGTSGTVVTGT
Subjt:  QSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGT

Query:  LSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTPAGVDLVTKDKSK
        LSILGHFALTLF+S S HSFVALPFVKQAGFV+EPLMHAL VGTPAGVDLVTK++ K
Subjt:  LSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTPAGVDLVTKDKSK

A0A6J1ICP5 uncharacterized protein LOC111471874 | 8.1e-82 | 61.86
Query:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML
        VLQKDAEVWWSDNKQSIN G GITTWE FKEAFLKYYYPKETRIKKQQEFNHLTQGDR VDQYDQ+FMRLRRFA SLADTEEKQ    +LGLNPK+ RML
Subjt:  VLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRML

Query:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV
        EAFNPKTYE+ALRTAKALEEP EEKKTEPTV TGRKRPVEV+TT+    S+    +   P                           LA  G   I G+ 
Subjt:  EAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHV

Query:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP
               P +       P + P  R   +Q RP     V   +     GT    TLSILGHFALTLFDSGS HSFV+LPFVKQAGFV+EPL+H L VGTP
Subjt:  GHVARTCPTKSPGIPREPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTP

Query:  AGVDLVTKDKSK
        AGVDLVTK++ K
Subjt:  AGVDLVTKDKSK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTGCGACATAGTGTTACAGAAGGATGCAGAAGTGTGGTGGTCGGATAATAAACAGAGCATCAACCCGGGTGGGGGAATTACAACATGGGAGACCTTTAAGGAAGCCTT
TCTAAAATATTATTATCCAAAGGAAACCCGTATAAAGAAACAGCAAGAGTTTAACCACTTAACCCAAGGTGATCGCACGGTGGATCAGTACGATCAGGACTTCATGAGAT
TGAGAAGGTTTGCACCGTCTTTAGCCGACACTGAAGAGAAACAGACAGAAAAAATTGTGTTAGGATTGAATCCGAAAACCCGCCGCATGTTAGAGGCCTTTAACCCAAAA
ACCTATGAAGAGGCCCTAAGGACGGCCAAGGCCTTAGAGGAACCCCCAGAGGAAAAGAAAACAGAGCCAACAGTCGCCACAGGGAGGAAACGCCCGGTCGAGGTCGATAC
CACAGAATTCCAACCACCGTCCCAGAGGCCTCGATATCAAAGTAGGCCACCTGCTCCACCTCCAATAGGCCGATACCTAGCAATGGAGAAGCCCCTGTGCCGTAATTGTG
GAAAGCAACATGTTGGGAGATGTTTGGCGGGCTCAGGTATGTGTTATATCTGTGGTCATGTAGGTCATGTGGCCAGAACTTGCCCTACAAAGAGCCCGGGAATTCCAAGG
GAACCCCTTAGAGGACCGGTCATCCGAGAGCCCACCTTACAAACCCGTCCACAGACCAAGGCATATGTAACGACCAGTAAAGAGGCGGGAACATCTGGCACCGTGGTGAC
AGGTACGCTTTCTATACTAGGACACTTTGCGTTGACATTGTTTGATTCTGGTTCTATCCATTCCTTTGTTGCTTTACCATTTGTTAAACAAGCAGGGTTCGTAATAGAAC
CCTTAATGCATGCGTTGTTGGTCGGTACCCCAGCAGGGGTAGACCTAGTTACGAAAGATAAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACCATCCACGTAGA
CTTAAAGGTAGTGGATATGACGGATTTTGA
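
The CDS above can be checked against the protein sequence by translating it codon by codon. The sketch below builds the standard genetic code table and translates only the first 30 bases and the final line of the CDS. It assumes the standard nuclear code applies and that the final line starts on a codon boundary (as it would if each of the nine preceding lines holds 110 bases). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first = translate("ATGTGCGACATAGTGTTACAGAAGGATGCA")   # first 30 bases of the CDS
last = translate("CTTAAAGGTAGTGGATATGACGGATTTTGA")    # final line of the CDS
print(first)
print(last)
```

The two fragments translate to `MCDIVLQKDA` and `LKGSGYDGF*`, matching the start and end of the protein sequence listed in this record, with `TGA` as the stop codon.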
mRNA sequence
ATGTGCGACATAGTGTTACAGAAGGATGCAGAAGTGTGGTGGTCGGATAATAAACAGAGCATCAACCCGGGTGGGGGAATTACAACATGGGAGACCTTTAAGGAAGCCTT
TCTAAAATATTATTATCCAAAGGAAACCCGTATAAAGAAACAGCAAGAGTTTAACCACTTAACCCAAGGTGATCGCACGGTGGATCAGTACGATCAGGACTTCATGAGAT
TGAGAAGGTTTGCACCGTCTTTAGCCGACACTGAAGAGAAACAGACAGAAAAAATTGTGTTAGGATTGAATCCGAAAACCCGCCGCATGTTAGAGGCCTTTAACCCAAAA
ACCTATGAAGAGGCCCTAAGGACGGCCAAGGCCTTAGAGGAACCCCCAGAGGAAAAGAAAACAGAGCCAACAGTCGCCACAGGGAGGAAACGCCCGGTCGAGGTCGATAC
CACAGAATTCCAACCACCGTCCCAGAGGCCTCGATATCAAAGTAGGCCACCTGCTCCACCTCCAATAGGCCGATACCTAGCAATGGAGAAGCCCCTGTGCCGTAATTGTG
GAAAGCAACATGTTGGGAGATGTTTGGCGGGCTCAGGTATGTGTTATATCTGTGGTCATGTAGGTCATGTGGCCAGAACTTGCCCTACAAAGAGCCCGGGAATTCCAAGG
GAACCCCTTAGAGGACCGGTCATCCGAGAGCCCACCTTACAAACCCGTCCACAGACCAAGGCATATGTAACGACCAGTAAAGAGGCGGGAACATCTGGCACCGTGGTGAC
AGGTACGCTTTCTATACTAGGACACTTTGCGTTGACATTGTTTGATTCTGGTTCTATCCATTCCTTTGTTGCTTTACCATTTGTTAAACAAGCAGGGTTCGTAATAGAAC
CCTTAATGCATGCGTTGTTGGTCGGTACCCCAGCAGGGGTAGACCTAGTTACGAAAGATAAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACCATCCACGTAGA
CTTAAAGGTAGTGGATATGACGGATTTTGA
Protein sequence
MCDIVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKIVLGLNPKTRRMLEAFNPK
TYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQSRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPGIPR
EPLRGPVIREPTLQTRPQTKAYVTTSKEAGTSGTVVTGTLSILGHFALTLFDSGSIHSFVALPFVKQAGFVIEPLMHALLVGTPAGVDLVTKDKSKGRTSGNSWTNHPRR
LKGSGYDGF
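
The record lists a CCHC-type zinc finger (IPR001878). As a rough illustration only, the sketch below scans a fragment of the protein sequence above for the commonly cited zinc-knuckle spacing C-X2-C-X4-H-X4-C using a regular expression. This is an assumption-laden simplification: InterPro assigns domains with profile models, not with this pattern, and the regex and variable names here are hypothetical.

```python
import re

# Commonly cited CCHC zinc-knuckle spacing: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

# Fragment copied from the protein sequence above (second line of the block).
fragment = "EKPLCRNCGKQHVGRCLAGSGMCYICGHVGHVARTCPTKSPG"

# Report 1-based start positions within the fragment and the matched text.
hits = [(m.start() + 1, m.group()) for m in CCHC_PATTERN.finditer(fragment)]
print(hits)
```

The strict pattern matches `CYICGHVGHVARTC` in this fragment. The neighbouring stretch `CRNCGKQHVGRC` has shorter spacing between the cysteine and histidine residues and is not matched, so a regex of this kind should be read as a quick check rather than a domain call.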