; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0012470 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0012470
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionCopia-like protein
Genome locationchr09:19585454..19586898
RNA-Seq ExpressionIVF0012470
SyntenyIVF0012470
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036222.1 copia-like protein [Cucumis melo var. makuwa]1.92e-260100Show/hide
Query:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
        MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
Subjt:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA

Query:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
        KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
Subjt:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI

Query:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
        EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
Subjt:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN

Query:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS
        PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS
Subjt:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS

TYK00722.1 copia-like protein [Cucumis melo var. makuwa]5.63e-25999.45Show/hide
Query:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
        MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
Subjt:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA

Query:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
        KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
Subjt:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI

Query:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
        EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
Subjt:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN

Query:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGHS
        PNPSQGPADHGPPGFYGTTAAGPVQ PISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSG S
Subjt:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGHS

XP_018497906.1 PREDICTED: uncharacterized protein LOC108865503 [Pyrus x bretschneideri]3.20e-5036.12Show/hide
Query:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK
        +  N+  + ++    NY+PW+R+V + LGG+ K  FINGS   P   D ++P        E+W + DQ++MSWLL++MD K++    Y ++S ++W   K
Subjt:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK

Query:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE
          YG   N A +F LK+++S+++Q       L+  + + W EL++Y P T +   + KR E + I+  L +LDS++E +R  IL + E+P F  V + I+
Subjt:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE

Query:  QEESRRRLMNPSPAPSTDNQAFRATYNKDRGKG---NLWCDHCKRSSHNKESCWVLHPHLKPQ
        +EE RR++MN     S        T N+ R KG   NL C HC    H K++CW+LHP LKP+
Subjt:  QEESRRRLMNPSPAPSTDNQAFRATYNKDRGKG---NLWCDHCKRSSHNKESCWVLHPHLKPQ

XP_018506028.1 PREDICTED: uncharacterized protein LOC108868128 [Pyrus x bretschneideri]8.85e-5036.5Show/hide
Query:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK
        +  N+  + ++    NY+PW+R+V + LGG+ K  FINGS   P   D ++P        E+W + DQ++MSWLL++MD K++    Y ++S ++W   K
Subjt:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK

Query:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE
          YG   N A +F LK+++S+++Q       L+  + + W EL++Y P TI+   + KR E + I+  L +LDS++E +R  IL + E+P F  V   I+
Subjt:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE

Query:  QEESRRRLMNPSPAPSTDNQAFRATYNKDRGKG---NLWCDHCKRSSHNKESCWVLHPHLKPQ
        +EE RR++MN     S        T N+ R KG   NL C HC    H K++CW+LHP LKP+
Subjt:  QEESRRRLMNPSPAPSTDNQAFRATYNKDRGKG---NLWCDHCKRSSHNKESCWVLHPHLKPQ

XP_021808903.1 uncharacterized protein LOC110752531 [Prunus avium]6.60e-5236.15Show/hide
Query:  ILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKTRYGQGKNF
        +L    NY+PW+R+V + LGG+ K  FING+   P   D            E+W   DQ++MSWLL++M+ +++    +  +++ +WT  K  YG   N 
Subjt:  ILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKTRYGQGKNF

Query:  AHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQEESRRRLM
        A IF LK++++ + Q   +  + + ++   W EL +Y P TI+   + KR E + I+  L +LD ++E +R+ IL SAEMP F+ V   I++EE R+++M
Subjt:  AHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQEESRRRLM

Query:  N---PSPAPSTDNQAF------RATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQ
        N    S A S++++AF      +A+ N    KGNL C +C++  H K+ CW+LHPHLKP+
Subjt:  N---PSPAPSTDNQAF------RATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQ

TrEMBL top hitse value%identityAlignment
A0A5A7SYC4 Copia-like protein9.4e-209100Show/hide
Query:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
        MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
Subjt:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA

Query:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
        KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
Subjt:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI

Query:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
        EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
Subjt:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN

Query:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS
        PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS
Subjt:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNS

A0A5D3DP15 Copia-like protein9.4e-20999.45Show/hide
Query:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
        MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA
Subjt:  MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKA

Query:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
        KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI
Subjt:  KTRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKI

Query:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
        EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN
Subjt:  EQEESRRRLMNPSPAPSTDNQAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMN

Query:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGHS
        PNPSQGPADHGPPGFYGTTAAGPVQ PISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSG S
Subjt:  PNPSQGPADHGPPGFYGTTAAGPVQGPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGHS

A0A5N5FSZ2 Retrotran_gag_3 domain-containing protein6.4e-4032.74Show/hide
Query:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK
        +  N+  + ++    NY+ W+R+V + LGG+ K  FINGS   P+            +  E+W + DQ++MSWLL++M+ K++    Y K+S ++W   K
Subjt:  IENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAK

Query:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE
          YG   N A +F LK+++S+++Q       L+  + T W EL+ Y P T +   + KR + + I+  L +LDS++E +R  +L + E+P F  V   I+
Subjt:  TRYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIE

Query:  QEESRRRLMNPSPAPST-DNQAFRATYNKDRGK-GNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSA
        +EE RR++MN S  P+  +++A+     + +GK  +L C HC  + H +++CW+LHP LKP    G    N    R  H++
Subjt:  QEESRRRLMNPSPAPST-DNQAFRATYNKDRGK-GNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSA

A0A5N5HX88 Retrotran_gag_3 domain-containing protein2.4e-3933.21Show/hide
Query:  ENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKT
        E+N     +L    NY+PW+R++ + LGG+ K  ++NG+    KP D ++ T D       W + DQ++MSW+L++M+ K+S    Y  +S  +W   K 
Subjt:  ENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKT

Query:  RYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQ
         YG   N A +F LK+ L+ IKQGN      +  + + W EL +Y P T     + KR + + ++  L +L   +E +R+ +L + E+P F +V   +++
Subjt:  RYGQGKNFAHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQ

Query:  EESRRRLMNPSPAPSTDNQAFRATYNKD-----RGKGNLW-CDHCKRSSHNKESCWVLHPHLKPQ
        EE+RR++M+  P  S + +AF + +N        GK   W C +C    H +E CW+LHP LKP+
Subjt:  EESRRRLMNPSPAPSTDNQAFRATYNKD-----RGKGNLW-CDHCKRSSHNKESCWVLHPHLKPQ

A0A6P5S547 uncharacterized protein LOC1107525316.2e-4336.15Show/hide
Query:  ILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKTRYGQGKNF
        +L    NY+PW+R+V + LGG+ K  FING+   P   D            E+W   DQ++MSWLL++M+ +++    +  +++ +WT  K  YG   N 
Subjt:  ILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKTRYGQGKNF

Query:  AHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQEESRRRLM
        A IF LK++++ + Q   +  + + ++   W EL +Y P TI+   + KR E + I+  L +LD ++E +R+ IL SAEMP F+ V   I++EE R+++M
Subjt:  AHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQEESRRRLM

Query:  N---PSPAPSTDNQAF------RATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQ
        N    S A S++++AF      +A+ N    KGNL C +C++  H K+ CW+LHPHLKP+
Subjt:  N---PSPAPSTDNQAF------RATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGAAAACAACAAAATCACCACATATATCCTCACTGGTAACAATAACTACGTACCATGGGCCAGATCTGTGGAAATAGGCTTAGGGGGTAAGGGTAAAAGGTCATT
CATTAATGGATCCAAAGGAAAACCAAAACCAAAAGACCAAGCCAACCCTACTGATGATGAACTAACAGCCATAGAAAATTGGGAGACCACAGATCAGATGATCATGTCGT
GGCTCTTAAGCACAATGGACACTAAAATCTCTAGTGCTCTTATGTACTGTAAAACCTCAAAGGAAATCTGGACAAAAGCCAAAACCCGATATGGTCAAGGAAAAAACTTC
GCACATATTTTTTCATTAAAACAAGAACTTTCCAACATCAAACAAGGCAACCTCAATAACTCAGACCTAGTGGCCGAAATACTGACCAAATGGGAAGAGTTACAGATGTA
TCTCCCAGAAACAATAAATCCAGAGGAAATTTACAAAAGAAACGAACATGAATTAATTTATACTTATCTCGGGGCTTTAGATTCCAGTTTTGAACCCATTCGGGCTCAAA
TTCTCTCATCGGCAGAAATGCCACAGTTTGATGATGTAGTTCTCAAGATTGAGCAGGAGGAATCAAGAAGACGGCTCATGAATCCGTCGCCAGCTCCATCAACAGACAAC
CAAGCATTTCGCGCAACCTACAACAAGGACAGAGGAAAGGGGAACTTGTGGTGTGATCATTGCAAACGATCGAGTCACAACAAGGAGAGTTGCTGGGTTCTTCATCCGCA
TCTCAAGCCCCAACGCAAAGGTGGCGGAAGTACCAACAACGGCGGGTGGCGCCGAGAGGCACATTCCGCCATAGGAGAACTCAACAGCGGGACAAATACGAAGGCTGACT
TCGGGATGAATGGGATGAACCCTAATCCCTCTCAAGGGCCAGCTGATCACGGGCCACCAGGCTTCTATGGAACGACTGCTGCTGGGCCAGTGCAAGGCCCAATCTCCTTT
GTCACGCCTGCTGGGCCTTCCGGACCTAATTCGGATCAACTCATGCAGCTGGTTAACCAATTAAATCAACTACTTCAGCCGAGGCAACAGAACTCAGGACATAGTCTCAG
GGGAGATGATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGAAAACAACAAAATCACCACATATATCCTCACTGGTAACAATAACTACGTACCATGGGCCAGATCTGTGGAAATAGGCTTAGGGGGTAAGGGTAAAAGGTCATT
CATTAATGGATCCAAAGGAAAACCAAAACCAAAAGACCAAGCCAACCCTACTGATGATGAACTAACAGCCATAGAAAATTGGGAGACCACAGATCAGATGATCATGTCGT
GGCTCTTAAGCACAATGGACACTAAAATCTCTAGTGCTCTTATGTACTGTAAAACCTCAAAGGAAATCTGGACAAAAGCCAAAACCCGATATGGTCAAGGAAAAAACTTC
GCACATATTTTTTCATTAAAACAAGAACTTTCCAACATCAAACAAGGCAACCTCAATAACTCAGACCTAGTGGCCGAAATACTGACCAAATGGGAAGAGTTACAGATGTA
TCTCCCAGAAACAATAAATCCAGAGGAAATTTACAAAAGAAACGAACATGAATTAATTTATACTTATCTCGGGGCTTTAGATTCCAGTTTTGAACCCATTCGGGCTCAAA
TTCTCTCATCGGCAGAAATGCCACAGTTTGATGATGTAGTTCTCAAGATTGAGCAGGAGGAATCAAGAAGACGGCTCATGAATCCGTCGCCAGCTCCATCAACAGACAAC
CAAGCATTTCGCGCAACCTACAACAAGGACAGAGGAAAGGGGAACTTGTGGTGTGATCATTGCAAACGATCGAGTCACAACAAGGAGAGTTGCTGGGTTCTTCATCCGCA
TCTCAAGCCCCAACGCAAAGGTGGCGGAAGTACCAACAACGGCGGGTGGCGCCGAGAGGCACATTCCGCCATAGGAGAACTCAACAGCGGGACAAATACGAAGGCTGACT
TCGGGATGAATGGGATGAACCCTAATCCCTCTCAAGGGCCAGCTGATCACGGGCCACCAGGCTTCTATGGAACGACTGCTGCTGGGCCAGTGCAAGGCCCAATCTCCTTT
GTCACGCCTGCTGGGCCTTCCGGACCTAATTCGGATCAACTCATGCAGCTGGTTAACCAATTAAATCAACTACTTCAGCCGAGGCAACAGAACTCAGGACATAGTCTCAG
GGGAGATGATTGGTGA
Protein sequenceShow/hide protein sequence
MIENNKITTYILTGNNNYVPWARSVEIGLGGKGKRSFINGSKGKPKPKDQANPTDDELTAIENWETTDQMIMSWLLSTMDTKISSALMYCKTSKEIWTKAKTRYGQGKNF
AHIFSLKQELSNIKQGNLNNSDLVAEILTKWEELQMYLPETINPEEIYKRNEHELIYTYLGALDSSFEPIRAQILSSAEMPQFDDVVLKIEQEESRRRLMNPSPAPSTDN
QAFRATYNKDRGKGNLWCDHCKRSSHNKESCWVLHPHLKPQRKGGGSTNNGGWRREAHSAIGELNSGTNTKADFGMNGMNPNPSQGPADHGPPGFYGTTAAGPVQGPISF
VTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGHSLRGDDW