; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0012533 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0012533
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr03:12676049..12680597
RNA-Seq ExpressionIVF0012533
SyntenyIVF0012533
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043909.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa]2.55e-12297.46Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ 
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

KAA0063927.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa]8.41e-12798.98Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

TYK11305.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.39e-23287.93Show/hide
Query:  MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPEHQTYRK
        MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPE  TYRK
Subjt:  MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPEHQTYRK

Query:  HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHPR-------------------------------------------TAGLLLEAALRI
        HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHPR                                           TAGLLLEAALRI
Subjt:  HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHPR-------------------------------------------TAGLLLEAALRI

Query:  QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS
        QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS
Subjt:  QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS

Query:  PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
Subjt:  PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

XP_008467034.2 PREDICTED: uncharacterized protein LOC103504463, partial [Cucumis melo]1.72e-12597.45Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAK+AIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ

XP_011651995.1 uncharacterized protein LOC105434967 [Cucumis sativus]2.42e-11894.42Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE ENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        NLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQ 
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

TrEMBL top hitse value%identityAlignment
A0A0A0LAR8 Uncharacterized protein8.9e-9694.9Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE ENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
        NLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQ
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ

A0A1S3CTV9 uncharacterized protein LOC1035044631.9e-9897.45Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAK+AIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ

A0A5A7VAA2 Histone-lysine N-methyltransferase SETD1B-like isoform X24.6e-10098.98Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

A0A5D3CIC0 Ty3-gypsy retrotransposon protein1.1e-18187.93Show/hide
Query:  MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPEHQTYRK
        MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPE  TYRK
Subjt:  MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPEHQTYRK

Query:  HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHP-------------------------------------------RTAGLLLEAALRI
        HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHP                                           RTAGLLLEAALRI
Subjt:  HDGVNKYQIVEGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHP-------------------------------------------RTAGLLLEAALRI

Query:  QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS
        QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS
Subjt:  QKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSS

Query:  PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
        PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS
Subjt:  PGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQS

A0A5D3DNQ5 Histone-lysine N-methyltransferase SETD1B-like isoform X28.6e-9997.96Show/hide
Query:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
        V  RTAGLLLEAALRIQKQST ARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES
Subjt:  VHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCES

Query:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
        NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVS+LDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
Subjt:  NLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36420.1 unknown protein2.7e-2038.42Show/hide
Query:  RTAGLLLEAALRIQKQST--TARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        RTA +LL+AA RIQKQ +     +K+  + NG G+ GS LK LT+R  ++ R  + DG              +++E   +   S  R   V   D C   
Subjt:  RTAGLLLEAALRIQKQST--TARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL---DHQANDVESLQKLPAED----EEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFA
         C+SPF FVLQ++ SS GH+TP  +S  +SPAR    D  +++ ESL+K+  ++    EEE+KEQ SPVS+LDP  E++++ +    E +   NL  SF 
Subjt:  LCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL---DHQANDVESLQKLPAED----EEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFA

Query:  IVQ
        IVQ
Subjt:  IVQ

AT5G03670.1 unknown protein8.4e-2235.71Show/hide
Query:  DVHPRTAGLLLEAALRIQKQST-TARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLPAKMAI---EENEKENDS-
        ++  RTA +LLEAA+RIQKQS+  +++++    N  G+ GS LK+LT+R   +KREI G    GR++        R   P+  K+     + NE+EN S 
Subjt:  DVHPRTAGLLLEAALRIQKQST-TARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLPAKMAI---EENEKENDS-

Query:  -VFRLSNVTGF-----------------------DFCES--------------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA--
           ++++ T F                       DF  S                            C+SPF FVLQ+  S+ G RTP  SSP +SP   
Subjt:  -VFRLSNVTGF-----------------------DFCES--------------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA--

Query:  --RLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ
           ++ ++ +VE L+KL  E+EEEEKEQSSPVS+LDPPF+DDDE         DD N+  SF  VQ
Subjt:  --RLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGCGAATTCCATTAGAGCTCAGTATGGAAGACCACCGCAAACTTCTTTCATGTACTCTAAGCCGTACACCAAGAGAATCGATAACTTGAGAATGCCACTTGGGTA
CCAACCTCCAAAATTTCAGCAATTCGATGGAAAGGGCAACCCAAAGCAGCATATCGTCTACTTTGTCAAAACATGTGAAAATGTAGGATCAAGAGGAGACCAACTAGTCA
GGCAGTTCGTTCGAAGCTTAAAAAGAAATGCTTTCGAGTGGTATACTGATCTAGATCCAGAACACCAGACGTACCGTAAGCATGATGGAGTTAACAAATACCAAATAGTG
GAAGGGAGAGCCAGTCATCAACTACATAAACCGATGGAGAGCTCTAAGCCTGGATTGCAAAGACAAGCTCACAGAACTGTCTGCAGTAGAGATGTGCACCCAAGAACGGC
TGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGACTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGC
GTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATCAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAAC
GAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGTTCTTC
ACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGG
AGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGATATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAA
CGCAGCTTCGCCATTGTACAAAGTGAAATGGGTGTAGATTGCTGGCTTTTTGTTTATGTATTCGAGATAATTGTTGTTCTTTGGTTATTCTTTTTTGTTTATTGTAGTCC
GATTTTTGGGGCTTTTTTTTTATTTGAGACTATGTATATTGAGGAACTGAAGGAGGAAGGAGAGTTACAGAGTGGGGAAGAGGGAAATTCGAATGACTACCACAACAGCT
GA
mRNA sequenceShow/hide mRNA sequence
ATGATCGCGAATTCCATTAGAGCTCAGTATGGAAGACCACCGCAAACTTCTTTCATGTACTCTAAGCCGTACACCAAGAGAATCGATAACTTGAGAATGCCACTTGGGTA
CCAACCTCCAAAATTTCAGCAATTCGATGGAAAGGGCAACCCAAAGCAGCATATCGTCTACTTTGTCAAAACATGTGAAAATGTAGGATCAAGAGGAGACCAACTAGTCA
GGCAGTTCGTTCGAAGCTTAAAAAGAAATGCTTTCGAGTGGTATACTGATCTAGATCCAGAACACCAGACGTACCGTAAGCATGATGGAGTTAACAAATACCAAATAGTG
GAAGGGAGAGCCAGTCATCAACTACATAAACCGATGGAGAGCTCTAAGCCTGGATTGCAAAGACAAGCTCACAGAACTGTCTGCAGTAGAGATGTGCACCCAAGAACGGC
TGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGACTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGC
GTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATCAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAAC
GAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGTTCTTC
ACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGG
AGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGATATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAA
CGCAGCTTCGCCATTGTACAAAGTGAAATGGGTGTAGATTGCTGGCTTTTTGTTTATGTATTCGAGATAATTGTTGTTCTTTGGTTATTCTTTTTTGTTTATTGTAGTCC
GATTTTTGGGGCTTTTTTTTTATTTGAGACTATGTATATTGAGGAACTGAAGGAGGAAGGAGAGTTACAGAGTGGGGAAGAGGGAAATTCGAATGACTACCACAACAGCT
GA
Protein sequenceShow/hide protein sequence
MIANSIRAQYGRPPQTSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNPKQHIVYFVKTCENVGSRGDQLVRQFVRSLKRNAFEWYTDLDPEHQTYRKHDGVNKYQIV
EGRASHQLHKPMESSKPGLQRQAHRTVCSRDVHPRTAGLLLEAALRIQKQSTTARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEEN
EKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSILDPPFEDDDEGNFEDGEDEDDYNLE
RSFAIVQSEMGVDCWLFVYVFEIIVVLWLFFFVYCSPIFGAFFLFETMYIEELKEEGELQSGEEGNSNDYHNS