; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0010230 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0010230
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr03:9381995..9383300
RNA-Seq ExpressionPI0010230
SyntenyPI0010230
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032748.1 hypothetical protein E6C27_scaffold853G00910 [Cucumis melo var. makuwa]2.4e-6450.15Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FG  NRSRS+SN  + +N+N++S  EQE   I S+ Q S T  +GQG T+ V LEKYV+EN    I I+ E RK I                   
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
                 VSKEVKREI++ L NYFILD+++P+VQDYL+HEMSVLYRDFRCSLHK+YK+Y+S A+ARKHR KRVA           W +     + +AN
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG
         K  R  L    R      S  +KK    +GK++++I LFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +REIC++VLGKRS HVKG G G
Subjt:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG

Query:  PRPKYARNGARYTHTKETEAQVAHLQSIVES
        PRPK   N     +TKE  AQ+AH QSIVES
Subjt:  PRPKYARNGARYTHTKETEAQVAHLQSIVES

KAA0046978.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa]5.3e-5645.89Show/hide
Query:  QMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREII
        + NN+N++S  EQE   I S+ Q S T  +G+G T+GV LEKYV+ N  I I I+ E RK ICKDSS LNS IGKEV   VPLECEHW DVSKEVKREI+
Subjt:  QMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREII

Query:  ERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFS
        +RL     L  ++ V  ++ +  + ++Y                              D N     ++  R D K+I   +    + +            
Subjt:  ERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFS

Query:  RHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEA
              E E+GK++++IELFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +R+ICDRVLGKRSGHVKG GWGPRPK ARN     +TK+  A
Subjt:  RHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEA

Query:  QVAHLQSIVESQQIII
        Q+AHLQSIVESQQ  I
Subjt:  QVAHLQSIVESQQIII

KAE8651942.1 hypothetical protein Csa_006405 [Cucumis sativus]4.0e-6465.5Show/hide
Query:  ILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFSRHKKKKE
        I +LDDP+VQDYLDHEMSVLYRDF CSLHK+YK+ +S  EARKH DKRVA+D++W RLCDRWERE FK  SEAN KAR+ LPF HRGGT+TF RHK+K +
Subjt:  ILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFSRHKKKKE

Query:  LEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLKID-SQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEAQVAHLQS
         E+GK++++IELFQ THS  KK WV EAK KYDE++ LK   SQE  + + ++EIC+RVL KR G VK  GWGPRPK ARN A    TK+  AQ+AHLQS
Subjt:  LEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLKID-SQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEAQVAHLQS

TYJ98779.1 hypothetical protein E5676_scaffold156G00880 [Cucumis melo var. makuwa]2.4e-6450.15Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FG  NRSRS+SN  + +N+N++S  EQE   I S+ Q S T  +GQG T+ V LEKYV+EN    I I+ E RK I                   
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
                 VSKEVKREI++ L NYFILD+++P+VQDYL+HEMSVLYRDFRCSLHK+YK+Y+S A+ARKHR KRVA           W +     + +AN
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG
         K  R  L    R      S  +KK    +GK++++I LFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +REIC++VLGKRS HVKG G G
Subjt:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG

Query:  PRPKYARNGARYTHTKETEAQVAHLQSIVES
        PRPK   N     +TKE  AQ+AH QSIVES
Subjt:  PRPKYARNGARYTHTKETEAQVAHLQSIVES

TYK04806.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa]2.6e-6347.76Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FGPGNRSRS+SN  + NN+N++S  EQE   I S+ Q S T  +G+G T+GV LEKYV+ N  I I I+ E RK ICKDSS LNS IGKEV   V
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
        PLECEHW DVSKEVKREI++RL     L  ++ V  ++ +  + ++Y                              D N     ++  R D K+I   +
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKARAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGP
            + +                  E E+GK++++IELFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +R+ICDRVLGKRSGHVKG GWGP
Subjt:  AKARAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGP

Query:  RPKYARNGARYTHTKETEAQVAHLQSIVESQQIII
        RPK ARN     +TK+  AQ+AHLQSIVESQQ  I
Subjt:  RPKYARNGARYTHTKETEAQVAHLQSIVESQQIII

TrEMBL top hitse value%identityAlignment
A0A0A0LM17 Uncharacterized protein2.8e-4755.67Show/hide
Query:  QMNNNNEDSTSEQEIGSNFQGSGTLKRGQGVTKGV-ALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREIIER
        Q NN+N++S  EQE+         +     + KG+ ALEKYV+ N  I I+IE++ RK ICKDSS  NS IGKE                          
Subjt:  QMNNNNEDSTSEQEIGSNFQGSGTLKRGQGVTKGV-ALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREIIER

Query:  LLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFSRH
          NYFILDLDDP+VQDYLDHEMSVLYRDF CSLHK+YK+ +S  EARKH DKRVA+D++W RLCDRWERE FK  SEAN KAR+ LPF HRGGT+TF RH
Subjt:  LLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFSRH

Query:  KKK
        K+K
Subjt:  KKK

A0A5A7SQC8 DUF4218 domain-containing protein1.1e-6450.15Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FG  NRSRS+SN  + +N+N++S  EQE   I S+ Q S T  +GQG T+ V LEKYV+EN    I I+ E RK I                   
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
                 VSKEVKREI++ L NYFILD+++P+VQDYL+HEMSVLYRDFRCSLHK+YK+Y+S A+ARKHR KRVA           W +     + +AN
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG
         K  R  L    R      S  +KK    +GK++++I LFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +REIC++VLGKRS HVKG G G
Subjt:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG

Query:  PRPKYARNGARYTHTKETEAQVAHLQSIVES
        PRPK   N     +TKE  AQ+AH QSIVES
Subjt:  PRPKYARNGARYTHTKETEAQVAHLQSIVES

A0A5A7TVR8 Formin-like protein 4 isoform X22.5e-5645.89Show/hide
Query:  QMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREII
        + NN+N++S  EQE   I S+ Q S T  +G+G T+GV LEKYV+ N  I I I+ E RK ICKDSS LNS IGKEV   VPLECEHW DVSKEVKREI+
Subjt:  QMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKEVKREII

Query:  ERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFS
        +RL     L  ++ V  ++ +  + ++Y                              D N     ++  R D K+I   +    + +            
Subjt:  ERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFS

Query:  RHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEA
              E E+GK++++IELFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +R+ICDRVLGKRSGHVKG GWGPRPK ARN     +TK+  A
Subjt:  RHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEA

Query:  QVAHLQSIVESQQIII
        Q+AHLQSIVESQQ  I
Subjt:  QVAHLQSIVESQQIII

A0A5D3BIG6 DUF4218 domain-containing protein1.1e-6450.15Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FG  NRSRS+SN  + +N+N++S  EQE   I S+ Q S T  +GQG T+ V LEKYV+EN    I I+ E RK I                   
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
                 VSKEVKREI++ L NYFILD+++P+VQDYL+HEMSVLYRDFRCSLHK+YK+Y+S A+ARKHR KRVA           W +     + +AN
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG
         K  R  L    R      S  +KK    +GK++++I LFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +REIC++VLGKRS HVKG G G
Subjt:  AKA-RAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWG

Query:  PRPKYARNGARYTHTKETEAQVAHLQSIVES
        PRPK   N     +TKE  AQ+AH QSIVES
Subjt:  PRPKYARNGARYTHTKETEAQVAHLQSIVES

A0A5D3C2Z5 Formin-like protein 4 isoform X21.3e-6347.76Show/hide
Query:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV
        MSSR+FGPGNRSRS+SN  + NN+N++S  EQE   I S+ Q S T  +G+G T+GV LEKYV+ N  I I I+ E RK ICKDSS LNS IGKEV   V
Subjt:  MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQE---IGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTV

Query:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN
        PLECEHW DVSKEVKREI++RL     L  ++ V  ++ +  + ++Y                              D N     ++  R D K+I   +
Subjt:  PLECEHWVDVSKEVKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEAN

Query:  AKARAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGP
            + +                  E E+GK++++IELFQLTHS  K GWV EAKEKYDE++ LK   SQE  E + +R+ICDRVLGKRSGHVKG GWGP
Subjt:  AKARAALPFAHRGGTMTFSRHKKKKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLK-IDSQEAGESIEDREICDRVLGKRSGHVKGLGWGP

Query:  RPKYARNGARYTHTKETEAQVAHLQSIVESQQIII
        RPK ARN     +TK+  AQ+AHLQSIVESQQ  I
Subjt:  RPKYARNGARYTHTKETEAQVAHLQSIVESQQIII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCAAGAAGCTTTGGACCAGGAAATCGTAGTAGGTCCAACTCAAACACCCAACAAATGAACAACAATAATGAGGATTCTACTAGCGAGCAGGAAATTGGTTCAAA
TTTTCAGGGTTCTGGGACTCTAAAAAGAGGTCAAGGAGTAACTAAGGGGGTTGCACTAGAGAAATATGTCAAAGAGAACGAACGTATTTCAATCGAAATTGAAGTCGAAG
GTAGGAAACTCATTTGTAAGGATTCATCTAGTTTGAACTCTATGATTGGAAAAGAGGTACATGGAACAGTACCTCTCGAGTGTGAGCATTGGGTGGATGTCTCGAAGGAG
GTAAAAAGGGAGATAATAGAGAGACTTTTGAACTACTTCATTCTGGACTTGGATGATCCTGTGGTACAAGATTATCTCGATCATGAGATGAGTGTATTGTATAGGGACTT
TCGTTGTTCATTACACAAGACTTACAAACAATACAACTCTCTAGCTGAAGCACGAAAGCATCGCGACAAACGAGTGGCACGAGATGCAAATTGGATTCGTTTATGTGATC
GATGGGAAAGAGAAGACTTCAAGAAAATTTCTGAAGCAAATGCTAAGGCTCGAGCTGCTCTTCCATTTGCTCATAGAGGTGGCACTATGACATTTTCGCGACATAAGAAG
AAAAAGGAATTGGAAGATGGTAAGAAGTTGAACGATATTGAGTTGTTTCAACTCACTCATTCTAGAGCAAAGAAAGGATGGGTGGTTGAGGCAAAAGAGAAATATGATGA
AATTATGACACTAAAAATTGATTCACAAGAAGCCGGAGAATCTATTGAAGATAGAGAGATCTGTGACCGAGTGTTGGGTAAGCGGTCGGGTCATGTGAAGGGTCTTGGAT
GGGGACCAAGACCAAAATATGCTAGGAATGGAGCAAGGTACACTCACACTAAAGAAACAGAGGCTCAAGTTGCACACTTACAAAGTATAGTTGAATCACAACAAATCATC
ATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCAAGAAGCTTTGGACCAGGAAATCGTAGTAGGTCCAACTCAAACACCCAACAAATGAACAACAATAATGAGGATTCTACTAGCGAGCAGGAAATTGGTTCAAA
TTTTCAGGGTTCTGGGACTCTAAAAAGAGGTCAAGGAGTAACTAAGGGGGTTGCACTAGAGAAATATGTCAAAGAGAACGAACGTATTTCAATCGAAATTGAAGTCGAAG
GTAGGAAACTCATTTGTAAGGATTCATCTAGTTTGAACTCTATGATTGGAAAAGAGGTACATGGAACAGTACCTCTCGAGTGTGAGCATTGGGTGGATGTCTCGAAGGAG
GTAAAAAGGGAGATAATAGAGAGACTTTTGAACTACTTCATTCTGGACTTGGATGATCCTGTGGTACAAGATTATCTCGATCATGAGATGAGTGTATTGTATAGGGACTT
TCGTTGTTCATTACACAAGACTTACAAACAATACAACTCTCTAGCTGAAGCACGAAAGCATCGCGACAAACGAGTGGCACGAGATGCAAATTGGATTCGTTTATGTGATC
GATGGGAAAGAGAAGACTTCAAGAAAATTTCTGAAGCAAATGCTAAGGCTCGAGCTGCTCTTCCATTTGCTCATAGAGGTGGCACTATGACATTTTCGCGACATAAGAAG
AAAAAGGAATTGGAAGATGGTAAGAAGTTGAACGATATTGAGTTGTTTCAACTCACTCATTCTAGAGCAAAGAAAGGATGGGTGGTTGAGGCAAAAGAGAAATATGATGA
AATTATGACACTAAAAATTGATTCACAAGAAGCCGGAGAATCTATTGAAGATAGAGAGATCTGTGACCGAGTGTTGGGTAAGCGGTCGGGTCATGTGAAGGGTCTTGGAT
GGGGACCAAGACCAAAATATGCTAGGAATGGAGCAAGGTACACTCACACTAAAGAAACAGAGGCTCAAGTTGCACACTTACAAAGTATAGTTGAATCACAACAAATCATC
ATTTAA
Protein sequenceShow/hide protein sequence
MSSRSFGPGNRSRSNSNTQQMNNNNEDSTSEQEIGSNFQGSGTLKRGQGVTKGVALEKYVKENERISIEIEVEGRKLICKDSSSLNSMIGKEVHGTVPLECEHWVDVSKE
VKREIIERLLNYFILDLDDPVVQDYLDHEMSVLYRDFRCSLHKTYKQYNSLAEARKHRDKRVARDANWIRLCDRWEREDFKKISEANAKARAALPFAHRGGTMTFSRHKK
KKELEDGKKLNDIELFQLTHSRAKKGWVVEAKEKYDEIMTLKIDSQEAGESIEDREICDRVLGKRSGHVKGLGWGPRPKYARNGARYTHTKETEAQVAHLQSIVESQQII
I