CuGenDBv2

MELO3C035653 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C035653
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: CACTA en-spm transposon protein
Genome location: chr12:15719279..15724105
RNA-Seq Expression: MELO3C035653
Synteny: MELO3C035653
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant
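
The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("chr12:15719279..15724105")
print(loc.seqid, loc.start, loc.end, loc.length)  # chr12 15719279 15724105 4827
```

Under the inclusive assumption the locus spans 4827 bp; with a half-open convention the figure would be one lower.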


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0031960.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]; e-value: 3.7e-104; identity: 64.23%
Query:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------
        D MFL+FE+DLDNIAGGSSSVGDN   SS Q  T T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFS                      
Subjt:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------

Query:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY
                   QRFFVLDFNDQAMNRFV+HQM TT FKEF+ D H+HFKKYSDPEEARANPPNAL                    EQSRTNKAARQK PY
Subjt:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY

Query:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS
        NHSSGSK FLQRQ EL E++G+ + RVELF++ H++             NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWGPKPKA RT S
Subjt:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS

Query:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR
        ASSSST   QSTQ EIELQ KL+EALE IEVQDRNHQ LASQVE M+K+I+D TR
Subjt:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR

KAA0033296.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]; e-value: 6.3e-104; identity: 66.57%
Query:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------
        D MFL+FE+DLDNIAGGSSSVGDN   SS Q  T T RR AQSRLLELE HVAING I MTI   AEKPISPHA+RFS                      
Subjt:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------

Query:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKK
                   QR FVLDFNDQAMNRFV+HQM TT FKEF+ D HRHFKKYSDPEEARANPPNALEQSRTNK ARQK PYNHSSGSK FLQRQ EL E++
Subjt:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKK

Query:  GKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQV
        G+ + RVELF++  ++             NQ+LELQSQPTP+GSQPLS++EICDQVL RR  Y KGLGWGPKPKA RT SASSSST   QSTQ EIELQ 
Subjt:  GKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQV

Query:  KLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR
        KL+EALE IEVQDRNHQ LASQVE M+K+I+D TR
Subjt:  KLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR

KAA0042203.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]; e-value: 2.2e-109; identity: 73.73%
Query:  DSMFLEFEDDLD-NIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS-------------QRFFVLDF
        D MFL+FEDDLD NIAGGSSSVGDN ESSSQQ AT T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFS             QR FVLDF
Subjt:  DSMFLEFEDDLD-NIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS-------------QRFFVLDF

Query:  NDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ---
        NDQAMNRFV+HQM TT FKEFQ D HRHFKKYSDPEEARANPPNALEQSRTNKAARQK PYNHSSGSK FLQRQ EL E+KG+ + RVELF++ H++   
Subjt:  NDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ---

Query:  ----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVL
                  NQMLELQSQ TP+GSQPLS++EICDQVL RR GY KGLGWGPKPKA RT SASSSST   QST+ EIELQ KL EALE IEVQDRNHQ L
Subjt:  ----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVL

Query:  ASQVERMQKLIKDFTR
        ASQVE M+K+I++FTR
Subjt:  ASQVERMQKLIKDFTR

KAA0046652.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]; e-value: 1.7e-104; identity: 71.57%
Query:  MFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ-------RFFVLDFNDQAMNRFV
        MFL+FEDDLDNIAGGSSSVGDN  SSSQQ  T T RR AQSRLLELE HVAING I MT+   AEKPISPHA+RFSQ       R FVLDFNDQ MNRFV
Subjt:  MFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ-------RFFVLDFNDQAMNRFV

Query:  QHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVE
        +HQM TT FKEF+ D HRHFKKYSDPEEARANPPNAL                    EQSR NKAARQK PYNHSSGSK FLQRQ EL E++G+ + RVE
Subjt:  QHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVE

Query:  LFQKIHLQNQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLAS
        LF++ H  NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWGPKPK  RT SASSSST  LQSTQ EIELQ KL+E LE IEVQDRNHQ LAS
Subjt:  LFQKIHLQNQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLAS

Query:  QVERMQKLIKDFT
        QVE M+K+I+D T
Subjt:  QVERMQKLIKDFT

TYK16780.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]; e-value: 8.3e-104; identity: 64.79%
Query:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------
        D MFL+FE+DLDNIAGGSSSVGDN  SSSQQ  T T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFS                      
Subjt:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------

Query:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY
                   QRFFVLDFNDQAMNRFV+HQM TT FKEF+ D H+HFKKYSDPEEARANPPNAL                    EQSRTNKAARQK PY
Subjt:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY

Query:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS
        NHSSGSK FLQRQ EL E++G+ + RVELF++ H++             NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWGPKPKA RT S
Subjt:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS

Query:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR
        ASSSST   QSTQ EIELQ KL+EALE IEVQDRNHQ LASQVE M+K+I+D TR
Subjt:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7STH7 CACTA en-spm transposon protein; e-value: 3.1e-104; identity: 66.57%
Query:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------
        D MFL+FE+DLDNIAGGSSSVGDN   SS Q  T T RR AQSRLLELE HVAING I MTI   AEKPISPHA+RFS                      
Subjt:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------

Query:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKK
                   QR FVLDFNDQAMNRFV+HQM TT FKEF+ D HRHFKKYSDPEEARANPPNALEQSRTNK ARQK PYNHSSGSK FLQRQ EL E++
Subjt:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKK

Query:  GKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQV
        G+ + RVELF++  ++             NQ+LELQSQPTP+GSQPLS++EICDQVL RR  Y KGLGWGPKPKA RT SASSSST   QSTQ EIELQ 
Subjt:  GKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQV

Query:  KLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR
        KL+EALE IEVQDRNHQ LASQVE M+K+I+D TR
Subjt:  KLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR

A0A5A7TG53 CACTA en-spm transposon protein; e-value: 1.1e-109; identity: 73.73%
Query:  DSMFLEFEDDLD-NIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS-------------QRFFVLDF
        D MFL+FEDDLD NIAGGSSSVGDN ESSSQQ AT T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFS             QR FVLDF
Subjt:  DSMFLEFEDDLD-NIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS-------------QRFFVLDF

Query:  NDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ---
        NDQAMNRFV+HQM TT FKEFQ D HRHFKKYSDPEEARANPPNALEQSRTNKAARQK PYNHSSGSK FLQRQ EL E+KG+ + RVELF++ H++   
Subjt:  NDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ---

Query:  ----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVL
                  NQMLELQSQ TP+GSQPLS++EICDQVL RR GY KGLGWGPKPKA RT SASSSST   QST+ EIELQ KL EALE IEVQDRNHQ L
Subjt:  ----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVL

Query:  ASQVERMQKLIKDFTR
        ASQVE M+K+I++FTR
Subjt:  ASQVERMQKLIKDFTR

A0A5A7TT95 CACTA en-spm transposon protein; e-value: 8.1e-105; identity: 71.57%
Query:  MFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ-------RFFVLDFNDQAMNRFV
        MFL+FEDDLDNIAGGSSSVGDN  SSSQQ  T T RR AQSRLLELE HVAING I MT+   AEKPISPHA+RFSQ       R FVLDFNDQ MNRFV
Subjt:  MFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ-------RFFVLDFNDQAMNRFV

Query:  QHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVE
        +HQM TT FKEF+ D HRHFKKYSDPEEARANPPNAL                    EQSR NKAARQK PYNHSSGSK FLQRQ EL E++G+ + RVE
Subjt:  QHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVE

Query:  LFQKIHLQNQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLAS
        LF++ H  NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWGPKPK  RT SASSSST  LQSTQ EIELQ KL+E LE IEVQDRNHQ LAS
Subjt:  LFQKIHLQNQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLAS

Query:  QVERMQKLIKDFT
        QVE M+K+I+D T
Subjt:  QVERMQKLIKDFT

A0A5D3BPD6 CACTA en-spm transposon protein; e-value: 6.8e-104; identity: 58.75%
Query:  HN-ALGMTSRIRELKTKGKGRREKQRFVAVRSSLSFILSSLSSLAVPLQSATL-----------------DSMFLEFEDDLDNIAGGSSSVGDNAESSSQ
        HN A     R +ELK +G+ R E +     R   + +LS  SS      SA L                 D MFL+FE+DLDNI GGSSSVGDN  SSSQ
Subjt:  HN-ALGMTSRIRELKTKGKGRREKQRFVAVRSSLSFILSSLSSLAVPLQSATL-----------------DSMFLEFEDDLDNIAGGSSSVGDNAESSSQ

Query:  QPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ--------------------------RFFVLDFNDQAMNRFVQHQMHTTFF
        Q AT T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFSQ                          RFFVLDFNDQAMN+FV+HQM  T  
Subjt:  QPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFSQ--------------------------RFFVLDFNDQAMNRFVQHQMHTTFF

Query:  KEFQTDYHRHFKKYSDPEEARANPPNAL-------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ--
        KEF+ DYHRHFKKYSDPEEARANPPNAL                   +QSRTNKAARQK PYNHSSGSK FLQRQ EL E+KG+ + RVELFQ+ H++  
Subjt:  KEFQTDYHRHFKKYSDPEEARANPPNAL-------------------EQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ--

Query:  -----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQV
                   NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWG KPKA +T SASSSST   QST+ EIELQ KL EALE IEVQDRNHQ 
Subjt:  -----------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQV

Query:  LASQVERMQKLIKDFTR
        LASQVE M+K+I++FTR
Subjt:  LASQVERMQKLIKDFTR

A0A5D3DD82 CACTA en-spm transposon protein; e-value: 1.8e-104; identity: 64.23%
Query:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------
        D MFL+FE+DLDNIAGGSSSVGDN   SS Q  T T RR AQSRLLELERHVAING I MTI   AEKPISPHA+RFS                      
Subjt:  DSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGCILMTITLRAEKPISPHAIRFS----------------------

Query:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY
                   QRFFVLDFNDQAMNRFV+HQM TT FKEF+ D H+HFKKYSDPEEARANPPNAL                    EQSRTNKAARQK PY
Subjt:  -----------QRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNAL--------------------EQSRTNKAARQKHPY

Query:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS
        NHSSGSK FLQRQ EL E++G+ + RVELF++ H++             NQMLELQSQPTP+GSQPLS++EICDQVL RR GY KGLGWGPKPKA RT S
Subjt:  NHSSGSKLFLQRQCELTEKKGKLIGRVELFQKIHLQ-------------NQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTS

Query:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR
        ASSSST   QSTQ EIELQ KL+EALE IEVQDRNHQ LASQVE M+K+I+D TR
Subjt:  ASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQVERMQKLIKDFTR

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found
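
The middle row of each alignment block above marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. A minimal sketch of how those marks could be tallied for one block follows. It assumes the common BLAST-style convention that identity is the number of identical columns divided by the alignment length including gap columns; the database may compute its reported % identity differently, so the result need not match the figures listed for each hit. The function name `block_identity` is hypothetical.

```python
def block_identity(query_row: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment block.

    query_row: the aligned query residues, gaps written as '-'.
    midline:   the match row; letters = identical, '+' = conservative.
    """
    columns = len(query_row)
    # Trailing spaces of the midline are often trimmed, so pad it back out.
    mid = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in mid)
    # "Positives" count identical plus conservative columns.
    positive = identical + mid.count("+")
    return identical, positive, columns


# Final block of the KAA0046652.1 alignment, copied from the record above.
identical, positive, columns = block_identity("QVERMQKLIKDFT", "QVE M+K+I+D T")
print(identical, positive, columns)                # 8 11 13
print(round(100 * identical / columns, 2))         # 61.54
```

Summing `identical` and `columns` over all blocks of a hit before dividing gives a whole-alignment figure rather than a per-block one.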

Sequences
CDS sequence
ATGCACAATGCATTGGGAATGACGTCAAGAATAAGGGAACTGAAAACGAAAGGGAAAGGGAGAAGAGAGAAACAAAGATTTGTCGCCGTCCGTAGCTCACTGTCGTTCAT
TTTGTCATCGTTGTCGTCCCTCGCCGTTCCGCTGCAGTCTGCCACCTTAGACTCTATGTTCCTCGAGTTTGAGGATGATTTAGATAACATCGCGGGAGGGTCGTCGTCTG
TGGGCGACAATGCGGAGTCTTCTTCTCAACAACCTGCGACTCTAACTTCTAGGAGATGTGCGCAGTCTCGACTGTTGGAGTTGGAGCGTCACGTTGCAATAAATGGATGT
ATTCTGATGACGATCACCCTTAGAGCGGAGAAGCCTATTTCCCCACATGCCATTCGCTTCAGCCAACGATTTTTTGTGCTTGATTTCAATGATCAAGCTATGAATAGGTT
TGTTCAGCATCAGATGCACACGACCTTTTTTAAAGAGTTCCAGACCGACTATCACAGACACTTCAAAAAGTACAGCGACCCGGAGGAAGCTCGTGCCAACCCACCAAACG
CATTGGAGCAATCACGAACGAACAAGGCTGCTAGACAGAAGCATCCTTATAATCATAGTAGCGGGTCCAAGTTGTTTCTACAACGACAGTGCGAGCTCACTGAGAAAAAA
GGGAAGTTGATCGGTCGTGTGGAGTTATTTCAGAAAATACACTTGCAAAATCAAATGCTGGAACTCCAATCCCAACCTACCCCAAAGGGTAGTCAGCCACTCTCTAAGAA
TGAGATATGTGATCAGGTGTTGGATAGACGATCAGGCTACTTAAAAGGTCTTGGTTGGGGACCCAAGCCGAAGGCCCATAGGACGACGAGTGCTAGCAGTTCCTCGACAT
TTTATTTGCAGTCCACACAAACAGAGATCGAATTACAAGTTAAACTTAATGAAGCTTTGGAACTGATTGAAGTGCAAGATAGAAATCACCAAGTGTTAGCTTCACAAGTG
GAACGAATGCAAAAGCTGATAAAAGACTTCACTCGAATTTTCGACACACAAAAGACGTCGAGAAAAAAAGTCGGAAGAGGCTTTCTTGATGCCGCATTATGCGTCGGCAT
AAGGTCCGTCGGGAGAGGAACTCCCGATGCCACTTTGGCCGACGCATGTCACGACGTCGGGAGCGACATTCTCGACGTCGTGTTGGTACGACGTGAGGAATGCCTCTTCC
GACTCATTTCTCTCGACATTATTCTCGAGGCCGTGAAAAACATCGGCATATCATTTCCAGACATCTTTTTCACTTTCTCCGATGTTTTGTGCGTCGGGAGTACCCCATCT
CTTGTAGTGTACTCCCTCTCTCACCTTCCCTCACCTTTCATTTTCTCTTCTCCAAATCCCTCCATCTCTCTCCCTTCTTGTGTCGAATCCCATTCTCTGAAAGTCTCTGT
CTTCGTCGCTCTTCTTTATTTTGTAGCGTGGAATCGTTTAAAAAACCTTCACCAAAACCCTTTTCCTAAACCATTTGTCTCCATCTCTCTCCTTCTTCAAAACCCTTCTT
TGAAATCCTCCTTCGTTTCCTCACATATTCGATTCCAAATCCTACTTCGAAATCGTTCGTCTTGGTCTCCTTCCTTCTCTGAAACCCTCCTTTGTTTCCTTACATGTTCA
ATTTTAAGCCCTTCTCTAAAACCCTACGTCTCCATCTCCCTCCTTCTCCAAAATCCTCATTCATTTCCTTACATGTTCGATTTGGTTGGGTGGGCGAGACTTATTGGTGT
GGCATTAGGCGAGCAAACCTCCAAGGATGTTGTCATACACGACCATGGTTCACCTAGGCGAGAAAGTAAGGCTGGTGGTTGTCCTAAGTTAGTCGTTTATAAGTTGGGGT
TGTTGGTCAATTTTGAGGCGAGACAAATAAAGTTTGAGCTGAAGAAAGGAAGAAAGTGTGATGCGTTTAGACCTGTGATGGACATACAACTTACGCGTAAGGCACGCATC
TTTTTGGATATCGACAACTCTGCTGAAGCACAATTTATCTATATCTCGTCGAATATTAACTTAATCAACTCAAAACATGAGCAACATTAA
mRNA sequence
ATGCACAATGCATTGGGAATGACGTCAAGAATAAGGGAACTGAAAACGAAAGGGAAAGGGAGAAGAGAGAAACAAAGATTTGTCGCCGTCCGTAGCTCACTGTCGTTCAT
TTTGTCATCGTTGTCGTCCCTCGCCGTTCCGCTGCAGTCTGCCACCTTAGACTCTATGTTCCTCGAGTTTGAGGATGATTTAGATAACATCGCGGGAGGGTCGTCGTCTG
TGGGCGACAATGCGGAGTCTTCTTCTCAACAACCTGCGACTCTAACTTCTAGGAGATGTGCGCAGTCTCGACTGTTGGAGTTGGAGCGTCACGTTGCAATAAATGGATGT
ATTCTGATGACGATCACCCTTAGAGCGGAGAAGCCTATTTCCCCACATGCCATTCGCTTCAGCCAACGATTTTTTGTGCTTGATTTCAATGATCAAGCTATGAATAGGTT
TGTTCAGCATCAGATGCACACGACCTTTTTTAAAGAGTTCCAGACCGACTATCACAGACACTTCAAAAAGTACAGCGACCCGGAGGAAGCTCGTGCCAACCCACCAAACG
CATTGGAGCAATCACGAACGAACAAGGCTGCTAGACAGAAGCATCCTTATAATCATAGTAGCGGGTCCAAGTTGTTTCTACAACGACAGTGCGAGCTCACTGAGAAAAAA
GGGAAGTTGATCGGTCGTGTGGAGTTATTTCAGAAAATACACTTGCAAAATCAAATGCTGGAACTCCAATCCCAACCTACCCCAAAGGGTAGTCAGCCACTCTCTAAGAA
TGAGATATGTGATCAGGTGTTGGATAGACGATCAGGCTACTTAAAAGGTCTTGGTTGGGGACCCAAGCCGAAGGCCCATAGGACGACGAGTGCTAGCAGTTCCTCGACAT
TTTATTTGCAGTCCACACAAACAGAGATCGAATTACAAGTTAAACTTAATGAAGCTTTGGAACTGATTGAAGTGCAAGATAGAAATCACCAAGTGTTAGCTTCACAAGTG
GAACGAATGCAAAAGCTGATAAAAGACTTCACTCGAATTTTCGACACACAAAAGACGTCGAGAAAAAAAGTCGGAAGAGGCTTTCTTGATGCCGCATTATGCGTCGGCAT
AAGGTCCGTCGGGAGAGGAACTCCCGATGCCACTTTGGCCGACGCATGTCACGACGTCGGGAGCGACATTCTCGACGTCGTGTTGGTACGACGTGAGGAATGCCTCTTCC
GACTCATTTCTCTCGACATTATTCTCGAGGCCGTGAAAAACATCGGCATATCATTTCCAGACATCTTTTTCACTTTCTCCGATGTTTTGTGCGTCGGGAGTACCCCATCT
CTTGTAGTGTACTCCCTCTCTCACCTTCCCTCACCTTTCATTTTCTCTTCTCCAAATCCCTCCATCTCTCTCCCTTCTTGTGTCGAATCCCATTCTCTGAAAGTCTCTGT
CTTCGTCGCTCTTCTTTATTTTGTAGCGTGGAATCGTTTAAAAAACCTTCACCAAAACCCTTTTCCTAAACCATTTGTCTCCATCTCTCTCCTTCTTCAAAACCCTTCTT
TGAAATCCTCCTTCGTTTCCTCACATATTCGATTCCAAATCCTACTTCGAAATCGTTCGTCTTGGTCTCCTTCCTTCTCTGAAACCCTCCTTTGTTTCCTTACATGTTCA
ATTTTAAGCCCTTCTCTAAAACCCTACGTCTCCATCTCCCTCCTTCTCCAAAATCCTCATTCATTTCCTTACATGTTCGATTTGGTTGGGTGGGCGAGACTTATTGGTGT
GGCATTAGGCGAGCAAACCTCCAAGGATGTTGTCATACACGACCATGGTTCACCTAGGCGAGAAAGTAAGGCTGGTGGTTGTCCTAAGTTAGTCGTTTATAAGTTGGGGT
TGTTGGTCAATTTTGAGGCGAGACAAATAAAGTTTGAGCTGAAGAAAGGAAGAAAGTGTGATGCGTTTAGACCTGTGATGGACATACAACTTACGCGTAAGGCACGCATC
TTTTTGGATATCGACAACTCTGCTGAAGCACAATTTATCTATATCTCGTCGAATATTAACTTAATCAACTCAAAACATGAGCAACATTAA
Protein sequence
MHNALGMTSRIRELKTKGKGRREKQRFVAVRSSLSFILSSLSSLAVPLQSATLDSMFLEFEDDLDNIAGGSSSVGDNAESSSQQPATLTSRRCAQSRLLELERHVAINGC
ILMTITLRAEKPISPHAIRFSQRFFVLDFNDQAMNRFVQHQMHTTFFKEFQTDYHRHFKKYSDPEEARANPPNALEQSRTNKAARQKHPYNHSSGSKLFLQRQCELTEKK
GKLIGRVELFQKIHLQNQMLELQSQPTPKGSQPLSKNEICDQVLDRRSGYLKGLGWGPKPKAHRTTSASSSSTFYLQSTQTEIELQVKLNEALELIEVQDRNHQVLASQV
ERMQKLIKDFTRIFDTQKTSRKKVGRGFLDAALCVGIRSVGRGTPDATLADACHDVGSDILDVVLVRREECLFRLISLDIILEAVKNIGISFPDIFFTFSDVLCVGSTPS
LVVYSLSHLPSPFIFSSPNPSISLPSCVESHSLKVSVFVALLYFVAWNRLKNLHQNPFPKPFVSISLLLQNPSLKSSFVSSHIRFQILLRNRSSWSPSFSETLLCFLTCS
ILSPSLKPYVSISLLLQNPHSFPYMFDLVGWARLIGVALGEQTSKDVVIHDHGSPRRESKAGGCPKLVVYKLGLLVNFEARQIKFELKKGRKCDAFRPVMDIQLTRKARI
FLDIDNSAEAQFIYISSNINLINSKHEQH
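
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and an in-frame CDS; the function name `translate` is a hypothetical helper, and only the first and last codons of the CDS listed above are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame CDS; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the CDS listed above.
print(translate("ATGCACAATGCATTGGGAATGACG"))  # MHNALGMT
print(translate("CATGAGCAACATTAA"))           # HEQH
```

Both fragments agree with the start (`MHNALGMT`) and end (`HEQH`) of the protein sequence given above; pasting the full CDS into `translate` would extend the same check to the whole record.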