; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0004310 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0004310
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionCACTA en-spm transposon protein
Genome locationchr10:17282382..17287335
RNA-Seq ExpressionIVF0004310
SyntenyIVF0004310
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant
IPR025452 - Domain of unknown function DUF4218


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033025.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.23e-29770.61Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIERSLRTLKQYVRNKARPEGSIAEAYVMN+SSTFCS YLRGIETRF RDERNDDTIVE EVIGDFEI KQKV+PL ASSVRAIS+E+KRLFHWY++N
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSI-KKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGE
        NAD+     KKHLRLQRRHAQNSMDLYKIH R FPEWF+A  LELR+SANLSDDFFSL MG SFNVRCYNGCIVGGV F TIELDSRRTTQNSGIM+IGE
Subjt:  NADKYQSI-KKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGE

Query:  SDASGTSDNNFYGVLDEVLHVQYP----LQLESCASHP---------------------NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRN
        SDAS TS+NNFYGVLDEVLHVQYP    + L  C  +                      NK IWDVPEVEDVQNDHINILEVVVSHQVDDHIED+TLC N
Subjt:  SDASGTSDNNFYGVLDEVLHVQYP----LQLESCASHP---------------------NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRN

Query:  DVDPTIVERSV------------------------------------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRI
        DVDPTIVER V                                    FED+LDNIAGGSSFVGDNT GSSSQQ  T  PRRRAQSRLLELERHVAINGRI
Subjt:  DVDPTIVERSV------------------------------------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRI

Query:  PMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHA
        PMTIAP  EKPIS HAVHFSQAIGVCV+KIFP+R  KWAD  REY EVVKGDLQ+LFVLDFNDQAMNRFVEHQML TFKEF+ DCHRHFKKYSD EEA A
Subjt:  PMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHA

Query:  NPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQP
        NPPNAL                                                +ELAER+G+SVDRVELF ETH+RA TFV QA ED H+QML+LQSQP
Subjt:  NPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQP

Query:  TPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL
        T E SQPL EDEICDQVLGRRP YSKGLGWGPK KA +  SA SSSTSCSQST+KEIELQAKL+EAL
Subjt:  TPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL

KAA0056494.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]5.72e-27068.25Show/hide
Query:  IVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLA
        +++ +VI DFEIFKQKVRPLGASSVRAIS+EEKRLFHWYILNNAD+                     Y+ H R FPEWFRAQVLELR+SANLS+DFFSLA
Subjt:  IVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLA

Query:  MGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPLQ----LESCA-------------------SHPNK
        MG SF++RCYNGCIV GV F T+ELDSRRTTQNSGIMIIGESDASGT DN+FYGVLDEVLHVQYPL     L  C                    S    
Subjt:  MGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPLQ----LESCA-------------------SHPNK

Query:  RIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-------------------------------------
        R W   E             ++D +N      E+VVSHQVDDHI+DDTLCRNDVDPTIVER +                                     
Subjt:  RIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-------------------------------------

Query:  ---FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVERE
           FED+LDNIAG SS +G+NT GSSSQQ  TPT RRR QSRLLELERHVAINGRIPMTIAP  EKPISPHA+HF QAIGVCVRK F + CLKWADV RE
Subjt:  ---FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVERE

Query:  YFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHS
        Y EVVKGDLQ+LFVLDFNDQAMNRFVEHQM TTFKEF +DCHRHFKKYSD EEA ANPPNALVGR EDWHFL  HYIS AFQEQSRTNK+ARQKQPYNHS
Subjt:  YFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHS

Query:  SGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASS
        SGSKSFLQRQ+ELAER+G+ V+RVELF+ETH+RAGTF+SQAAED H+QML+LQSQPTP+GSQPLSEDEICDQVLGRRP YSKGLGWG KPKARR TSASS
Subjt:  SGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASS

Query:  SSTSCSQSTEKEIELQAKLYEALNGLNAPE
        SSTSCSQSTEKEIELQAKL EAL  +   +
Subjt:  SSTSCSQSTEKEIELQAKLYEALNGLNAPE

KAA0066000.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]0.086.96Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
        NADK                                   QVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
Subjt:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES

Query:  DASGTSDNNFYGVLDEVLHVQYPLQ--LESCASHP--------------------NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP
        DASGTSDNNFYGVLDEVLHVQYPL+  + +  +H                     NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP
Subjt:  DASGTSDNNFYGVLDEVLHVQYPLQ--LESCASHP--------------------NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP

Query:  TIVERSV-----------FEDNLDNIAGGSSFVGDNT-GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI
        TIVERSV           FEDNLDNIAGGSSFVGDNT GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI
Subjt:  TIVERSV-----------FEDNLDNIAGGSSFVGDNT-GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI

Query:  FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR
        FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR
Subjt:  FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR

Query:  TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW
        TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW
Subjt:  TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW

Query:  GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE
        GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL  +   +
Subjt:  GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE

KAA0066286.1 uncharacterized protein E6C27_scaffold21G003350 [Cucumis melo var. makuwa]2.18e-23765.08Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIER LRTLKQYVRNKARPEGSIAE YVMN+SSTF SRYLRGIETRF RDERNDDTIVE+EV                                    
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
                                                 LELRQSANLSDDFFSLAMG SF+VRCYNGCIVGGV F TI+LDSRRTTQNSGIM+IGES
Subjt:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES

Query:  DASGTSDNNFYGVLDEVLHVQYPLQLESCASHPNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSVFEDNLDNIAGGSSFV
        DASGT DNNFYGVLDEVLHVQYPL                                                RN           FE++LDNIAGGSS +
Subjt:  DASGTSDNNFYGVLDEVLHVQYPLQLESCASHPNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSVFEDNLDNIAGGSSFV

Query:  GDNTGSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQ
        GDNTGSS QQTTPTPRR AQSRLLELERHVAINGRIPMTIAP  EKPISPHAV FSQAIGVC                            +LFVLDFNDQ
Subjt:  GDNTGSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQ

Query:  AMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKS
        AMNRFVEHQMLTTFKEFQ DCHRHFKKYSD EEA ANPPNALVGRDEDWHFLC HYISRAFQEQSRTNKAARQKQPYNHSSGSK FLQRQ+ELAERRG+ 
Subjt:  AMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKS

Query:  VDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLY
        VDRVELFRETH+RAGTFVSQAAED H+QML+LQSQPTPEGSQPLSEDEIC+QVLGRRP YSKGLGWGPKPKARRT SASSSSTS  QST+KEIELQAKL+
Subjt:  VDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLY

Query:  EALNGLNAPE
        EAL  +   +
Subjt:  EALNGLNAPE

TYK29029.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.83e-27668.97Show/hide
Query:  RDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANL
        RDE+NDDTIVE +VI DFEIFKQKVRPLGASSVRAIS+EEKRLFHWYILNNAD+                     Y+ H R FPEWFRAQVLELR+SANL
Subjt:  RDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANL

Query:  SDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPLQ----LESCA----------------
        S+DFFSLAMG SF++RCYNGCIV GV F T+ELDSRRTTQNSGIMIIGESDASGT DN+FYGVLDEVLHVQYPL     L  C                 
Subjt:  SDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPLQ----LESCA----------------

Query:  ---SHPNKRIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-----------------------------
           S    R W   E             ++D +N      E+VVSHQVDDHI+DDTLCRNDVDPTIVER +                             
Subjt:  ---SHPNKRIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-----------------------------

Query:  -----------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCL
                   FED+LDNIAG SS +G+NT GSSSQQ  TPT RRR QSRLLELERHVAINGRIPMTIAP  EKPISPHA+HF QAIGVCVRK F + CL
Subjt:  -----------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCL

Query:  KWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAAR
        KWADV REY EVVKGDLQ+LFVLDFNDQAMNRFVEHQM TTFKEF +DCHRHFKKYSD EEA ANPPNALVGR EDWHFL  HYIS AFQEQSRTNK+AR
Subjt:  KWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAAR

Query:  QKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKA
        QKQPYNHSSGSKSFLQRQ+ELAER+G+ V+RVELF+ETH+RAGTF+SQAAED H+QML+LQSQPTP+GSQPLSEDEICDQVLGRRP YSKGLGWG KPKA
Subjt:  QKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKA

Query:  RRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE
        RR TSASSSSTSCSQSTEKEIELQAKL EAL  +   +
Subjt:  RRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE

TrEMBL top hitse value%identityAlignment
A0A5A7SV41 CACTA en-spm transposon protein8.7e-24670.61Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIERSLRTLKQYVRNKARPEGSIAEAYVMN+SSTFCS YLRGIETRF RDERNDDTIVE EVIGDFEI KQKV+PL ASSVRAIS+E+KRLFHWY++N
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSI-KKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGE
        NAD+     KKHLRLQRRHAQNSMDLYKIH R FPEWF+A  LELR+SANLSDDFFSL MG SFNVRCYNGCIVGGV F TIELDSRRTTQNSGIM+IGE
Subjt:  NADKYQSI-KKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGE

Query:  SDASGTSDNNFYGVLDEVLHVQYP----LQLESCASH---------------------PNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRN
        SDAS TS+NNFYGVLDEVLHVQYP    + L  C  +                      NK IWDVPEVEDVQNDHINILEVVVSHQVDDHIED+TLC N
Subjt:  SDASGTSDNNFYGVLDEVLHVQYP----LQLESCASH---------------------PNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRN

Query:  DVDPTIVERSV------------------------------------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRI
        DVDPTIVER V                                    FED+LDNIAGGSSFVGDNT GSSSQQ  T  PRRRAQSRLLELERHVAINGRI
Subjt:  DVDPTIVERSV------------------------------------FEDNLDNIAGGSSFVGDNT-GSSSQQ-TTPTPRRRAQSRLLELERHVAINGRI

Query:  PMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHA
        PMTIAP  EKPIS HAVHFSQAIGVCV+KIFP+R  KWAD  REY EVVKGDLQ+LFVLDFNDQAMNRFVEHQML TFKEF+ DCHRHFKKYSD EEA A
Subjt:  PMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHA

Query:  NPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQP
        NPPNAL                                                +ELAER+G+SVDRVELF ETH+RA TFV QA ED H+QML+LQSQP
Subjt:  NPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQP

Query:  TPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL
        T E SQPL EDEICDQVLGRRP YSKGLGWGPK KA +  SA SSSTSCSQST+KEIELQAKL+EAL
Subjt:  TPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL

A0A5A7UMS8 CACTA en-spm transposon protein4.2e-22468.25Show/hide
Query:  IVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLA
        +++ +VI DFEIFKQKVRPLGASSVRAIS+EEKRLFHWYILNNAD+                     Y+ H R FPEWFRAQVLELR+SANLS+DFFSLA
Subjt:  IVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLA

Query:  MGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPL----QLESC-------------------ASHPNK
        MG SF++RCYNGCIV GV F T+ELDSRRTTQNSGIMIIGESDASGT DN+FYGVLDEVLHVQYPL     L  C                    S    
Subjt:  MGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPL----QLESC-------------------ASHPNK

Query:  RIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-------------------------------------
        R W   E             ++D +N      E+VVSHQVDDHI+DDTLCRNDVDPTIVER +                                     
Subjt:  RIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-------------------------------------

Query:  ---FEDNLDNIAGGSSFVGDN-TGSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVERE
           FED+LDNIAG SS +G+N TGSSSQQ  TPT RRR QSRLLELERHVAINGRIPMTIAP  EKPISPHA+HF QAIGVCVRK F + CLKWADV RE
Subjt:  ---FEDNLDNIAGGSSFVGDN-TGSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVERE

Query:  YFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHS
        Y EVVKGDLQ+LFVLDFNDQAMNRFVEHQM TTFKEF +DCHRHFKKYSD EEA ANPPNALVGR EDWHFL  HYIS AFQEQSRTNK+ARQKQPYNHS
Subjt:  YFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHS

Query:  SGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASS
        SGSKSFLQRQ+ELAER+G+ V+RVELF+ETH+RAGTF+SQAAED H+QML+LQSQPTP+GSQPLSEDEICDQVLGRRP YSKGLGWG KPKARR TSASS
Subjt:  SGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASS

Query:  SSTSCSQSTEKEIELQAKLYEALNGLNAPE
        SSTSCSQSTEKEIELQAKL EAL  +   +
Subjt:  SSTSCSQSTEKEIELQAKLYEALNGLNAPE

A0A5A7VIW1 DUF4218 domain-containing protein1.6e-19965.08Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIER LRTLKQYVRNKARPEGSIAE YVMN+SSTF SRYLRGIETRF RDERNDDTIVE+E                                     
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
                                                VLELRQSANLSDDFFSLAMG SF+VRCYNGCIVGGV F TI+LDSRRTTQNSGIM+IGES
Subjt:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES

Query:  DASGTSDNNFYGVLDEVLHVQYPLQLESCASHPNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSVFEDNLDNIAGGSSFV
        DASGT DNNFYGVLDEVLHVQYPL                                                RN           FE++LDNIAGGSS +
Subjt:  DASGTSDNNFYGVLDEVLHVQYPLQLESCASHPNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSVFEDNLDNIAGGSSFV

Query:  GDNTGSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQ
        GDNTGSS QQTTPTPRR AQSRLLELERHVAINGRIPMTIAP  EKPISPHAV FSQAIGVC                            +LFVLDFNDQ
Subjt:  GDNTGSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQ

Query:  AMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKS
        AMNRFVEHQMLTTFKEFQ DCHRHFKKYSD EEA ANPPNALVGRDEDWHFLC HYISRAFQEQSRTNKAARQKQPYNHSSGSK FLQRQ+ELAERRG+ 
Subjt:  AMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKS

Query:  VDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLY
        VDRVELFRETH+RAGTFVSQAAED H+QML+LQSQPTPEGSQPLSEDEIC+QVLGRRP YSKGLGWGPKPKARRT SASSSSTS  QST+KEIELQAKL+
Subjt:  VDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLY

Query:  EALNGLNAPE
        EAL  +   +
Subjt:  EALNGLNAPE

A0A5D3DZR1 CACTA en-spm transposon protein1.5e-30986.96Show/hide
Query:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
        MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN
Subjt:  MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILN

Query:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
        NADK                                   QVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES
Subjt:  NADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGES

Query:  DASGTSDNNFYGVLDEVLHVQYPLQ--LESCASH--------------------PNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP
        DASGTSDNNFYGVLDEVLHVQYPL+  + +  +H                     NKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP
Subjt:  DASGTSDNNFYGVLDEVLHVQYPLQ--LESCASH--------------------PNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDP

Query:  TIVERSV-----------FEDNLDNIAGGSSFVGDNT-GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI
        TIVERSV           FEDNLDNIAGGSSFVGDNT GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI
Subjt:  TIVERSV-----------FEDNLDNIAGGSSFVGDNT-GSSSQQTTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKI

Query:  FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR
        FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR
Subjt:  FPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSR

Query:  TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW
        TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW
Subjt:  TNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGW

Query:  GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE
        GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEAL  +   +
Subjt:  GPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE

A0A5D3E035 CACTA en-spm transposon protein1.5e-22968.97Show/hide
Query:  RDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANL
        RDE+NDDTIVE +VI DFEIFKQKVRPLGASSVRAIS+EEKRLFHWYILNNAD+                     Y+ H R FPEWFRAQVLELR+SANL
Subjt:  RDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKKHLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANL

Query:  SDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPL----QLESC-----------------
        S+DFFSLAMG SF++RCYNGCIV GV F T+ELDSRRTTQNSGIMIIGESDASGT DN+FYGVLDEVLHVQYPL     L  C                 
Subjt:  SDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHVQYPL----QLESC-----------------

Query:  --ASHPNKRIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-----------------------------
           S    R W   E             ++D +N      E+VVSHQVDDHI+DDTLCRNDVDPTIVER +                             
Subjt:  --ASHPNKRIWDVPE-------------VEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSV-----------------------------

Query:  -----------FEDNLDNIAGGSSFVGDN-TGSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCL
                   FED+LDNIAG SS +G+N TGSSSQQ  TPT RRR QSRLLELERHVAINGRIPMTIAP  EKPISPHA+HF QAIGVCVRK F + CL
Subjt:  -----------FEDNLDNIAGGSSFVGDN-TGSSSQQ-TTPTPRRRAQSRLLELERHVAINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCL

Query:  KWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAAR
        KWADV REY EVVKGDLQ+LFVLDFNDQAMNRFVEHQM TTFKEF +DCHRHFKKYSD EEA ANPPNALVGR EDWHFL  HYIS AFQEQSRTNK+AR
Subjt:  KWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPNALVGRDEDWHFLCVHYISRAFQEQSRTNKAAR

Query:  QKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKA
        QKQPYNHSSGSKSFLQRQ+ELAER+G+ V+RVELF+ETH+RAGTF+SQAAED H+QML+LQSQPTP+GSQPLSEDEICDQVLGRRP YSKGLGWG KPKA
Subjt:  QKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEICDQVLGRRPCYSKGLGWGPKPKA

Query:  RRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE
        RR TSASSSSTSCSQSTEKEIELQAKL EAL  +   +
Subjt:  RRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCCGATTGAAAGAAGTCTACGCACGTTAAAGCAGTATGTTAGAAACAAAGCACGTCCTGAGGGGTCAATTGCAGAAGCCTATGTCATGAATAAATCTAGCACCTT
TTGTTCACGTTACCTACGTGGTATAGAGACTCGATTCATAAGAGATGAGCGAAATGATGATACCATTGTAGAGAGCGAGGTAATTGGTGACTTTGAAATTTTCAAGCAGA
AAGTACGACCCTTAGGTGCATCAAGTGTTCGTGCTATATCAGAGGAAGAGAAACGACTCTTCCATTGGTACATACTGAATAATGCTGACAAATATCAGAGTATCAAGAAA
CATTTGAGGCTCCAACGTCGACATGCTCAAAATTCTATGGATTTGTATAAAATACATGGACGGGAATTCCCTGAATGGTTTCGAGCACAGGTGTTAGAACTGCGTCAGTC
TGCAAACCTATCTGATGATTTCTTCTCACTAGCGATGGGACTATCCTTTAACGTTCGTTGTTACAATGGATGCATCGTAGGTGGTGTAATATTTCGCACTATTGAACTTG
ATTCCCGACGTACTACTCAAAATAGTGGAATAATGATCATTGGTGAAAGCGATGCAAGTGGAACTAGCGACAATAATTTCTACGGTGTTCTGGACGAAGTGTTGCACGTA
CAATATCCGTTGCAATTGGAAAGTTGTGCAAGTCATCCAAATAAGCGTATTTGGGACGTGCCAGAAGTGGAGGATGTTCAGAATGACCATATTAATATACTAGAAGTTGT
TGTAAGCCACCAAGTGGATGACCACATTGAGGATGACACTCTATGCAGAAATGACGTTGATCCCACAATCGTTGAAAGATCGGTTTTTGAGGATAATTTAGATAACATTG
CGGGAGGATCGTCATTTGTGGGCGACAATACGGGGTCTTCTTCTCAACAAACGACTCCGACTCCTAGGAGACGTGCGCAATCTCGACTCTTGGAGTTAGAGCGCCACGTT
GCAATAAATGGGCGCATTCCGATGACGATCGCCCCTGAAGAGGAGAAGCCTATTTCTCCACACGCCGTTCACTTCAGCCAGGCGATAGGTGTGTGCGTGCGAAAGATATT
TCCCCTCCGCTGTCTTAAGTGGGCAGACGTTGAGAGAGAATACTTTGAGGTCGTCAAGGGTGACCTCCAGCAATTGTTTGTGCTTGATTTCAATGATCAAGCAATGAACA
GGTTTGTTGAGCATCAGATGCTCACGACCTTTAAAGAGTTCCAGGTCGACTGTCATAGACATTTCAAAAAGTACAGCGACTCGGAGGAGGCTCATGCCAACCCACCAAAC
GCATTGGTTGGACGTGATGAGGATTGGCACTTCCTCTGCGTCCATTATATCAGCCGTGCATTCCAGGAGCAATCACGGACGAACAAGGCTGCTAGACAGAAGCAGCCTTA
CAATCATAGTAGCGGGTCCAAGTCGTTTCTACAACGACAGCATGAGCTTGCTGAAAGAAGAGGGAAGTCGGTCGATCGTGTGGAATTGTTCCGGGAAACACACATTCGAG
CTGGGACATTTGTGTCGCAGGCCGCCGAGGATACACATGATCAAATGCTGAAACTCCAATCCCAGCCTACCCCAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGC
GATCAGGTGTTGGGTAGACGACCATGCTACTCAAAAGGCCTTGGTTGGGGACCCAAGCCGAAGGCCCGCAGAACGACGAGTGCAAGCAGTTCGTCGACATCTTGTTCGCA
GTCCACAGAAAAAGAGATTGAATTACAAGCTAAACTTTATGAAGCTTTGAACGGATTGAATGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACGT
ATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACGTATCAGTCTAGTGCTCCCGAAGGATACACGTATCAGTCTAGTGCTCCCGAA
GGATGCACGTATCCCTGGACTCGGGGTCCAGTAGTAGGAAAACCTTACCTGAAACTCGGTTATGCCCCTAGATCGAAACCACGCTCAACAGATCCACCTAAACGAAACAC
GAAAGTTTTAGTAGCTGACTTTAGTAGAAGCCGTTTTAAAAAGTCAGAAGCGATATCCGTTGACTTACCCAAGGAAAGGAATCTATCTCTAACCAGGCCTGCCGCGGAAC
CCGAGAACTTAAGCGTCAACTCTGTACTACCTTCAAGGGAGAAAACAACATTAGAGAATTCATCGAAAACATCCTTACCGAAGACTCACCGTGACCCGGAGCGGAGGAAG
GAAGGCTTAGGTGGCTTGGCGAAACAAGGAGCTCGACTCGACTCGGCTCGGCTTGGTCTCGGCTCGGCTCGGCTCCACCTCGGCTCGGCTCGGCTTGGTCTCGGCTCGGC
TCGGCTTATGGCTCGGCTCGGCTCGGCTCGGCTTGGTCTCGGCTCGGCTCGGCTTGTGGCTCGGCTCGGCTCGGCTTGGCTTGGTCTCGGCTCAGCTCGGCTTGGACTCG
GCTCGGCTCTGCTTACTCGGCTCGCGGCTCGGCTCGCGGCTCGCTCGGCTTGGGGCTCGCTCGGCTTGGGGCTCGGCTCGCGGCTCGCTCGGCTCGGCTTACTCAGCTCG
GCTCACCCGGATTGTGTGGCTCGGATCGGAAACGGGCTCGGGTCAGTTGGAACCGGGTCGGGTCGGGCGAAACGGGCAGCACGATTCCGCGGCTTCTTTTCTTTCCGGCG
AACGACGGCGCTTCCTCGTGCTCTGTCGGCGTTGGAGACACGAGCTGATGACGGAGAGTACGCCGGCGGTGGTAGAACGACGAGAGAAAGATGGAGAAGAAAAGGAAAAT
GGATGGTTCGGGTGGCGGTGGCGCGGCGGAGAGAAATCGAAAACGGAGGGAAAGGAAGGGAGATGGTGGCGGCGTCGGCTTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATCCGATTGAAAGAAGTCTACGCACGTTAAAGCAGTATGTTAGAAACAAAGCACGTCCTGAGGGGTCAATTGCAGAAGCCTATGTCATGAATAAATCTAGCACCTT
TTGTTCACGTTACCTACGTGGTATAGAGACTCGATTCATAAGAGATGAGCGAAATGATGATACCATTGTAGAGAGCGAGGTAATTGGTGACTTTGAAATTTTCAAGCAGA
AAGTACGACCCTTAGGTGCATCAAGTGTTCGTGCTATATCAGAGGAAGAGAAACGACTCTTCCATTGGTACATACTGAATAATGCTGACAAATATCAGAGTATCAAGAAA
CATTTGAGGCTCCAACGTCGACATGCTCAAAATTCTATGGATTTGTATAAAATACATGGACGGGAATTCCCTGAATGGTTTCGAGCACAGGTGTTAGAACTGCGTCAGTC
TGCAAACCTATCTGATGATTTCTTCTCACTAGCGATGGGACTATCCTTTAACGTTCGTTGTTACAATGGATGCATCGTAGGTGGTGTAATATTTCGCACTATTGAACTTG
ATTCCCGACGTACTACTCAAAATAGTGGAATAATGATCATTGGTGAAAGCGATGCAAGTGGAACTAGCGACAATAATTTCTACGGTGTTCTGGACGAAGTGTTGCACGTA
CAATATCCGTTGCAATTGGAAAGTTGTGCAAGTCATCCAAATAAGCGTATTTGGGACGTGCCAGAAGTGGAGGATGTTCAGAATGACCATATTAATATACTAGAAGTTGT
TGTAAGCCACCAAGTGGATGACCACATTGAGGATGACACTCTATGCAGAAATGACGTTGATCCCACAATCGTTGAAAGATCGGTTTTTGAGGATAATTTAGATAACATTG
CGGGAGGATCGTCATTTGTGGGCGACAATACGGGGTCTTCTTCTCAACAAACGACTCCGACTCCTAGGAGACGTGCGCAATCTCGACTCTTGGAGTTAGAGCGCCACGTT
GCAATAAATGGGCGCATTCCGATGACGATCGCCCCTGAAGAGGAGAAGCCTATTTCTCCACACGCCGTTCACTTCAGCCAGGCGATAGGTGTGTGCGTGCGAAAGATATT
TCCCCTCCGCTGTCTTAAGTGGGCAGACGTTGAGAGAGAATACTTTGAGGTCGTCAAGGGTGACCTCCAGCAATTGTTTGTGCTTGATTTCAATGATCAAGCAATGAACA
GGTTTGTTGAGCATCAGATGCTCACGACCTTTAAAGAGTTCCAGGTCGACTGTCATAGACATTTCAAAAAGTACAGCGACTCGGAGGAGGCTCATGCCAACCCACCAAAC
GCATTGGTTGGACGTGATGAGGATTGGCACTTCCTCTGCGTCCATTATATCAGCCGTGCATTCCAGGAGCAATCACGGACGAACAAGGCTGCTAGACAGAAGCAGCCTTA
CAATCATAGTAGCGGGTCCAAGTCGTTTCTACAACGACAGCATGAGCTTGCTGAAAGAAGAGGGAAGTCGGTCGATCGTGTGGAATTGTTCCGGGAAACACACATTCGAG
CTGGGACATTTGTGTCGCAGGCCGCCGAGGATACACATGATCAAATGCTGAAACTCCAATCCCAGCCTACCCCAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGC
GATCAGGTGTTGGGTAGACGACCATGCTACTCAAAAGGCCTTGGTTGGGGACCCAAGCCGAAGGCCCGCAGAACGACGAGTGCAAGCAGTTCGTCGACATCTTGTTCGCA
GTCCACAGAAAAAGAGATTGAATTACAAGCTAAACTTTATGAAGCTTTGAACGGATTGAATGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACGT
ATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACGTATCAGTCTAGTGCTCCCGAAGGATACACGTATCAGTCTAGTGCTCCCGAA
GGATGCACGTATCCCTGGACTCGGGGTCCAGTAGTAGGAAAACCTTACCTGAAACTCGGTTATGCCCCTAGATCGAAACCACGCTCAACAGATCCACCTAAACGAAACAC
GAAAGTTTTAGTAGCTGACTTTAGTAGAAGCCGTTTTAAAAAGTCAGAAGCGATATCCGTTGACTTACCCAAGGAAAGGAATCTATCTCTAACCAGGCCTGCCGCGGAAC
CCGAGAACTTAAGCGTCAACTCTGTACTACCTTCAAGGGAGAAAACAACATTAGAGAATTCATCGAAAACATCCTTACCGAAGACTCACCGTGACCCGGAGCGGAGGAAG
GAAGGCTTAGGTGGCTTGGCGAAACAAGGAGCTCGACTCGACTCGGCTCGGCTTGGTCTCGGCTCGGCTCGGCTCCACCTCGGCTCGGCTCGGCTTGGTCTCGGCTCGGC
TCGGCTTATGGCTCGGCTCGGCTCGGCTCGGCTTGGTCTCGGCTCGGCTCGGCTTGTGGCTCGGCTCGGCTCGGCTTGGCTTGGTCTCGGCTCAGCTCGGCTTGGACTCG
GCTCGGCTCTGCTTACTCGGCTCGCGGCTCGGCTCGCGGCTCGCTCGGCTTGGGGCTCGCTCGGCTTGGGGCTCGGCTCGCGGCTCGCTCGGCTCGGCTTACTCAGCTCG
GCTCACCCGGATTGTGTGGCTCGGATCGGAAACGGGCTCGGGTCAGTTGGAACCGGGTCGGGTCGGGCGAAACGGGCAGCACGATTCCGCGGCTTCTTTTCTTTCCGGCG
AACGACGGCGCTTCCTCGTGCTCTGTCGGCGTTGGAGACACGAGCTGATGACGGAGAGTACGCCGGCGGTGGTAGAACGACGAGAGAAAGATGGAGAAGAAAAGGAAAAT
GGATGGTTCGGGTGGCGGTGGCGCGGCGGAGAGAAATCGAAAACGGAGGGAAAGGAAGGGAGATGGTGGCGGCGTCGGCTTCGTGA
Protein sequenceShow/hide protein sequence
MYPIERSLRTLKQYVRNKARPEGSIAEAYVMNKSSTFCSRYLRGIETRFIRDERNDDTIVESEVIGDFEIFKQKVRPLGASSVRAISEEEKRLFHWYILNNADKYQSIKK
HLRLQRRHAQNSMDLYKIHGREFPEWFRAQVLELRQSANLSDDFFSLAMGLSFNVRCYNGCIVGGVIFRTIELDSRRTTQNSGIMIIGESDASGTSDNNFYGVLDEVLHV
QYPLQLESCASHPNKRIWDVPEVEDVQNDHINILEVVVSHQVDDHIEDDTLCRNDVDPTIVERSVFEDNLDNIAGGSSFVGDNTGSSSQQTTPTPRRRAQSRLLELERHV
AINGRIPMTIAPEEEKPISPHAVHFSQAIGVCVRKIFPLRCLKWADVEREYFEVVKGDLQQLFVLDFNDQAMNRFVEHQMLTTFKEFQVDCHRHFKKYSDSEEAHANPPN
ALVGRDEDWHFLCVHYISRAFQEQSRTNKAARQKQPYNHSSGSKSFLQRQHELAERRGKSVDRVELFRETHIRAGTFVSQAAEDTHDQMLKLQSQPTPEGSQPLSEDEIC
DQVLGRRPCYSKGLGWGPKPKARRTTSASSSSTSCSQSTEKEIELQAKLYEALNGLNAPEGCTYQSSAPEGCTYQSSAPEGCTYQSSAPEGCTYQSSAPEGYTYQSSAPE
GCTYPWTRGPVVGKPYLKLGYAPRSKPRSTDPPKRNTKVLVADFSRSRFKKSEAISVDLPKERNLSLTRPAAEPENLSVNSVLPSREKTTLENSSKTSLPKTHRDPERRK
EGLGGLAKQGARLDSARLGLGSARLHLGSARLGLGSARLMARLGSARLGLGSARLVARLGSAWLGLGSARLGLGSALLTRLAARLAARSAWGSLGLGLGSRLARLGLLSS
AHPDCVARIGNGLGSVGTGSGRAKRAARFRGFFSFRRTTALPRALSALETRADDGEYAGGGRTTRERWRRKGKWMVRVAVARRREIENGGKGREMVAASAS