; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022440 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022440
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPlant transposase
Genome locationchr7:28854612..28859283
RNA-Seq ExpressionLag0022440
SyntenyLag0022440
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016901237.1 PREDICTED: uncharacterized protein LOC103493280 isoform X4 [Cucumis melo]9.4e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

XP_038904085.1 uncharacterized protein LOC120090469 isoform X1 [Benincasa hispida]4.6e-17166.6Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS
        M+KK  +T +KRCL  DP A RKRRSKRLKS S+GL T+E+ ++G M EKE D                      PN+   LD THTTPRSPLP  S AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS

Query:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR
        R  TR A++KLASR Q SPI+DRS +VGE+ N SEP +QQ+ KK RGPT+M T A    NKVDITFNE+GQPI EASIG++SFLG LVRE VPVTLNDWR
Subjt:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR

Query:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH
        KLST  KEILWT IQ    + E W                K   +SQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SE+FKSMKKKQLPH
Subjt:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH

Query:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---
        TCSRKGYARLAEEMKKS SDSSSVTRVALWAKAH+KK+GNP+NSQVAE LE IE+ DKEG +TT NNV NDAISKVLGPD  HI  LGFGVT SK S   
Subjt:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---

Query:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE
                  LSQRDS+YA+LEEKYKKMEG+MSEM+SLMS++LKSQGN  EQLS ATN+ MVNN+A NPI SS SSINNNNALRKC L+DWCGTGEVVAE
Subjt:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE

Query:  GRWSSNDPKVIVHHVPLGPHA
        GRWSSNDPKVIVHHVPLGP A
Subjt:  GRWSSNDPKVIVHHVPLGPHA

XP_038904087.1 uncharacterized protein LOC120090469 isoform X2 [Benincasa hispida]4.6e-17166.6Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS
        M+KK  +T +KRCL  DP A RKRRSKRLKS S+GL T+E+ ++G M EKE D                      PN+   LD THTTPRSPLP  S AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS

Query:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR
        R  TR A++KLASR Q SPI+DRS +VGE+ N SEP +QQ+ KK RGPT+M T A    NKVDITFNE+GQPI EASIG++SFLG LVRE VPVTLNDWR
Subjt:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR

Query:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH
        KLST  KEILWT IQ    + E W                K   +SQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SE+FKSMKKKQLPH
Subjt:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH

Query:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---
        TCSRKGYARLAEEMKKS SDSSSVTRVALWAKAH+KK+GNP+NSQVAE LE IE+ DKEG +TT NNV NDAISKVLGPD  HI  LGFGVT SK S   
Subjt:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---

Query:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE
                  LSQRDS+YA+LEEKYKKMEG+MSEM+SLMS++LKSQGN  EQLS ATN+ MVNN+A NPI SS SSINNNNALRKC L+DWCGTGEVVAE
Subjt:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE

Query:  GRWSSNDPKVIVHHVPLGPHA
        GRWSSNDPKVIVHHVPLGP A
Subjt:  GRWSSNDPKVIVHHVPLGPHA

XP_038904088.1 uncharacterized protein LOC120090469 isoform X3 [Benincasa hispida]4.6e-17166.6Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS
        M+KK  +T +KRCL  DP A RKRRSKRLKS S+GL T+E+ ++G M EKE D                      PN+   LD THTTPRSPLP  S AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS

Query:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR
        R  TR A++KLASR Q SPI+DRS +VGE+ N SEP +QQ+ KK RGPT+M T A    NKVDITFNE+GQPI EASIG++SFLG LVRE VPVTLNDWR
Subjt:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR

Query:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH
        KLST  KEILWT IQ    + E W                K   +SQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SE+FKSMKKKQLPH
Subjt:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH

Query:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---
        TCSRKGYARLAEEMKKS SDSSSVTRVALWAKAH+KK+GNP+NSQVAE LE IE+ DKEG +TT NNV NDAISKVLGPD  HI  LGFGVT SK S   
Subjt:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---

Query:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE
                  LSQRDS+YA+LEEKYKKMEG+MSEM+SLMS++LKSQGN  EQLS ATN+ MVNN+A NPI SS SSINNNNALRKC L+DWCGTGEVVAE
Subjt:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE

Query:  GRWSSNDPKVIVHHVPLGPHA
        GRWSSNDPKVIVHHVPLGP A
Subjt:  GRWSSNDPKVIVHHVPLGPHA

XP_038904089.1 uncharacterized protein LOC120090469 isoform X4 [Benincasa hispida]4.6e-17166.6Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS
        M+KK  +T +KRCL  DP A RKRRSKRLKS S+GL T+E+ ++G M EKE D                      PN+   LD THTTPRSPLP  S AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERD--------------------ESPNVRGNLDCTHTTPRSPLPSDSPAS

Query:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR
        R  TR A++KLASR Q SPI+DRS +VGE+ N SEP +QQ+ KK RGPT+M T A    NKVDITFNE+GQPI EASIG++SFLG LVRE VPVTLNDWR
Subjt:  R--TRGAIRKLASRCQASPIVDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWR

Query:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH
        KLST  KEILWT IQ    + E W                K   +SQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SE+FKSMKKKQLPH
Subjt:  KLSTRFKEILWTSIQ----IVESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPH

Query:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---
        TCSRKGYARLAEEMKKS SDSSSVTRVALWAKAH+KK+GNP+NSQVAE LE IE+ DKEG +TT NNV NDAISKVLGPD  HI  LGFGVT SK S   
Subjt:  TCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS---

Query:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE
                  LSQRDS+YA+LEEKYKKMEG+MSEM+SLMS++LKSQGN  EQLS ATN+ MVNN+A NPI SS SSINNNNALRKC L+DWCGTGEVVAE
Subjt:  ---------LLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAE

Query:  GRWSSNDPKVIVHHVPLGPHA
        GRWSSNDPKVIVHHVPLGP A
Subjt:  GRWSSNDPKVIVHHVPLGPHA

TrEMBL top hitse value%identityAlignment
A0A1S4DZ18 uncharacterized protein LOC103493280 isoform X34.5e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

A0A1S4DZ27 uncharacterized protein LOC103493280 isoform X24.5e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

A0A1S4DZ35 uncharacterized protein LOC103493280 isoform X44.5e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X14.5e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

A0A1S4DZT1 uncharacterized protein LOC103493280 isoform X74.5e-15655.67Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-
        MEKK+ +T AKRC+ FDP A +KRRSKRLKS S+G TT          EEGDN T     ++ +D  P       N+  + D THT P SPLP D+ ++ 
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTM---------EEGDNGT---MGEKERDESP-------NVRGNLDCTHTTPRSPLPSDSPAS-

Query:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN
        RT  +     S       +SP  +DRS + GE+ NVSE  +QQ+    RGPTKM+T AI   NKVDI FNE+GQPI EASIG++SFLG LVREVVPVTLN
Subjt:  RTRGAIRKLAS----RCQASPI-VDRSLDVGENTNVSEPILQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLN

Query:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ
        DWRKLSTR KEILW S+Q+     E W                K   +SQIQ+ S  EE++KMKP+NI+S HDW+DFVKEKNSA FKA+SEKFKSMKKKQ
Subjt:  DWRKLSTRFKEILWTSIQI----VESW----------------KISNLSQIQNASNDEEVLKMKPANIQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQ

Query:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS
        LPHTCSRKGYARL EEMKKS  DSS+VTRVA+WAKAHRKKDGNP+NSQVAE LERIE+ID EG NT SNN  N+ ISKVLG DR HI  LGFG T+ K S
Subjt:  LPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAISKVLGPDRGHIRGLGFGVTLSKLS

Query:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN
        LLSQ DS+YA                                                   KLEEKYKKME +MSEM+SLMS +LKSQGN SE LSNATN
Subjt:  LLSQRDSNYA---------------------------------------------------KLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATN

Query:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---
        + +VNN+AI+PI SSP SIN+NNALRKC ++DWCGTGEVVAEGRWSSNDPKVIVHHVPLGP A           +W   DLP   +P    W   S+   
Subjt:  DPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGEVVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSK---

Query:  -RDAAGKSI
         +DA G +I
Subjt:  -RDAAGKSI

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.3e-1040.91Show/hide
Query:  LTDLEVVPNMDVPITLYCDNSGAVANSNEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEGLGL
        LT + +   ++ PI +Y DN G ++ +N P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L A  F    + LGL
Subjt:  LTDLEVVPNMDVPITLYCDNSGAVANSNEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEGLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-1324.71Show/hide
Query:  EMESIYSNSIWELTDLPNGVKPIGCKWIYKSKRDAAGK--------------------------------SIRILLSIATYYDYEIWQMDVKTAFLNG--
        EMES+  N  ++L +LP G +P+ CKW++K K+D   K                                SIR +LS+A   D E+ Q+DVKTAFL+G  
Subjt:  EMESIYSNSIWELTDLPNGVKPIGCKWIYKSKRDAAGK--------------------------------SIRILLSIATYYDYEIWQMDVKTAFLNG--

Query:  ---------------------------------------------------------------------------YLTD-------------IKKWLAAQ
                                                                                   Y+ D             +K  L+  
Subjt:  ---------------------------------------------------------------------------YLTD-------------IKKWLAAQ

Query:  FQMKDLGEAQYVLGIQIIRDRKNKMLALSQATYIDKMLSRYSMQNSKKGLLPFLTDLEVVPNM
        F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    L++   M
Subjt:  FQMKDLGEAQYVLGIQIIRDRKNKMLALSQATYIDKMLSRYSMQNSKKGLLPFLTDLEVVPNM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0538.36Show/hide
Query:  LYCDNSGAVANSNEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEGLGL
        +YCD+  A+  S     H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE   E +G+
Subjt:  LYCDNSGAVANSNEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKVFEGHLEGLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.0e-0827.45Show/hide
Query:  EMESIYSNSIWELTDLPNGVKPIGCKWIYKSKRDAAG--------------------------------KSIRILLSIATYYDYEIWQMDVKTAFLNGYL
        E+ ++ +   WE+  LP   KPIGCKW+YK K ++ G                                 S++++L+I+  Y++ + Q+D+  AFLNG L
Subjt:  EMESIYSNSIWELTDLPNGVKPIGCKWIYKSKRDAAG--------------------------------KSIRILLSIATYYDYEIWQMDVKTAFLNGYL

Query:  TD
         +
Subjt:  TD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAAATAAGAAATACTACTGCCAAAAGGTGTCTTGGGTTCGATCCAGATGCACCACGAAAACGACGCTCTAAACGCTTGAAATCACCTTCAGTAGGCCTAAC
AACAATGGAGGAGGGGGATAATGGAACAATGGGTGAGAAGGAAAGAGATGAGTCACCTAATGTTAGAGGGAATCTTGATTGTACACATACAACTCCAAGATCACCTTTAC
CATCAGATTCGCCAGCATCTCGTACAAGAGGAGCTATTCGAAAGTTAGCTTCTAGATGTCAAGCCTCACCAATTGTAGATAGGTCACTGGATGTAGGAGAAAATACCAAT
GTTTCTGAACCAATTTTGCAACAAGTCCTTAAGAAACGAAGAGGCCCTACAAAAATGAAAACCATTGCAATTGGTAATAAAGTAGATATAACCTTCAATGAGTATGGACA
ACCGATTGAGGAGGCTTCGATTGGCATGGCATCATTTTTAGGTCCACTCGTGAGAGAGGTGGTGCCTGTGACTTTAAATGATTGGAGGAAATTGTCAACAAGATTCAAAG
AAATTTTATGGACATCAATTCAAATTGTGGAGAGCTGGAAAATCTCGAATCTGTCACAAATTCAAAATGCCTCCAACGATGAGGAGGTTCTTAAAATGAAGCCAGCAAAT
ATACAATCTACACACGATTGGGTTGACTTTGTGAAAGAAAAGAACAGTGCAAGATTCAAGGCAAGAAGTGAAAAGTTCAAATCCATGAAGAAGAAGCAACTTCCACATAC
ATGTAGTCGTAAGGGTTATGCTCGATTAGCAGAAGAAATGAAAAAAAGTTGTTCGGATTCATCATCAGTGACAAGAGTCGCATTATGGGCAAAGGCACATAGGAAGAAGG
ATGGAAATCCTATTAACTCACAAGTGGCAGAAACACTGGAGCGTATTGAAAAAATTGACAAAGAAGGGACAAACACTACTTCAAATAACGTGGCCAATGATGCGATAAGT
AAAGTTCTTGGTCCTGATCGTGGTCATATTAGAGGACTTGGATTTGGAGTAACCTTATCAAAGTTGTCTTTATTGTCTCAAAGAGATAGCAATTATGCCAAACTTGAAGA
AAAGTATAAGAAGATGGAGGGACAAATGTCTGAGATGAAATCTTTGATGTCTCACATACTCAAATCTCAAGGTAATGCAAGTGAACAACTTTCTAATGCTACAAATGATC
CTATGGTTAACAACATTGCCATTAACCCAATTGTATCTTCACCTTCGAGTATTAACAATAATAATGCTCTCCGCAAGTGCATGTTGGTAGATTGGTGTGGTACAGGAGAG
GTAGTTGCTGAAGGTCGATGGTCTTCGAATGACCCCAAAGTTATTGTTCATCATGTTCCCCTCGGTCCACACGCCTGCCTGGAAATGGAGTCAATATACTCCAATTCTAT
ATGGGAACTTACAGATCTACCAAATGGGGTAAAACCCATAGGATGCAAATGGATCTATAAGAGTAAAAGAGATGCAGCTGGAAAGTCCATAAGAATTCTCTTATCCATAG
CCACATATTATGACTATGAAATATGGCAAATGGACGTCAAGACTGCCTTTCTGAATGGATACCTCACTGACATTAAGAAATGGCTAGCAGCCCAATTCCAAATGAAAGAT
CTGGGAGAGGCTCAATATGTTCTGGGAATCCAAATCATTAGGGATCGTAAGAACAAAATGCTAGCTCTGTCTCAAGCAACGTATATTGACAAGATGTTGTCTCGATATTC
GATGCAAAACTCCAAGAAGGGACTACTACCCTTCTTGACTGATTTGGAAGTGGTTCCGAACATGGACGTGCCCATAACACTATATTGTGACAATAGTGGGGCTGTAGCCA
ACTCAAATGAACCTCGAAGCCACAAACGAGGTAAACACATCGAGAGGAAGTATCACCTGATACGGGAGATTGTGCAACGAGGAGATGTGATCGTCACTAAGATCGCCTCG
GAGCACAACATCGCTGATCCATTTACGAAGGCTCTCACGGCTAAAGTGTTTGAGGGTCATCTAGAGGGTCTAGGTCTACGAGATATGTATGCCATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGAAAATAAGAAATACTACTGCCAAAAGGTGTCTTGGGTTCGATCCAGATGCACCACGAAAACGACGCTCTAAACGCTTGAAATCACCTTCAGTAGGCCTAAC
AACAATGGAGGAGGGGGATAATGGAACAATGGGTGAGAAGGAAAGAGATGAGTCACCTAATGTTAGAGGGAATCTTGATTGTACACATACAACTCCAAGATCACCTTTAC
CATCAGATTCGCCAGCATCTCGTACAAGAGGAGCTATTCGAAAGTTAGCTTCTAGATGTCAAGCCTCACCAATTGTAGATAGGTCACTGGATGTAGGAGAAAATACCAAT
GTTTCTGAACCAATTTTGCAACAAGTCCTTAAGAAACGAAGAGGCCCTACAAAAATGAAAACCATTGCAATTGGTAATAAAGTAGATATAACCTTCAATGAGTATGGACA
ACCGATTGAGGAGGCTTCGATTGGCATGGCATCATTTTTAGGTCCACTCGTGAGAGAGGTGGTGCCTGTGACTTTAAATGATTGGAGGAAATTGTCAACAAGATTCAAAG
AAATTTTATGGACATCAATTCAAATTGTGGAGAGCTGGAAAATCTCGAATCTGTCACAAATTCAAAATGCCTCCAACGATGAGGAGGTTCTTAAAATGAAGCCAGCAAAT
ATACAATCTACACACGATTGGGTTGACTTTGTGAAAGAAAAGAACAGTGCAAGATTCAAGGCAAGAAGTGAAAAGTTCAAATCCATGAAGAAGAAGCAACTTCCACATAC
ATGTAGTCGTAAGGGTTATGCTCGATTAGCAGAAGAAATGAAAAAAAGTTGTTCGGATTCATCATCAGTGACAAGAGTCGCATTATGGGCAAAGGCACATAGGAAGAAGG
ATGGAAATCCTATTAACTCACAAGTGGCAGAAACACTGGAGCGTATTGAAAAAATTGACAAAGAAGGGACAAACACTACTTCAAATAACGTGGCCAATGATGCGATAAGT
AAAGTTCTTGGTCCTGATCGTGGTCATATTAGAGGACTTGGATTTGGAGTAACCTTATCAAAGTTGTCTTTATTGTCTCAAAGAGATAGCAATTATGCCAAACTTGAAGA
AAAGTATAAGAAGATGGAGGGACAAATGTCTGAGATGAAATCTTTGATGTCTCACATACTCAAATCTCAAGGTAATGCAAGTGAACAACTTTCTAATGCTACAAATGATC
CTATGGTTAACAACATTGCCATTAACCCAATTGTATCTTCACCTTCGAGTATTAACAATAATAATGCTCTCCGCAAGTGCATGTTGGTAGATTGGTGTGGTACAGGAGAG
GTAGTTGCTGAAGGTCGATGGTCTTCGAATGACCCCAAAGTTATTGTTCATCATGTTCCCCTCGGTCCACACGCCTGCCTGGAAATGGAGTCAATATACTCCAATTCTAT
ATGGGAACTTACAGATCTACCAAATGGGGTAAAACCCATAGGATGCAAATGGATCTATAAGAGTAAAAGAGATGCAGCTGGAAAGTCCATAAGAATTCTCTTATCCATAG
CCACATATTATGACTATGAAATATGGCAAATGGACGTCAAGACTGCCTTTCTGAATGGATACCTCACTGACATTAAGAAATGGCTAGCAGCCCAATTCCAAATGAAAGAT
CTGGGAGAGGCTCAATATGTTCTGGGAATCCAAATCATTAGGGATCGTAAGAACAAAATGCTAGCTCTGTCTCAAGCAACGTATATTGACAAGATGTTGTCTCGATATTC
GATGCAAAACTCCAAGAAGGGACTACTACCCTTCTTGACTGATTTGGAAGTGGTTCCGAACATGGACGTGCCCATAACACTATATTGTGACAATAGTGGGGCTGTAGCCA
ACTCAAATGAACCTCGAAGCCACAAACGAGGTAAACACATCGAGAGGAAGTATCACCTGATACGGGAGATTGTGCAACGAGGAGATGTGATCGTCACTAAGATCGCCTCG
GAGCACAACATCGCTGATCCATTTACGAAGGCTCTCACGGCTAAAGTGTTTGAGGGTCATCTAGAGGGTCTAGGTCTACGAGATATGTATGCCATCTAA
Protein sequenceShow/hide protein sequence
MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGLTTMEEGDNGTMGEKERDESPNVRGNLDCTHTTPRSPLPSDSPASRTRGAIRKLASRCQASPIVDRSLDVGENTN
VSEPILQQVLKKRRGPTKMKTIAIGNKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKLSTRFKEILWTSIQIVESWKISNLSQIQNASNDEEVLKMKPAN
IQSTHDWVDFVKEKNSARFKARSEKFKSMKKKQLPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLERIEKIDKEGTNTTSNNVANDAIS
KVLGPDRGHIRGLGFGVTLSKLSLLSQRDSNYAKLEEKYKKMEGQMSEMKSLMSHILKSQGNASEQLSNATNDPMVNNIAINPIVSSPSSINNNNALRKCMLVDWCGTGE
VVAEGRWSSNDPKVIVHHVPLGPHACLEMESIYSNSIWELTDLPNGVKPIGCKWIYKSKRDAAGKSIRILLSIATYYDYEIWQMDVKTAFLNGYLTDIKKWLAAQFQMKD
LGEAQYVLGIQIIRDRKNKMLALSQATYIDKMLSRYSMQNSKKGLLPFLTDLEVVPNMDVPITLYCDNSGAVANSNEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIAS
EHNIADPFTKALTAKVFEGHLEGLGLRDMYAI