; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g32120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g32120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:23218707..23228367
RNA-Seq ExpressionMoc08g32120
SyntenyMoc08g32120
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]4.5e-14944Show/hide
Query:  FDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINN
        FDPEIERT  RRR+ QR  K+ +            + M+ N            NGD+           +P    + D++D A+R Y    F  LNSGI  
Subjt:  FDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINN

Query:  PLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTR
        P  QA QFELKPVMFQMLQT+GQF G+  EDP+ HL+ F+EI+++F+  GV EDALRLK+FP+S+RD ARTWLN+L   S+ TW +LTEKFL+KY     
Subjt:  PLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTR

Query:  NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQV
        NA LR +I SF+Q+++E++ +AWERFKELLR+CP HG+  C+Q++ FY GL+  ++M+++ +ANG+LL KS N+  + L  +   N Q    R+   K+V
Subjt:  NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQV

Query:  SAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPN
         AGI+++D++ SM+AQ+A+M  MLK L+M        S     S I Q  ++SCV+CG+ H Y++CP+NP S+FY+G   Q    PYSNTYN  WR HPN
Subjt:  SAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPN

Query:  FSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPASKPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRID--------------ATIRSLEMQVGQIAND
        FSWSNQG  S ++       K NY PPGF                   +QQ+  + SLE M+KE++ + +              A++R+LE QVGQ+AN+
Subjt:  FSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPASKPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRID--------------ATIRSLEMQVGQIAND

Query:  QKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEE-KDKKADKGKQVVPNTTPQVGNIKILP
         ++RP GTLP  TE PK    G EHCKA+  +SG                  +  + ++ + ++  EP  +EE  D K  +  +V P  + +  +I+  P
Subjt:  QKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEE-KDKKADKGKQVVPNTTPQVGNIKILP

Query:  PFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRAL
        PFPQR  K+ Q+  FKKF ++LKQLHINIPL++ALEQMPNY KF+KDI++++++LGE+ET+ALTK  S  L + LP K KDPGSFTIPC+IG    G AL
Subjt:  PFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRAL

XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]7.2e-16341.98Show/hide
Query:  LTFDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI
        L +DPEIE+T KR RREQRL+K+ ++K +KE E E   +M  N                           +PN I +AD RD AMR+Y      +LNS +
Subjt:  LTFDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI

Query:  NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTL
         N  P  A FE KP+M QML  +GQF GL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFSL   A  WLNA   ++I T +++ +KFL KY   
Subjt:  NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTL

Query:  TRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQ--GEIERSLP
        TRNAD+RE+I+SFRQKENEAV  AWERFK+L+R CP+ G+PACVQI+ F+R  D  + MMLN AANG    KS NEIV+ L+++++ NDQ   E  R+  
Subjt:  TRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQ--GEIERSLP

Query:  KKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWR
        K+   A +  LD + SMQ Q+  + QMLK +        +  A   PSP+ QI++ +C                       Q  Q+ FNPYSN YNPGW+
Subjt:  KKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWR

Query:  HHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQR--GQNTTQQSGSNESL--------EAMMKEFMTRIDATI-----------
         HPNFSWS QG  SSS     QQYKQ YTPP FP  PA    PQQYNQQ+  GQ   Q   + E L        +A MKE MTR DATI           
Subjt:  HHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQR--GQNTTQQSGSNESL--------EAMMKEFMTRIDATI-----------

Query:  ------RSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKG
              R+LEMQ+GQ+AN+ ++RPQG+LP  TE P+R                              +V+P P                S EKD +    
Subjt:  ------RSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKG

Query:  KQVVP----NTTPQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVK
        K V P    +  PQV N +  PPFPQRLV+KNQD +F+KF +ILKQLHINIP ++ALEQMP Y KFLKDII+R+KKLGEYET+ALT+CSS+   +  P K
Subjt:  KQVVP----NTTPQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVK

Query:  CKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------------------------------------
         KDPGSFTI C IGGK++GRALCDLGA IN MPLS                                                                 
Subjt:  CKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------------------------------------

Query:  -------------------VNDEQVTFNVLDAMRLLVEVEECSTI----GAIMEELQQMMVEDLEADLEAAEKESKIAPGAILPQLERFEFLQPTIADLK
                           V+D++VTFN+LDAM+   + EEC  I    G    EL  ++  ++EA+LE AEKE  IA  A+  +              K
Subjt:  -------------------VNDEQVTFNVLDAMRLLVEVEECSTI----GAIMEELQQMMVEDLEADLEAAEKESKIAPGAILPQLERFEFLQPTIADLK

Query:  ALQPSIIEPP
        ++QP  IEPP
Subjt:  ALQPSIIEPP

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]5.6e-17671.4Show/hide
Query:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSH
        MN N QDPP P NPPV+GD A EGAANRAGE+PN ILL DNRDVA+RNYVTHAFHNLNS +  + P+ +A                       NEDPYSH
Subjt:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSH

Query:  LKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPS
        LKSFIEIANAFQL GVSEDALRLKM                                        NADLREDIVSFRQKENEAVQE WERFKELLRRC S
Subjt:  LKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPS

Query:  HGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKT
        HGLP CVQI+QFYRGLDR SRMMLNTAAN SL EKS++EI+D LNKMTD NDQGEI RSLPKKQVSA +FELDTVASMQAQMA +NQMLKQLTMEKETKT
Subjt:  HGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKT

Query:  VTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPAS
         TSA+ EPS  LQISDISCVYCGDN LYENCPANP S+FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVASSSA+ PAQQYKQNYTPP FPTQPAS
Subjt:  VTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPAS

Query:  KPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRIDAT-----------IRSLEMQVGQIANDQKSRPQGTLPGHTENPK
        +PQQYNQQR QNTTQQ GSN SLEAM KEFMTR +AT           IR LEMQVGQIAND+KSRPQGTLPG+TENPK
Subjt:  KPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRIDAT-----------IRSLEMQVGQIANDQKSRPQGTLPGHTENPK

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]3.3e-19290.1Show/hide
Query:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLK
        MNRNAQDPPPPQNPPVNGDMA E AANR GEIPN ILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMGQFGGLTNEDPYSHLK
Subjt:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
        SFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
Subjt:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG

Query:  LPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
        LP+CVQI+QFYRGLDRSS+MMLNT ANGSLLEKSVNEIVD LNKMTDINDQGE+ RSLPKKQVS GIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
Subjt:  LPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT

Query:  SAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYK
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSA+APAQQYK
Subjt:  SAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]3.4e-18945.83Show/hide
Query:  IPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDG
        +PN I +AD RD AMR+Y      +LNS + N  P  A+FE KP+M QML  +GQFGGL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFS+   
Subjt:  IPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLL
        A  WLNA   ++I TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+  CP+ G+PACVQI+ F+RG D  ++MMLN AANG   
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLL

Query:  EKSVNEIVDNLNKMTDINDQ--GEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENC
         KS NEIV+ L+++++ N Q   E  R+  K+   AG+  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDNLNKMTDINDQ--GEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENC

Query:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQRGQ-NTTQQSGSNESL-----
        P+NP+S++YVGQ  Q+ FNPYSNTYNPGW+ HPNFSWS QG  SS+     QQYK+ YTPPGFP  PA    P QYNQQ+      QQ+ SN  +     
Subjt:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQRGQ-NTTQQSGSNESL-----

Query:  ----EAMMKEFMTRI-----------------DATIRSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVT
            +A MKE MTR                  D T+R LEMQ+GQ+ N+ ++RPQG+LP  TE P+  R GKEHC ++ TRSGL YEGP +PDE +H   
Subjt:  ----EAMMKEFMTRI-----------------DATIRSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVT

Query:  PVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDI
                           S EKD +A   K V P  +    PQV N +  PPFPQRLV+KNQD +F+KF +ILKQLHINIP ++ALEQMP Y KF+KDI
Subjt:  PVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDI

Query:  ISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------
        I+R+KKLGEYET+ALT+CSS+   + +P K KDPGSFTIPC IGGK++GRALCDLGA+IN MPLS                                   
Subjt:  ISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------

Query:  -------------------------------------------------VNDEQVTFNVLDAMRLLVEVEECSTIG----AIMEELQQMMVEDLEADLEA
                                                         V+D++VTFN+LDAM+   ++EEC+ I         EL  ++  ++EA+LE 
Subjt:  -------------------------------------------------VNDEQVTFNVLDAMRLLVEVEECSTIG----AIMEELQQMMVEDLEADLEA

Query:  AEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPP
        AEKE  IA  A+  +              K++QP  IEPP
Subjt:  AEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPP

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129471.7e-16241.98Show/hide
Query:  LTFDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI
        L +DPEIE+T KR RREQRL+K+ ++K +KE E E   +M  N                           +PN I +AD RD AMR+Y      +LNS +
Subjt:  LTFDPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI

Query:  NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTL
         N  P  A FE KP+M QML  +GQF GL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFSL   A  WLNA   ++I T +++ +KFL KY   
Subjt:  NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTL

Query:  TRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQ--GEIERSLP
        TRNAD+RE+I+SFRQKENEAV  AWERFK+L+R CP+ G+PACVQI+ F+R  D  + MMLN AANG    KS NEIV+ L+++++ NDQ   E  R+  
Subjt:  TRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQ--GEIERSLP

Query:  KKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWR
        K+   A +  LD + SMQ Q+  + QMLK +        +  A   PSP+ QI++ +C                       Q  Q+ FNPYSN YNPGW+
Subjt:  KKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWR

Query:  HHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQR--GQNTTQQSGSNESL--------EAMMKEFMTRIDATI-----------
         HPNFSWS QG  SSS     QQYKQ YTPP FP  PA    PQQYNQQ+  GQ   Q   + E L        +A MKE MTR DATI           
Subjt:  HHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQR--GQNTTQQSGSNESL--------EAMMKEFMTRIDATI-----------

Query:  ------RSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKG
              R+LEMQ+GQ+AN+ ++RPQG+LP  TE P+R                              +V+P P                S EKD +    
Subjt:  ------RSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKG

Query:  KQVVP----NTTPQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVK
        K V P    +  PQV N +  PPFPQRLV+KNQD +F+KF +ILKQLHINIP ++ALEQMP Y KFLKDII+R+KKLGEYET+ALT+CSS+   +  P K
Subjt:  KQVVP----NTTPQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVK

Query:  CKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------------------------------------
         KDPGSFTI C IGGK++GRALCDLGA IN MPLS                                                                 
Subjt:  CKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------------------------------------

Query:  -------------------VNDEQVTFNVLDAMRLLVEVEECSTI----GAIMEELQQMMVEDLEADLEAAEKESKIAPGAILPQLERFEFLQPTIADLK
                           V+D++VTFN+LDAM+   + EEC  I    G    EL  ++  ++EA+LE AEKE  IA  A+  +              K
Subjt:  -------------------VNDEQVTFNVLDAMRLLVEVEECSTI----GAIMEELQQMMVEDLEADLEAAEKESKIAPGAILPQLERFEFLQPTIADLK

Query:  ALQPSIIEPP
        ++QP  IEPP
Subjt:  ALQPSIIEPP

A0A6J1DY39 uncharacterized protein LOC1110256531.7e-18945.83Show/hide
Query:  IPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDG
        +PN I +AD RD AMR+Y      +LNS + N  P  A+FE KP+M QML  +GQFGGL +EDP SHLKSFI++AN F+LPG+S+DALRL +FPFS+   
Subjt:  IPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEDALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLL
        A  WLNA   ++I TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWERFK+L+  CP+ G+PACVQI+ F+RG D  ++MMLN AANG   
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLL

Query:  EKSVNEIVDNLNKMTDINDQ--GEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENC
         KS NEIV+ L+++++ N Q   E  R+  K+   AG+  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDNLNKMTDINDQ--GEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENC

Query:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQRGQ-NTTQQSGSNESL-----
        P+NP+S++YVGQ  Q+ FNPYSNTYNPGW+ HPNFSWS QG  SS+     QQYK+ YTPPGFP  PA    P QYNQQ+      QQ+ SN  +     
Subjt:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPA--SKPQQYNQQRGQ-NTTQQSGSNESL-----

Query:  ----EAMMKEFMTRI-----------------DATIRSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVT
            +A MKE MTR                  D T+R LEMQ+GQ+ N+ ++RPQG+LP  TE P+  R GKEHC ++ TRSGL YEGP +PDE +H   
Subjt:  ----EAMMKEFMTRI-----------------DATIRSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVT

Query:  PVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDI
                           S EKD +A   K V P  +    PQV N +  PPFPQRLV+KNQD +F+KF +ILKQLHINIP ++ALEQMP Y KF+KDI
Subjt:  PVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDI

Query:  ISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------
        I+R+KKLGEYET+ALT+CSS+   + +P K KDPGSFTIPC IGGK++GRALCDLGA+IN MPLS                                   
Subjt:  ISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRALCDLGANINHMPLS-----------------------------------

Query:  -------------------------------------------------VNDEQVTFNVLDAMRLLVEVEECSTIG----AIMEELQQMMVEDLEADLEA
                                                         V+D++VTFN+LDAM+   ++EEC+ I         EL  ++  ++EA+LE 
Subjt:  -------------------------------------------------VNDEQVTFNVLDAMRLLVEVEECSTIG----AIMEELQQMMVEDLEADLEA

Query:  AEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPP
        AEKE  IA  A+  +              K++QP  IEPP
Subjt:  AEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPP

A0A6J1DZ19 uncharacterized protein LOC1110248242.7e-17671.4Show/hide
Query:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSH
        MN N QDPP P NPPV+GD A EGAANRAGE+PN ILL DNRDVA+RNYVTHAFHNLNS +  + P+ +A                       NEDPYSH
Subjt:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSH

Query:  LKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPS
        LKSFIEIANAFQL GVSEDALRLKM                                        NADLREDIVSFRQKENEAVQE WERFKELLRRC S
Subjt:  LKSFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPS

Query:  HGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKT
        HGLP CVQI+QFYRGLDR SRMMLNTAAN SL EKS++EI+D LNKMTD NDQGEI RSLPKKQVSA +FELDTVASMQAQMA +NQMLKQLTMEKETKT
Subjt:  HGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKT

Query:  VTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPAS
         TSA+ EPS  LQISDISCVYCGDN LYENCPANP S+FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVASSSA+ PAQQYKQNYTPP FPTQPAS
Subjt:  VTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPAS

Query:  KPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRIDAT-----------IRSLEMQVGQIANDQKSRPQGTLPGHTENPK
        +PQQYNQQR QNTTQQ GSN SLEAM KEFMTR +AT           IR LEMQVGQIAND+KSRPQGTLPG+TENPK
Subjt:  KPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRIDAT-----------IRSLEMQVGQIANDQKSRPQGTLPGHTENPK

A0A6J1E251 uncharacterized protein LOC1110253021.6e-19290.1Show/hide
Query:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLK
        MNRNAQDPPPPQNPPVNGDMA E AANR GEIPN ILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMGQFGGLTNEDPYSHLK
Subjt:  MNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
        SFIEIANAFQLPG SEDALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG
Subjt:  SFIEIANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHG

Query:  LPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
        LP+CVQI+QFYRGLDRSS+MMLNT ANGSLLEKSVNEIVD LNKMTDINDQGE+ RSLPKKQVS GIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT
Subjt:  LPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVT

Query:  SAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYK
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSA+APAQQYK
Subjt:  SAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYK

A0A6J1EQ90 uncharacterized protein LOC1114364113.9e-13041.39Show/hide
Query:  DPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNP
        DPEIERT +RR            KKKK++ E+   ++   AQ     +NP +         AN+     N I LAD+R+ A+R Y   A   LN  I  P
Subjt:  DPEIERTLKRRRREQRLRKEKESKKKKEVEEEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNP

Query:  LPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEI-------ANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAK
          Q   FELKPVMFQMLQT+GQF GL  EDP+ HLKSF+ +       +++F+  GV +D +RL +FP+ LRDGA++WLN L P +I++W  L E FL K
Subjt:  LPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIEI-------ANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAK

Query:  YHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERS
        Y   TRNA  + +IV+F+Q E+E + EA ERFKE+LR+CP HGLP C+Q++ FY GL+  ++ +++ +ANG++L K+ NE  + L ++   N Q    RS
Subjt:  YHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGLDRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERS

Query:  LPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVG-QGAQRNF--NPYSNTY
         P ++ + G+ E+D ++S+ AQ+A++  +L+ L + +++  + + +   + I Q +  SCVYCG+ H ++ CP+NPASIFYVG Q +Q N   NP+SNTY
Subjt:  LPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVG-QGAQRNF--NPYSNTY

Query:  NPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGF--PTQPASKPQQYNQQRGQNTTQ-QSGSNESLEAMMKEFMTRIDATI-------RSLEMQVG
        NPGWR+HPNFSW  Q + +          K NY P GF    Q A   QQ N Q G+ TTQ Q  S  S+E+++KE+M + DA I       R+LE+Q+G
Subjt:  NPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGF--PTQPASKPQQYNQQRGQNTTQ-QSGSNESLEAMMKEFMTRIDATI-------RSLEMQVG

Query:  QIANDQK--SRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTTPQVG
           N ++  S  Q T      N +   + KEH K                              +  +++ K +  +S E++ +           TP   
Subjt:  QIANDQK--SRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTTPQVG

Query:  NIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGK
             PPFPQR+ +K ++ HF+KF +ILK++HINIPL++AL+QMPNY KFLKD++  R+K  E++ ++L +  S  L N +P+K KDPGSFTIP SIGGK
Subjt:  NIKILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGK

Query:  NLGRALCDLGANINHMPLSV
         LGRALCDLGANIN MPLS+
Subjt:  NLGRALCDLGANINHMPLSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGATCGACCAACCGACCGAACGCAGAGACGGGGATCGGAAGAGCGAGACAAGCTTCGTGGTTGGCCCCTTGTTGATCTTCTGTTTGTTTTGCTCTTTGTTGAA
AAATTCTCAATTCGCCAATTCCGACGATGATGATGAAATTGGGGCGTGGAGGAGGAGTAGTGGGAACTTGATTTATACACCAAAGCCCGACCTTGGGAAAGGTGAGTTAG
GGTCTGGTTGGTTGACCTTTGATCCAGAGATTGAAAGAACCCTTAAGAGGAGAAGGCGTGAGCAGAGGTTGAGAAAAGAGAAAGAAAGTAAAAAAAAGAAAGAAGTTGAA
GAAGAAGAGACCATCGAGATGAATAGAAATGCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGATGAAGGAGCAGCAAACCGAGCAGGAGA
AATTCCTAATTCGATCCTTTTAGCAGATAATCGAGATGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCCTTTACCCCAAG
CCGCACAGTTCGAGCTCAAACCAGTCATGTTCCAGATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAATGAAGATCCTTACTCCCATCTCAAATCCTTTATTGAA
ATAGCTAATGCATTTCAACTTCCTGGTGTTTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACC
AAATTCTATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAACGCAGACCTCCGAGAGGACATTGTGTCTTTTAGACAAAAGG
AGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCGCATGTGTGCAGATTAAACAATTCTATAGAGGATTG
GATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATAACTTGAATAAGATGACAGACATTAATGACCA
AGGCGAAATAGAAAGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAAATGTTAA
AGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTGCAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCAC
TTGTATGAGAATTGTCCAGCTAATCCAGCGTCTATTTTTTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTTACAACCCTGGATGGAGGCACCA
TCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCAGAAGCACCCGCTCAACAATACAAGCAAAACTATACTCCTCCTGGTTTTCCAACTCAACCGGCGT
CGAAGCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGAAAGTTTGGAGGCCATGATGAAAGAGTTCATGACAAGAATTGATGCT
ACGATAAGAAGCTTGGAGATGCAAGTGGGGCAGATTGCAAATGACCAGAAATCTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCCGAAGCGAGATCGTGAGGG
AAAGGAGCACTGTAAGGCGGTTATCACAAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCTGACGAAGGAACTCATGTAGTTACACCTGTTCCTGCATCCACCTCCA
ATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGATAAGAAGGCAGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCACAGGTAGGTAATATT
AAGATTCTTCCACCTTTCCCCCAAAGGCTAGTTAAAAAGAATCAGGATGGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATCAATATACCTCTCATAGA
TGCTTTAGAACAAATGCCTAACTACACTAAGTTTTTAAAGGACATCATATCTAGGCGTAAGAAGTTAGGTGAGTATGAGACGATAGCCTTGACAAAGTGTAGTAGTGATG
CTCTAGGGAATCCATTGCCTGTTAAATGTAAGGACCCAGGAAGTTTTACCATCCCCTGCTCAATAGGAGGTAAGAACCTAGGAAGAGCATTGTGTGACTTAGGAGCAAAC
ATTAATCATATGCCTCTTTCTGTCAATGATGAACAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCTGGTTGAAGTCGAGGAGTGCTCTACAATAGGAGCAATCAT
GGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTGGAGGCCGCAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAACTTGAGCGTTTTG
AGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCCCTACCTTCTCATTTAAAATATGCTTATTTG
GGTTTAAACGATACTTTGCCCGTTATCATTTCTTCATATTTGTCTAATGAACATGAATCTTTGCTTTTGCAGAAATCTCAAAGATTCTCAAAGTCTCTTTCATCTCTCTC
TTTTTCGGCACTTTTAGGGTTTATTGGAGGGATTTTCGTTGTCGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGG
AAAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGA
AAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGGTGAGGAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAAAGAGGGCGGAGAT
GTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGGTTTACGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGA
GTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCTGAGTCTTTAGAACTGTCTATCTCTCCCCCTCTT
TCCACTATTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGGTAGTGGGGATTCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTTAGCCACGGTCGAAATTCAATG
CCAAATTGCGCCTGGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCCCTAGTTTTTT
ATGATTTAGAACAGGAAAGGACAACGTCGAAAATTGCCGAAATTTTGGTGGCGTTGAATGAAGCAAGGGGAGAGGATCCATTGGAGGATGATGGAAACAGTGGGGCAGCA
CAAGAACAATTGAATGCTGATAGAGAAGATGAAGATTTTGGAGAATTACCCCAAGAAGTGCATGGAGACGAGTTTGAGGACGAGGAAGACAATGACGATATCTTCCAATA
TGAAGTGAGAGTACGAACTCCGGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTACAAAAGAGCAAGAAGGAACATCTGGTCCTGTGGATGTCCCTAGTGAGGCCA
TGGCGGAATCATCTTCCTCTTCTTCACAAGGTAAGACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTGCTAATGCAGAGACTTCAGATGAGGAGGTG
AGCTTGACCGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCGGAAATTGCGCCAGGCACAATTTCTAGGCCTACGACCCGCGCCGCTGTAGCACGTTTGGCTGC
TCAAAAAGAAGCCGAGGCCGGTCAATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGATACCGATTCTACCG
AGCAGACACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGATCGACCAACCGACCGAACGCAGAGACGGGGATCGGAAGAGCGAGACAAGCTTCGTGGTTGGCCCCTTGTTGATCTTCTGTTTGTTTTGCTCTTTGTTGAA
AAATTCTCAATTCGCCAATTCCGACGATGATGATGAAATTGGGGCGTGGAGGAGGAGTAGTGGGAACTTGATTTATACACCAAAGCCCGACCTTGGGAAAGGTGAGTTAG
GGTCTGGTTGGTTGACCTTTGATCCAGAGATTGAAAGAACCCTTAAGAGGAGAAGGCGTGAGCAGAGGTTGAGAAAAGAGAAAGAAAGTAAAAAAAAGAAAGAAGTTGAA
GAAGAAGAGACCATCGAGATGAATAGAAATGCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGATGAAGGAGCAGCAAACCGAGCAGGAGA
AATTCCTAATTCGATCCTTTTAGCAGATAATCGAGATGTAGCCATGCGGAATTATGTCACTCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCCTTTACCCCAAG
CCGCACAGTTCGAGCTCAAACCAGTCATGTTCCAGATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAATGAAGATCCTTACTCCCATCTCAAATCCTTTATTGAA
ATAGCTAATGCATTTCAACTTCCTGGTGTTTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACC
AAATTCTATCAACACATGGGCAGAACTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAACGCAGACCTCCGAGAGGACATTGTGTCTTTTAGACAAAAGG
AGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCGCATGTGTGCAGATTAAACAATTCTATAGAGGATTG
GATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATAACTTGAATAAGATGACAGACATTAATGACCA
AGGCGAAATAGAAAGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAAATGTTAA
AGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTGCAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCAC
TTGTATGAGAATTGTCCAGCTAATCCAGCGTCTATTTTTTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTTACAACCCTGGATGGAGGCACCA
TCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCAGAAGCACCCGCTCAACAATACAAGCAAAACTATACTCCTCCTGGTTTTCCAACTCAACCGGCGT
CGAAGCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGAAAGTTTGGAGGCCATGATGAAAGAGTTCATGACAAGAATTGATGCT
ACGATAAGAAGCTTGGAGATGCAAGTGGGGCAGATTGCAAATGACCAGAAATCTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCCGAAGCGAGATCGTGAGGG
AAAGGAGCACTGTAAGGCGGTTATCACAAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCTGACGAAGGAACTCATGTAGTTACACCTGTTCCTGCATCCACCTCCA
ATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGATAAGAAGGCAGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCACAGGTAGGTAATATT
AAGATTCTTCCACCTTTCCCCCAAAGGCTAGTTAAAAAGAATCAGGATGGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATCAATATACCTCTCATAGA
TGCTTTAGAACAAATGCCTAACTACACTAAGTTTTTAAAGGACATCATATCTAGGCGTAAGAAGTTAGGTGAGTATGAGACGATAGCCTTGACAAAGTGTAGTAGTGATG
CTCTAGGGAATCCATTGCCTGTTAAATGTAAGGACCCAGGAAGTTTTACCATCCCCTGCTCAATAGGAGGTAAGAACCTAGGAAGAGCATTGTGTGACTTAGGAGCAAAC
ATTAATCATATGCCTCTTTCTGTCAATGATGAACAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCTGGTTGAAGTCGAGGAGTGCTCTACAATAGGAGCAATCAT
GGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTGGAGGCCGCAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAACTTGAGCGTTTTG
AGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCCCTACCTTCTCATTTAAAATATGCTTATTTG
GGTTTAAACGATACTTTGCCCGTTATCATTTCTTCATATTTGTCTAATGAACATGAATCTTTGCTTTTGCAGAAATCTCAAAGATTCTCAAAGTCTCTTTCATCTCTCTC
TTTTTCGGCACTTTTAGGGTTTATTGGAGGGATTTTCGTTGTCGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGG
AAAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGA
AAATATGAGGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGGTGAGGAAAAAGGTGCTGTTTTAGATGAAGAAATAGCTAGACTTCAAAAGAGGGCGGAGAT
GTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGGTTTACGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGA
GTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCTGAGTCTTTAGAACTGTCTATCTCTCCCCCTCTT
TCCACTATTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGGTAGTGGGGATTCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTTAGCCACGGTCGAAATTCAATG
CCAAATTGCGCCTGGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCCCTAGTTTTTT
ATGATTTAGAACAGGAAAGGACAACGTCGAAAATTGCCGAAATTTTGGTGGCGTTGAATGAAGCAAGGGGAGAGGATCCATTGGAGGATGATGGAAACAGTGGGGCAGCA
CAAGAACAATTGAATGCTGATAGAGAAGATGAAGATTTTGGAGAATTACCCCAAGAAGTGCATGGAGACGAGTTTGAGGACGAGGAAGACAATGACGATATCTTCCAATA
TGAAGTGAGAGTACGAACTCCGGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTACAAAAGAGCAAGAAGGAACATCTGGTCCTGTGGATGTCCCTAGTGAGGCCA
TGGCGGAATCATCTTCCTCTTCTTCACAAGGTAAGACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTGCTAATGCAGAGACTTCAGATGAGGAGGTG
AGCTTGACCGAAGTGGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCGGAAATTGCGCCAGGCACAATTTCTAGGCCTACGACCCGCGCCGCTGTAGCACGTTTGGCTGC
TCAAAAAGAAGCCGAGGCCGGTCAATCTAAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGATACCGATTCTACCG
AGCAGACACCATAA
Protein sequenceShow/hide protein sequence
MAEIDQPTERRDGDRKSETSFVVGPLLIFCLFCSLLKNSQFANSDDDDEIGAWRRSSGNLIYTPKPDLGKGELGSGWLTFDPEIERTLKRRRREQRLRKEKESKKKKEVE
EEETIEMNRNAQDPPPPQNPPVNGDMADEGAANRAGEIPNSILLADNRDVAMRNYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPYSHLKSFIE
IANAFQLPGVSEDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERFKELLRRCPSHGLPACVQIKQFYRGL
DRSSRMMLNTAANGSLLEKSVNEIVDNLNKMTDINDQGEIERSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNH
LYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAEAPAQQYKQNYTPPGFPTQPASKPQQYNQQRGQNTTQQSGSNESLEAMMKEFMTRIDA
TIRSLEMQVGQIANDQKSRPQGTLPGHTENPKRDREGKEHCKAVITRSGLSYEGPSLPDEGTHVVTPVPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTTPQVGNI
KILPPFPQRLVKKNQDGHFKKFFEILKQLHINIPLIDALEQMPNYTKFLKDIISRRKKLGEYETIALTKCSSDALGNPLPVKCKDPGSFTIPCSIGGKNLGRALCDLGAN
INHMPLSVNDEQVTFNVLDAMRLLVEVEECSTIGAIMEELQQMMVEDLEADLEAAEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPPELEKKPLPSHLKYAYL
GLNDTLPVIISSYLSNEHESLLLQKSQRFSKSLSSLSFSALLGFIGGIFVVDREANSRTMEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLEPPRISHEKLVFDPREQRR
KYEEAIRMNPRRNQSLGEEKGAVLDEEIARLQKRAEMFSKNNEIRDKENERVYAKIEELNIKWQEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSESLELSISPPL
STIVAVHVEGQEQGSGDSEHDTEPLEHSDLATVEIQCQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFYDLEQERTTSKIAEILVALNEARGEDPLEDDGNSGAA
QEQLNADREDEDFGELPQEVHGDEFEDEEDNDDIFQYEVRVRTPVHESQQVDEEPPTKEQEGTSGPVDVPSEAMAESSSSSSQGKTPSLSSLNVSDPNFVANAETSDEEV
SLTEVVKKTQKKKKVAEIAPGTISRPTTRAAVARLAAQKEAEAGQSKKAKRARVQRGAKEPLEEANEEDTDSTEQTP