; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0219281 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0219281
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr08:7402685..7404248
RNA-Seq ExpressionCmc08g0219281
SyntenyCmc08g0219281
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]2.7e-15159.62Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-
        MPS VLNGEI Y VLFPTK LFPIA KIFGCVCFVRDVRPHHTKLDPK LKCIFL YSRVQK YRCYCP LKRYLV   ++FF              G  
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-

Query:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P
          +F+Y +  P P                    P P         +LP+ C                                          LEST  P
Subjt:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P

Query:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY
        +++H+ L H   ++ ++           + L+       AIGCKWVF+VK+NPDGT+ARLKARLVA GYAQ YG DYSDTFSP+AKLTSI LFLS+AAT 
Subjt:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY

Query:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT
         W LHQLDIKNAFLHG+LQEEVYMEQ P FVAQGESD+VC LRKS YGL ++         + L     ++       FYRRS+ GIVLLVVYVDDIVIT
Subjt:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT

Query:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT
        GNDA GISSLKTF QGQF+ KDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG  PSGT +MPNQQLVKEG+LCK PERYRRLVGKLNYL VT
Subjt:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT

Query:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +P++AYSVSVVS+FMSSPTVDHWAAVEQIL
Subjt:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]2.7e-15159.62Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-
        MPS VLNGEI Y VLFPTK LFPIA KIFGCVCFVRDVRPHHTKLDPK LKCIFL YSRVQK YRCYCP LKRYLV   ++FF              G  
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-

Query:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P
          +F+Y +  P P                    P P         +LP+ C                                          LEST  P
Subjt:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P

Query:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY
        +++H+ L H   ++ ++           + L+       AIGCKWVF+VK+NPDGT+ARLKARLVA GYAQ YG DYSDTFSP+AKLTSI LFLS+AAT 
Subjt:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY

Query:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT
         W LHQLDIKNAFLHG+LQEEVYMEQ P FVAQGESD+VC LRKS YGL ++         + L     ++       FYRRS+ GIVLLVVYVDDIVIT
Subjt:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT

Query:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT
        GNDA GISSLKTF QGQF+ KDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG  PSGT +MPNQQLVKEG+LCK PERYRRLVGKLNYL VT
Subjt:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT

Query:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +P++AYSVSVVS+FMSSPTVDHWAAVEQIL
Subjt:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]2.7e-15159.62Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-
        MPS VLNGEI Y VLFPTK LFPIA KIFGCVCFVRDVRPHHTKLDPK LKCIFL YSRVQK YRCYCP LKRYLV   ++FF              G  
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-

Query:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P
          +F+Y +  P P                    P P         +LP+ C                                          LEST  P
Subjt:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P

Query:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY
        +++H+ L H   ++ ++           + L+       AIGCKWVF+VK+NPDGT+ARLKARLVA GYAQ YG DYSDTFSP+AKLTSI LFLS+AAT 
Subjt:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY

Query:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT
         W LHQLDIKNAFLHG+LQEEVYMEQ P FVAQGESD+VC LRKS YGL ++         + L     ++       FYRRS+ GIVLLVVYVDDIVIT
Subjt:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT

Query:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT
        GNDA GISSLKTF QGQF+ KDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG  PSGT +MPNQQLVKEG+LCK PERYRRLVGKLNYL VT
Subjt:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT

Query:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +P++AYSVSVVS+FMSSPTVDHWAAVEQIL
Subjt:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]2.7e-15159.62Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-
        MPS VLNGEI Y VLFPTK LFPIA KIFGCVCFVRDVRPHHTKLDPK LKCIFL YSRVQK YRCYCP LKRYLV   ++FF              G  
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-

Query:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P
          +F+Y +  P P                    P P         +LP+ C                                          LEST  P
Subjt:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P

Query:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY
        +++H+ L H   ++ ++           + L+       AIGCKWVF+VK+NPDGT+ARLKARLVA GYAQ YG DYSDTFSP+AKLTSI LFLS+AAT 
Subjt:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY

Query:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT
         W LHQLDIKNAFLHG+LQEEVYMEQ P FVAQGESD+VC LRKS YGL ++         + L     ++       FYRRS+ GIVLLVVYVDDIVIT
Subjt:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT

Query:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT
        GNDA GISSLKTF QGQF+ KDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG  PSGT +MPNQQLVKEG+LCK PERYRRLVGKLNYL VT
Subjt:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT

Query:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +P++AYSVSVVS+FMSSPTVDHWAAVEQIL
Subjt:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]2.7e-15159.62Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-
        MPS VLNGEI Y VLFPTK LFPIA KIFGCVCFVRDVRPHHTKLDPK LKCIFL YSRVQK YRCYCP LKRYLV   ++FF              G  
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGR-

Query:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P
          +F+Y +  P P                    P P         +LP+ C                                          LEST  P
Subjt:  MTIFLY-MRLPLP------------------HHPFPL------MHLLPARC-----------------------------------------LLEST--P

Query:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY
        +++H+ L H   ++ ++           + L+       AIGCKWVF+VK+NPDGT+ARLKARLVA GYAQ YG DYSDTFSP+AKLTSI LFLS+AAT 
Subjt:  DNLHQCLLHHAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATY

Query:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT
         W LHQLDIKNAFLHG+LQEEVYMEQ P FVAQGESD+VC LRKS YGL ++         + L     ++       FYRRS+ GIVLLVVYVDDIVIT
Subjt:  NWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVIT

Query:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT
        GNDA GISSLKTF QGQF+ KDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG  PSGT +MPNQQLVKEG+LCK PERYRRLVGKLNYL VT
Subjt:  GNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVT

Query:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +P++AYSVSVVS+FMSSPTVDHWAAVEQIL
Subjt:  QPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

TrEMBL top hitse value%identityAlignment
A0A438CQ50 Retrovirus-related Pol polyprotein from transposon RE17.1e-11847.5Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFR---IYLLLHHHRVRAM
        MPS VLN +I Y +LFP KSLFPI  +IFG  C+VRDVRP  TKLDPK LKC+FL YSR+QK YRC+ P L +Y+V   ++F      Y    +      
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFR---IYLLLHHHRVRAM

Query:  GRMTIFLYMRLP-----LPHHPFPLMHLLPA-----------------------RCLLESTPDNL----------HQCLLHHAIR---------------
        G   +     +P         P  ++ LLPA                          L   P +L           QC   ++I                
Subjt:  GRMTIFLYMRLP-----LPHHPFPLMHLLPA-----------------------RCLLESTPDNL----------HQCLLHHAIR---------------

Query:  ----DQVMI-----------------------------FPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSI
            D V I                             + L+   +  + +GCKWVF++KVNPDG+MARLKARLVA GYAQTYG+DYS+TFSP+A+L S+
Subjt:  ----DQVMI-----------------------------FPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSI

Query:  HLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLL
        HL +SIAA+ +WPL QLDIKNAFLH +LQ+EVYMEQ PRFVAQGE  +VCHLRKS YGL ++         +++      +  +    FYR+S NG +LL
Subjt:  HLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLL

Query:  VVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVK-EGKLCKAPERYRR
        VVYVDDIVITG+D +GISSLK F   +FH KDLG+LKYFLG+EV RSK+GI+LSQRKYVLDLL ETGK+   P  T ++PN  L K +G     PERY+R
Subjt:  VVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVK-EGKLCKAPERYRR

Query:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        LVGKLNYL VT P++AY VS+VS+FMS+PTV HWAA+EQIL
Subjt:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

A0A438IJ87 Retrovirus-related Pol polyprotein from transposon RE19.3e-11848.08Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--------LLFF-----------
        MP+ VL G+I Y V+ P KSLFP+A +IFGC C+VRD RP  TKLDPK L+C+FL YSR+QK YRC+ P L +YLV           FF           
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--------LLFF-----------

Query:  RIYLLLHHHRVRAMGRMTIFLYMRLPLPHHPFPLM-------------------------------------HLLPARCLLE------STPDNLHQCLLH
          +L+      R     +  +Y R P+     P                                       HL  +  +L       S P  + + L H
Subjt:  RIYLLLHHHRVRAMGRMTIFLYMRLPLPHHPFPLM-------------------------------------HLLPARCLLE------STPDNLHQCLLH

Query:  HAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDI
           ++ ++           + L+        +GCKWVF VKVNPDG++ARLKARLVA GYAQTYG+DYSDTFSP+AKL S+ LF+SIAA+  W +HQLDI
Subjt:  HAIRDQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDI

Query:  KNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISS
        KNAFLHG+L+EEVY+EQ P FVAQGE  +VC L+K+ YGL ++         K + A    +       FY++S  GI+LLVVYVDDIVITGND +GIS 
Subjt:  KNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISS

Query:  LKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSV
        LKTF   +FH KDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+   P  T ++PN QL+  +G     PERYRR+VGKLNYL VT+P++AY+V
Subjt:  LKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSV

Query:  SVVSRFMSSPTVDHWAAVEQIL
        SVVS+F S+PT+ HWAA+EQIL
Subjt:  SVVSRFMSSPTVDHWAAVEQIL

A5AWD0 Uncharacterized protein3.2e-11848.46Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--------LLFFR----------
        MP+ VL G+I Y V+ P KSLFP+A +IFGC C+VRD RP  TKLDPK L+C+FL YSR+QK YRC+ P L +YLV           FF           
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--------LLFFR----------

Query:  ----IYLLLHHHRVRAMGRMT------------------------IFLYMRLPL---------PHHPFPLMHL-LPARCLLE--STPDNLHQCLLHHAIR
            +Y +++         +                         + +Y R P+         P    P   L LP     +  S P  + + L H   +
Subjt:  ----IYLLLHHHRVRAMGRMT------------------------IFLYMRLPL---------PHHPFPLMHL-LPARCLLE--STPDNLHQCLLHHAIR

Query:  DQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAF
        + ++           + L+        +GCKWVF+VKVNPDG++ARLKARLVA GYAQTYG+DYSDTFSP+AKL S+ LF+SIAA+  W +HQLDIKNAF
Subjt:  DQVM----------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAF

Query:  LHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTF
        LHG+L+EEVY+EQ P FVAQGE  +VC L+K+ YGL ++         K + A    +       FY++S  GI+LLVVYVDDIVITGND +GIS LKTF
Subjt:  LHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTF

Query:  RQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVS
           +FH KDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+   P  T ++PN QL+  +G     PERYRR+VGKLNYL VT+P++AY+VSVVS
Subjt:  RQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVS

Query:  RFMSSPTVDHWAAVEQIL
        +F S+PT+ HWAA+EQIL
Subjt:  RFMSSPTVDHWAAVEQIL

A5B136 Uncharacterized protein2.9e-11951.48Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGRM
        MP+ VL G+I Y  + P KSLFP+A +IFGC C+VRD RP  TKLDPK L+C+FL YSR+QK YRC+ P L +YLV   ++F             A    
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVM--LLFFRIYLLLHHHRVRAMGRM

Query:  TIFLYMRLPLPH----------------HPFPLMHLLPARCLLESTPDNLHQCLLHHAIRDQVM------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTM
          +L  ++                    H  P++++ PA       P  L+     +A+ +++        + L+        +GCKWVF+VKV PDG++
Subjt:  TIFLYMRLPLPH----------------HPFPLMHLLPARCLLESTPDNLHQCLLHHAIRDQVM------IFPLLFAKVNTSAIGCKWVFSVKVNPDGTM

Query:  ARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGL
        ARLKARLVA GYAQTYG+DYSDTFSP+AKL S+ LF+SIAA+  W +HQLDIKNAFLHG+L+EEVY+EQ P FVAQGE  +VC L+K+ YGL ++     
Subjt:  ARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGL

Query:  VSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETG
            K + A    +       FY++S  GI+LLVVYVDDIVITGND +GIS LKTF   +FH KDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETG
Subjt:  VSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETG

Query:  KLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        K+   P  T ++PN QL+  +G     PERYRR+VGKLNYL VT+P++AY+VSVVS+F S+PT+ HWAA+EQIL
Subjt:  KLGTTPSGTSIMPNQQLV-KEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

Q6L3Q0 Polyprotein, putative1.6e-12247.49Show/hide
Query:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVML----------------------
        MPS VLNG+I Y VLFP K LFP+  K+FG  C+VRDVRPH TKLDPK LKC+FL YSR+QK YRCY P L RY+V +                      
Subjt:  MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVML----------------------

Query:  ------LFFRIYLLLHHHRVRAMGRM-------------------TIFLYMRL----------------PLPHHPFPLM---------------------
              L +R             G +                    + +Y R                 PLP +P P                       
Subjt:  ------LFFRIYLLLHHHRVRAMGRM-------------------TIFLYMRL----------------PLPHHPFPLM---------------------

Query:  -------HLLPARCLLEST------PDNLHQCLLH-----------HAIRDQVMIFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTY
               HL P  C L ++      P  + + L H           HA+ D    + L+       A+GCKWVF++KVNPDG+MARLKARLVA GYAQTY
Subjt:  -------HLLPARCLLEST------PDNLHQCLLH-----------HAIRDQVMIFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTY

Query:  GIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVH
        G+DYSDTFSP+AKLTS+ LF+S+AA+ NWPLHQL IKNAFLHG+LQEEVYMEQ P FVAQGE+ +VCHL+K  YGL ++         +++      + +
Subjt:  GIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVH

Query:  LIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQ
             FYR+S  GI+LLVVYVDDIVIT +D +GISSLK F    FH KDLGQLKYFLGIEV RSKKGI+LSQRKY+LDLL ETGK    P  T ++PN Q
Subjt:  LIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQ

Query:  LVK-EGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        L   +G     PERYRRLVGKLNYL VT+P+++++VS+VS+FMS+PT+ HWAA+EQIL
Subjt:  LVK-EGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.2e-4335.88Show/hide
Query:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES
        N + +  +WVFSVK N  G   R KARLVA G+ Q Y IDY +TF+P+A+++S    LS+   YN  +HQ+D+K AFL+G L+EE+YM +LP+ ++   S
Subjt:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES

Query:  DRVCHLRKSPYGLNRAHMRGLVSLVKLL--YALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEV
        D VC L K+ YGL +A         + L     V   V   I    + + N  + +++YVDD+VI   D + +++ K +   +F M DL ++K+F+GI +
Subjt:  DRVCHLRKSPYGLNRAHMRGLVSLVKLL--YALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEV

Query:  MRSKKGIYLSQRKYVLDLLS----ETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIV-TQPNLAYSVSVVSRFMSSPTVDHWAAVEQI
           +  IYLSQ  YV  +LS    E     +TP  + I  N +L+   + C  P   R L+G L Y+++ T+P+L  +V+++SR+ S    + W  ++++
Subjt:  MRSKKGIYLSQRKYVLDLLS----ETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIV-TQPNLAYSVSVVSRFMSSPTVDHWAAVEQI

Query:  L
        L
Subjt:  L

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-5037.87Show/hide
Query:  CKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHL
        CKWVF +K + D  + R KARLV  G+ Q  GID+ + FSP+ K+TSI   LS+AA+ +  + QLD+K AFLHG+L+EE+YMEQ   F   G+   VC L
Subjt:  CKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHL

Query:  RKSPYGLNRAHMRGLVSLVKLLYALVCRRVHL-IIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSK--K
         KS YGL +A  +  +     + +    + +    ++F R S+N  ++L++YVDD++I G D   I+ LK      F MKDLG  +  LG++++R +  +
Subjt:  RKSPYGLNRAHMRGLVSLVKLLYALVCRRVHL-IIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSK--K

Query:  GIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVK---------EGKLCKAPERYRRLVGKLNY-LIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQI
         ++LSQ KY+  +L         P  T +  + +L K         +G + K P  Y   VG L Y ++ T+P++A++V VVSRF+ +P  +HW AV+ I
Subjt:  GIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVK---------EGKLCKAPERYRRLVGKLNY-LIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQI

Query:  L
        L
Subjt:  L

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.6e-0338Show/hide
Query:  LKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYL
        LK+FGC  F    +   TKLD K + CIF+ Y   +  YR + P+ K+ +
Subjt:  LKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYL

P92519 Uncharacterized mitochondrial protein AtMg008106.8e-1733.33Show/hide
Query:  LVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRR
        L++YVDDI++TG+  + ++ L       F MKDLG + YFLGI++     G++LSQ KY   +L+  G L   P  T +               P  +R 
Subjt:  LVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRR

Query:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +VG L YL +T+P+++Y+V++V + M  PT+  +  ++++L
Subjt:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-5941.36Show/hide
Query:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES
        + + +GC+W+F+ K N DG++ R KARLVA GY Q  G+DY++TFSP+ K TSI + L +A   +WP+ QLD+ NAFL G L ++VYM Q P F+ +   
Subjt:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES

Query:  DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMR
        + VC LRK+ YGL +A     V L   L  +           F  +    IV ++VYVDDI+ITGND + + +       +F +KD  +L YFLGIE  R
Subjt:  DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMR

Query:  SKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
           G++LSQR+Y+LDLL+ T  +   P  T + P+ +L +  G     P  YR +VG L YL  T+P+++Y+V+ +S+FM  PT +H  A+++IL
Subjt:  SKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-6042.76Show/hide
Query:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES
        + + +GC+W+F+ K N DG++ R KARLVA GY Q  G+DY++TFSP+ K TSI + L +A   +WP+ QLD+ NAFL G L +EVYM Q P FV +   
Subjt:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGES

Query:  DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLI--IMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEV
        D VC LRK+ YGL +A     V L    Y L    V+ I     F  +    I+ ++VYVDDI+ITGND   +         +F +K+   L YFLGIE 
Subjt:  DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLI--IMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEV

Query:  MRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
         R  +G++LSQR+Y LDLL+ T  L   P  T +  + +L +  G     P  YR +VG L YL  T+P+L+Y+V+ +S++M  PT DHW A++++L
Subjt:  MRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.0e-6344.15Show/hide
Query:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVA-QGE
        N   IGCKWV+ +K N DGT+ R KARLVA GY Q  GID+ +TFSP+ KLTS+ L L+I+A YN+ LHQLDI NAFL+G+L EE+YM+  P + A QG+
Subjt:  NTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYNWPLHQLDIKNAFLHGNLQEEVYMEQLPRFVA-QGE

Query:  S---DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGI
        S   + VC+L+KS YGL +A  +  +     L      + H    +F + +    + ++VYVDDI+I  N+ + +  LK+  +  F ++DLG LKYFLG+
Subjt:  S---DRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGI

Query:  EVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        E+ RS  GI + QRKY LDLL ETG LG  PS   + P+       G      + YRRL+G+L YL +T+ +++++V+ +S+F  +P + H  AV +IL
Subjt:  EVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQL-VKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

ATMG00810.1 DNA/RNA polymerases superfamily protein4.8e-1833.33Show/hide
Query:  LVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRR
        L++YVDDI++TG+  + ++ L       F MKDLG + YFLGI++     G++LSQ KY   +L+  G L   P  T +               P  +R 
Subjt:  LVVYVDDIVITGNDASGISSLKTFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRR

Query:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL
        +VG L YL +T+P+++Y+V++V + M  PT+  +  ++++L
Subjt:  LVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVDHWAAVEQIL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.0e-1248.39Show/hide
Query:  VNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIA
        VN + +GCKWVF  K++ DGT+ RLKARLVA G+ Q  GI + +T+SP+ +  +I   L++A
Subjt:  VNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCATTTGTTCTCAATGGTGAGATTTCTTATTGTGTTCTTTTTCCTACCAAGTCTTTGTTTCCTATTGCTCTTAAGATATTTGGTTGTGTTTGTTTTGTTCGCGA
CGTTCGTCCTCATCATACTAAGTTAGATCCCAAATTCTTAAAATGTATCTTCTTGAGTTATTCGCGTGTTCAAAAGAGGTATCGTTGTTATTGTCCTATACTTAAAAGAT
ATCTTGTGATGTTGCTTTTTTTTAGGATATACCTTTTACTTCATCACCATCGAGTTCGTGCCATGGGGAGGATGACAATCTTTTTATATATGAGATTACCTCTCCCACAC
CATCCTTTTCCACTAATGCACCTCCTTCCCGCCCGTTGCCTTCTCGAGTCTACTCCCGACAACCTCCATCAATGCCTCCTTCATCATGCAATCCGAGACCAAGTGATGAT
CTTCCCATTGCTCTTCGCAAAGGTAAATACAAGTGCCATTGGTTGTAAATGGGTGTTTTCTGTTAAGGTGAATCCTGATGGAACAATGGCTCGATTGAAAGCTCGTCTTG
TTGCCAATGGTTATGCTCAAACCTACGGAATTGATTATTCAGATACATTTTCTCCAATTGCCAAATTAACTTCCATCCACCTATTTCTTTCCATAGCTGCTACCTATAAC
TGGCCTTTGCATCAACTTGACATTAAGAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACTACCTAGGTTTGTCGCTCAGGGGGAGAGTGATAG
AGTATGTCATCTTCGAAAATCTCCGTATGGTTTGAACAGAGCCCACATGCGTGGTTTGGTAAGTTTAGTCAAGCTCTTGTACGCTTTGGTATGCAGAAGAGTACATCTGA
TCATTATGTTTTTTTATCGCCGATCTGATAATGGTATAGTTTTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATCGGGTATTTCATCTCTCAAA
ACTTTTCGTCAGGGTCAGTTTCATATGAAAGATTTGGGTCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGTAAGAAAGGTATTTATTTGTCTCAACGAAAATA
TGTACTTGATTTGTTGTCTGAGACAGGAAAATTAGGAACCACACCAAGTGGTACTTCGATTATGCCGAATCAGCAACTTGTTAAAGAAGGAAAATTATGTAAAGCTCCTG
AGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCAACCAAACCTTGCTTATTCTGTAAGTGTGGTAAGTCGGTTCATGTCTTCCCCTACAGTGGAT
CACTGGGCTGCAGTAGAGCAGATTCTTGTTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCATTTGTTCTCAATGGTGAGATTTCTTATTGTGTTCTTTTTCCTACCAAGTCTTTGTTTCCTATTGCTCTTAAGATATTTGGTTGTGTTTGTTTTGTTCGCGA
CGTTCGTCCTCATCATACTAAGTTAGATCCCAAATTCTTAAAATGTATCTTCTTGAGTTATTCGCGTGTTCAAAAGAGGTATCGTTGTTATTGTCCTATACTTAAAAGAT
ATCTTGTGATGTTGCTTTTTTTTAGGATATACCTTTTACTTCATCACCATCGAGTTCGTGCCATGGGGAGGATGACAATCTTTTTATATATGAGATTACCTCTCCCACAC
CATCCTTTTCCACTAATGCACCTCCTTCCCGCCCGTTGCCTTCTCGAGTCTACTCCCGACAACCTCCATCAATGCCTCCTTCATCATGCAATCCGAGACCAAGTGATGAT
CTTCCCATTGCTCTTCGCAAAGGTAAATACAAGTGCCATTGGTTGTAAATGGGTGTTTTCTGTTAAGGTGAATCCTGATGGAACAATGGCTCGATTGAAAGCTCGTCTTG
TTGCCAATGGTTATGCTCAAACCTACGGAATTGATTATTCAGATACATTTTCTCCAATTGCCAAATTAACTTCCATCCACCTATTTCTTTCCATAGCTGCTACCTATAAC
TGGCCTTTGCATCAACTTGACATTAAGAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACTACCTAGGTTTGTCGCTCAGGGGGAGAGTGATAG
AGTATGTCATCTTCGAAAATCTCCGTATGGTTTGAACAGAGCCCACATGCGTGGTTTGGTAAGTTTAGTCAAGCTCTTGTACGCTTTGGTATGCAGAAGAGTACATCTGA
TCATTATGTTTTTTTATCGCCGATCTGATAATGGTATAGTTTTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATCGGGTATTTCATCTCTCAAA
ACTTTTCGTCAGGGTCAGTTTCATATGAAAGATTTGGGTCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGTAAGAAAGGTATTTATTTGTCTCAACGAAAATA
TGTACTTGATTTGTTGTCTGAGACAGGAAAATTAGGAACCACACCAAGTGGTACTTCGATTATGCCGAATCAGCAACTTGTTAAAGAAGGAAAATTATGTAAAGCTCCTG
AGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCAACCAAACCTTGCTTATTCTGTAAGTGTGGTAAGTCGGTTCATGTCTTCCCCTACAGTGGAT
CACTGGGCTGCAGTAGAGCAGATTCTTGTTATTTGA
Protein sequenceShow/hide protein sequence
MPSFVLNGEISYCVLFPTKSLFPIALKIFGCVCFVRDVRPHHTKLDPKFLKCIFLSYSRVQKRYRCYCPILKRYLVMLLFFRIYLLLHHHRVRAMGRMTIFLYMRLPLPH
HPFPLMHLLPARCLLESTPDNLHQCLLHHAIRDQVMIFPLLFAKVNTSAIGCKWVFSVKVNPDGTMARLKARLVANGYAQTYGIDYSDTFSPIAKLTSIHLFLSIAATYN
WPLHQLDIKNAFLHGNLQEEVYMEQLPRFVAQGESDRVCHLRKSPYGLNRAHMRGLVSLVKLLYALVCRRVHLIIMFFYRRSDNGIVLLVVYVDDIVITGNDASGISSLK
TFRQGQFHMKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGTTPSGTSIMPNQQLVKEGKLCKAPERYRRLVGKLNYLIVTQPNLAYSVSVVSRFMSSPTVD
HWAAVEQILVI