; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035274 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035274
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:17895485..17902703
RNA-Seq ExpressionLag0035274
SyntenyLag0035274
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]1.8e-24446.87Show/hide
Query:  IANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--
        +A +  RA+  +  P+       I RP I A NFE+KP M  MLQ    F GL +EDP++HL  FL + D+    GV  DA+RL LFP+SL+D AK W  
Subjt:  IANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--

Query:  -------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDE
                                        +I+ F Q + E   EAWERFK+LLRKC HH LP  IQ++TFYNGL+  ++ +VDA+AGGAL+AK   E
Subjt:  -------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDE

Query:  AYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS-----
        A+E+LE ++ N+ QW   R  N K   VLEVD ++ + A ++ +   + +++V S            +N  T+  C  C   H    C + NP +     
Subjt:  AYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS-----

Query:  --VFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQV
          V  +  QRNNPYSN YNPGWRNHPNFSW    SN Q  Q+    GF      P Q K+ +  +++   L     +FM  T    Q+ QAS++ LE+QV
Subjt:  --VFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQV

Query:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELES-----------GQGVGGSKENAGASGSVPDVEP
        GQLAN +  R QG  PS  E +P+ +  EQ KA+TLR GK +  +    DL   +     +EKE E+              +  ++++     S+P    
Subjt:  GQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELES-----------GQGVGGSKENAGASGSVPDVEP

Query:  PYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIP
        P +   PYVP +PFPQR +    DGQF KFLE+ ++L INIP  EA+EQMP+Y KF+KDIL+KK++ GE E + LTEECS IL+  LPPK KD  SF IP
Subjt:  PYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIP

Query:  VSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLID
         +IG     RALCDLG+SINL+PLSV +K+GIGE +PTTV+LQ+ADRS                                           L T RTLID
Subjt:  VSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLID

Query:  VQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQDSAD---------------------KHSEKHGEAP
        V++G LT+RV NE+  F VF+A+K+P E EDC  I +                 LEST++  AT    +                        E  G AP
Subjt:  VQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQDSAD---------------------KHSEKHGEAP

Query:  P-IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMK
        P   PS+I APTL LKPLP HL+Y YLG SETLP+I+A +L    EE ++++L++++ AIGWT+ADI+GIS S CMH+I +EE    S+E QRRLNP MK
Subjt:  P-IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMK

Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSG
        EVV+ EV+K LDAGIIYPI+DS+WVSP Q VPKKGG+TVV N++NEL+PTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML+RLAG  YYCFLDGYSG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSG

Query:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF
        YNQI IAPEDQEKTTFTCP+GTFA+RRMPF LCNA ATFQRCM++IFSDM+E  +EVFMDDFSV    F
Subjt:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]9.7e-25447.69Show/hide
Query:  RTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW------
        R   ++ Y +P F+ ++  IARP I A NF +     Q ++   +F+GLS+EDP+ HL++FL + D+F + GVP + +RL LF  SL   A+ W      
Subjt:  RTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW------

Query:  ---------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEM
                                    EI  F+Q + E+  EA+ERFK+LLRKC HHGL    ++ TFYNGL       VD +A G+L  +  ++A+E+
Subjt:  ---------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEM

Query:  LERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANAL-KNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFF
        LE I+ N+C+  D R  +KKV  V EVD +++  A +     AL K +  +S  +   ++ A  V  +    C  CGE H  + CP      +  +SV +
Subjt:  LERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANAL-KNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCP------SNPASVFF

Query:  VG---NQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQS---GFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQ
        VG   NQ+NNP+SN YNPGWRNHPNFSW    +NV+      Q+   GF       QQN +    +   N+ E ++ ++M +TDA IQS  ASMRALE+Q
Subjt:  VG---NQRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQS---GFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQ

Query:  VGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVP
        VGQLA+ +  RP G LPS+TE +P+ + +E  KA+TLRSGK +E + K  D   + + ++  E  + S         N  A  S P     +V PPP   
Subjt:  VGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVP

Query:  PLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGR
          PFPQR + + QD QF+KF+++ K+L INIP  EA+EQM +YVKF+KDIL++K+RL EFETV+LTEECS IL+  LPPK KDP SFTIP +IG +  G+
Subjt:  PLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGR

Query:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRV
        ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADRS                                           LATGRTLIDVQKGELTMRV
Subjt:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRV

Query:  CNEEVKFNVFKAMKYPDEMEDCSFIRIL-------------ESTVIEKATQDSADKHSEKHGE--------------------------APPIKPSLIEA
         +E+V FNVF AMK+ ++ E C  +                 +  +E + +++ D+ +E+  E                          +   KPS+ E 
Subjt:  CNEEVKFNVFKAMKYPDEMEDCSFIRIL-------------ESTVIEKATQDSADKHSEKHGE--------------------------APPIKPSLIEA

Query:  PTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKW
        P L+LK LP HLKY +LGE  TLP+I++  L  EHEE L+++L++Y++AIGW +ADI+GIS SFCMHKI++E+    +IE QRRLNP MKEVVKKE+IKW
Subjt:  PTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKW

Query:  LDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPED
        LDAGIIYPI+DS+WVSP+QCVPKKGG+TVV N+ NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGY+QI IAPED
Subjt:  LDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPED

Query:  QEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF
        QEKTTFTCP+GTFAFR++ F LCNA +TFQRCM+ IFSDMIE  VEVFMDDFSVL   F
Subjt:  QEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]9.1e-24445.4Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----
        N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+W    
Subjt:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----

Query:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY
                                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +  +DA+AGG L++K  +EAY
Subjt:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY

Query:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN
           Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R +A +    +S R +E+Q+GQLAN
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN

Query:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF
         + +R QG LPS TE      KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D+     P  P  PYVPP+PF
Subjt:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF

Query:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD
        PQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Subjt:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD

Query:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE
        LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                                           LAT   +IDV+ G LT++V  EE
Subjt:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE

Query:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD
        V+FN+F+A KYP   +    + +                 LE+ ++   T               ++     +K G          PP  PS ++AP L+
Subjt:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD

Query:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG
        LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Subjt:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT
        IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG  YYCFLDGYSGYNQI+I+PEDQEKT
Subjt:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT

Query:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF
        TFTCPYGTFAFRRMPF LCNA ATFQRCM+ IFSD +E  +EVFMDDFSV    F
Subjt:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]9.1e-24445.4Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----
        N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+W    
Subjt:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----

Query:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY
                                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +  +DA+AGG L++K  +EAY
Subjt:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY

Query:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN
           Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R +A +    +S R +E+Q+GQLAN
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN

Query:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF
         + +R QG LPS TE      KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D+     P  P  PYVPP+PF
Subjt:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF

Query:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD
        PQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Subjt:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD

Query:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE
        LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                                           LAT   +IDV+ G LT++V  EE
Subjt:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE

Query:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD
        V+FN+F+A KYP   +    + +                 LE+ ++   T               ++     +K G          PP  PS ++AP L+
Subjt:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD

Query:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG
        LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Subjt:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT
        IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG  YYCFLDGYSGYNQI+I+PEDQEKT
Subjt:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT

Query:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF
        TFTCPYGTFAFRRMPF LCNA ATFQRCM+ IFSD +E  +EVFMDDFSV    F
Subjt:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]5.3e-24445.4Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----
        N   R +  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+W    
Subjt:  NDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW----

Query:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY
                                     ++I  F Q + E+  EAWERFK+L RKC HHGLP  + ++TFYNGL    +  +DA+AGG L++K  +EAY
Subjt:  -----------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAY

Query:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  EMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN
           Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R +A +    +S R +E+Q+GQLAN
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLAN

Query:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF
         + +R QG LPS TE      KE  KAVTLRSGK L +                +  E   G  V   + N   S  V D+     P  P  PYVPP+PF
Subjt:  ELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPF

Query:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD
        PQR K    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LPPK +DP SF+IP +IG  +  RALCD
Subjt:  PQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCD

Query:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE
        LGAS++LMPLSV RKLG+ E +PTT++LQLADRS                                           LAT   +IDV+ G LT++V  EE
Subjt:  LGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCNEE

Query:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD
        V+FN+F+A KYP   +    + +                 LE+ ++   T               ++     +K G          PP  PS ++AP L+
Subjt:  VKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIEKATQ--------------DSADKHSEKHG--------EAPPIKPSLIEAPTLD

Query:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG
        LKPLP HL Y +LGE+ TLP+IV++ L  E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAG
Subjt:  LKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT
        IIYPI+DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG  YYCFLDGYSGYNQI+I+PEDQEKT
Subjt:  IIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKT

Query:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF
        TFTCPYGTFAFRRMPF LCNA ATFQRCM+ IFSD +E  +EVFMDDFSV    F
Subjt:  TFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLF

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase1.5e-23144.78Show/hide
Query:  QQNPLFEQNEQRNNQAENPILIAND--RTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGV
        ++  L E  EQ     EN I+I  D      +R  A+P   E    +  P++  A  +++  M +M+Q   QF GLS E+P+ H+ +FL + D+   +GV
Subjt:  QQNPLFEQNEQRNNQAENPILIAND--RTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGV

Query:  PRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGL
         +DALRL LF +SL   A  W                                 +EI+ FRQ   ET  EAW RF+++LR C +H +P  IQ+ TFY+GL
Subjt:  PRDALRLTLFPYSLRDGAKSW---------------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGL

Query:  NGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACV
            +  +D   G + L+    E + +L  +  N  +    R    K   V+EVD V+ + A +  +  ++KN  V   Q  P               C 
Subjt:  NGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACV

Query:  YCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFM
         CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQG  ++ QQ               Q +   P Q    SLE  + +FM
Subjt:  YCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFM

Query:  ARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAG
        A       S  A+ + +E Q+GQLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E  K +   S    +I  EK  E             
Subjt:  ARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAG

Query:  ASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPL-VEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPP
                                                        I  PL V+A+EQMP+YVKF+KDIL+KK+RLG++ETV+LTEECS I++N LPP
Subjt:  ASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPL-VEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPP

Query:  KAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------
        K KDP SFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS                                         
Subjt:  KAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------

Query:  SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IEKATQDSADKHSEKHGE-----------------
          LATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +E+A  D  D+ +E+  E                 
Subjt:  SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IEKATQDSADKHSEKHGE-----------------

Query:  -----APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQ
             AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++  L     E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+G   S+E Q
Subjt:  -----APP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQ

Query:  RRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYY
        RRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +Y
Subjt:  RRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYY

Query:  CFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV
        CFLDGYSGYNQI I PEDQEKTTFTCPYGTF FR+MPF LCNA ATFQRCM+ IF+DM+E+ +EVFMDDFSV
Subjt:  CFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV

A0A2G9HYA0 Reverse transcriptase1.1e-23450.33Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR C +H +P  IQ+ TFY+GL    +  +D   G + L+    E + +L  +  N  +    R    K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQ
        + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQGS  +
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQ

Query:  AQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS
         QQ               Q +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Subjt:  AQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS

Query:  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQ
        G+ L+E  K +   S    +I  EKE E                    VE P     P     PFPQR + +  + QF KFLE+ K+LHINIP  EA+EQ
Subjt:  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQ

Query:  MPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-
        MP+YVKF+KDIL+KK+RLG++ETV+LTEECS I++N LPPK KDP SFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS 
Subjt:  MPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-

Query:  ----------------------------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-------
                                                  LATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +          
Subjt:  ----------------------------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-------

Query:  -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKL
             +E+A  D  D+ +E+  E                        +  +KPS+ + PTL+LKPLP HL Y YLGES+TLP+I++  L     E L+++
Subjt:  -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRT
        L+ ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRT
Subjt:  LQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIE
        VTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPF LCNA ATFQRCM+ IF+DM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIE

Query:  STVEVFMDDFSV
        + +EVFMDDFSV
Subjt:  STVEVFMDDFSV

A0A2G9HYD8 Reverse transcriptase1.7e-22749.45Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR C +H +P  IQ+ TFY+GL    +  +D   G + L+    E + +L  +  N  +    R    K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERISINSCQWSDVRGKNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQ
        + A +  +  ++KN                              E H  + CP +  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQGS  +
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNVQ

Query:  AQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS
         QQ   Q         P Q     P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+
Subjt:  AQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRS

Query:  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQ
        G+ L+E  K +   S    +I  EKE E                    VE P     P     PFPQ+ + +  + QF KFLE+ K+LHINIP  EA+EQ
Subjt:  GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQ

Query:  MPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-
        MP+YVKF+KDIL+KK+RLG++ET +LTEEC+ I++N LPPK KDP SFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS 
Subjt:  MPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-

Query:  ----------------------------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-------
                                                  LATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++         
Subjt:  ----------------------------------------SILATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-------

Query:  -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKL
             +E+A  D  ++ +E+  E                        +  +KPS+ + PTL+LKPLP+HL YVYLGES+TLP+I++  L     E L+++
Subjt:  -----IEKATQDSADKHSEKHGE------------------------APPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKL

Query:  LQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRT
        L+ ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRT
Subjt:  LQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRT

Query:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIE
        VTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPF LCNA ATFQRCM+ IF+DM+E
Subjt:  VTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIE

Query:  STVEVFMDDFSV
        + +EVFMDDFSV
Subjt:  STVEVFMDDFSV

A0A6P8DD93 uncharacterized protein LOC1162064531.4e-22644.74Show/hide
Query:  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--------
        RA+R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H+  FL   ++  +  V  D +RL LFP+SLRD A++W        
Subjt:  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--------

Query:  -------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLE
                                 +EI  F +   E+  EAWERFKE +RKC HHGLP  + +E FY  L+   + +VDA+AGGAL+ K +DEA  ++E
Subjt:  -------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLE

Query:  RISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG
         ++ ++  W + R K+ +V SV ++D ++ +   ++ +   +  +T            +   NQV    C  C   H+   C S NP++      V FV 
Subjt:  RISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG

Query:  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLA
        N QR+N  PYSN YNPGWRNHPNFSW  + + ++      + G       P QN    P Q S + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++
Subjt:  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLA

Query:  NELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFP
         +L  RP G LPS+TE    E  + V A+ LRSGK LE  +RK Q    + +     +K  E  Q   G K                    PYVPP+PFP
Subjt:  NELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFP

Query:  QRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL
        +R K +  D QF KFL++ K+L INIP  EA++QMP+Y +F+KD+LTKK++    E V LT ECS+IL+     LP K +D  SFT+P +IG       L
Subjt:  QRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL

Query:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCN
         D GASINLMPLS++RKLG+GE + T VTLQLADRS                                           LATG+ LIDV++G+LT+RV N
Subjt:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCN

Query:  EEVKFNVFKAMKYPDEMEDCSFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------APPIKP--SLIEAPTLDLKPL
        E++ FNV+ A+K  D+ + C  I I++     +V EKA  D+              D+H E+  E                  KP  SL ++P L+LKPL
Subjt:  EEVKFNVFKAMKYPDEMEDCSFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------APPIKP--SLIEAPTLDLKPL

Query:  PDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP
        P HLKY YLG  +TLPII++  L  + E+ L+ +L+++++AIGWT+ADI+GIS   C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYP
Subjt:  PDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTC
        I+DS WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDH PLPFIDQML++LAG  YYCFLDGYSGYNQI IAPEDQEKTTFTC
Subjt:  IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTC

Query:  PYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV
        PYGTFAFRRMPF LCNA ATFQRCM++IFSDM+E+ +E+FMDDFSV
Subjt:  PYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV

A0A6P8DKJ2 uncharacterized protein LOC1162042311.9e-22644.84Show/hide
Query:  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--------
        RA+R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H+  FL   ++  +  V  D +RL LFP+SLRD A++W        
Subjt:  RAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSW--------

Query:  -------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLE
                                 +EI  F +   E+  EAWERFKE +RKC HHGLP  + +E FY  L+   + +VDA+AGGAL+ K +DEA  ++E
Subjt:  -------------------------SEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLE

Query:  RISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG
         ++ ++  W + R K+ +V SV ++D ++ +   ++ +   +  +T            +   NQV    C  C   H+   C S NP++      V FV 
Subjt:  RISINSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVG

Query:  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLA
        N QR+N  PYSN YNPGWRNHPNFSW  + + ++      + G       P QN    P Q S + +E +M  +M +TD  +Q+ QA++R LE Q+ Q++
Subjt:  N-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLA

Query:  NELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFP
         +L  RP G LPS+TE    E  + V A+ LRSGK LE  +RK Q            E+  E  +G    +E    S  V           PYVPP+PFP
Subjt:  NELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLE-ESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFP

Query:  QRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL
         R K +  D QF KFL++ K+L INIP  EA++QMP+Y +F+KD+LTKK++    E V LT ECS+IL+     LP K +D  SFT+P +IG       L
Subjt:  QRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKDILTKKKRLGEFETVSLTEECSVILKN---GLPPKAKDPRSFTIPVSIGGKELGRAL

Query:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCN
         D GASINLMPLS++RKLG+GE + T +TLQLADRS                                           LATG+ LIDV++G+LT+RV N
Subjt:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS-----------------------------------------SILATGRTLIDVQKGELTMRVCN

Query:  EEVKFNVFKAMKYPDEMEDCSFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------APPIKP--SLIEAPTLDLKPL
        E++ FNV+ A+K  D+ + C  I I++     +V EKA  D+              D+H E+  E                  KP  SL ++P L+LKPL
Subjt:  EEVKFNVFKAMKYPDEMEDCSFIRILE----STVIEKATQDSA-------------DKHSEKHGE--------------APPIKP--SLIEAPTLDLKPL

Query:  PDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP
        P HLKY YLG  +TLPII++  L  + E+ L+ +L+++++AIGWT+ADI+GIS   C H+I LE      ++ QRRLNP +KEVVKKEV+K LDAGIIYP
Subjt:  PDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTC
        I+DS WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++LAG  YYCFLDGYSGYNQI IAPEDQEKTTFTC
Subjt:  IADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTC

Query:  PYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV
        PYGTFAFRRMPF LCNA ATFQRCM++IFSDM+E+ +E+FMDDFSV
Subjt:  PYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.5e-1934.13Show/hide
Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDG
        A ++ V+ ++   L+ GII   ++S + SP+  VPKK   +    K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y+  +D 
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDG

Query:  YSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV
          G++QI + PE   KT F+  +G + + RMPF L NA ATFQRCM  I   ++     V++DD  V
Subjt:  YSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSV

P10394 Retrovirus-related Pol polyprotein from transposon 4123.2e-1834.55Show/hide
Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSG
        E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSG

Query:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVL
        ++QI +    ++ T+F+   G++ F R+PF L  A  +FQR M   FS +  S   ++MDD  V+
Subjt:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVL

P31843 RNA-directed DNA polymerase homolog6.5e-1945.37Show/hide
Query:  RVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVE
        R+C+DYR L K T K+ +P+P +D + DRLA  T++  LD  SGY Q+ IA  D+ KTT    YG+F FR MPF L NA ATF   M  +  + ++  V 
Subjt:  RVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVE

Query:  VFMDDFSV
        V++DD  V
Subjt:  VFMDDFSV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.8e-2436.41Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNEL
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNEL

Query:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIF
                +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPF L NA +TF R M   F
Subjt:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIF

Query:  SDMIESTVEVFMDDFSV
         D+    V V++DD  +
Subjt:  SDMIESTVEVFMDDFSV

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.8e-2436.41Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNEL
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNEL

Query:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIF
                +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPF L NA +TF R M   F
Subjt:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIF

Query:  SDMIESTVEVFMDDFSV
         D+    V V++DD  +
Subjt:  SDMIESTVEVFMDDFSV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAATTCCCTGACTTAGAATTCGGTTTGGACCCTGAAATTGAAAGGACTTTTCATAGGAGGAGAAGGGAACAAAGGGAGAGAAACAATAGGATGGATCCACCACC
TCTTTTACCCCCTGTACCTCCTAGAAATCAAGAAAACCAAAACAACCAGCAGCCTAATCTTGAGGTATATCAGCAACCTAGGGTAGAAAATCAAGCTGAAAACCCTGTCC
TTATAGCCAATGATAGGGGAAGAGCTATTAGAGCCTATGCCGTCCCTACCTTCAATGAGTTGTACCCTGGGATAGCTAGACCCGAGATACAGGCACACAATTTTGAAATG
AAGCCTGTTATGTTCCAAATGTTACAAACGGTGGGACAATTTCATGGATTACCTTCTGAGGACCCACATCTACACCTTAAATCCTTTTTGGGAGTGAGTGACTCATTTAA
AATTCAAGGAGTTTCTCAAGATGCTTTGAGGTTAACCTTGTTTCCTTATTCTTTGAGAGATGGTGCCAAAACTTGGTTAAATAACTTTGCTCCTGGCTCCATCACTACGT
GGAATGAGTTAGCAGAGAAATTCCTCATTAAGTACTTCCCTCCCACAAGAAATGCCAAGCTTAGATCGGATATAGTGTCTTTTAGACAATTTGATGATGAATCTTTTAGT
GAAGCTTGGGAGAGATTTAAGGAATTATTGAGAAAATGCCCTCACCATGGACTCCCCCATTGCATTCAAATGGAGACTTTTTATAATGGGTTGAATTGGGCTACTCAAAA
CATGGTAGATGCTTCAGCTAATGGTGCTTTATTATCCAAGACTTATGATGAAGCTTATGCCATTTTGGAGAGAATTTCCACAAATAGTTGCCAATGGTCTGATTCTAGAA
GTGTGGCTGGGAAGAAAAATAAGGGTATGCTTGAGGTTGACACTGTTACATCCTTTAATGCAAGGTTTGATTCAATGGAAAGCATGATGAAAAATTTAAATTCAAGCTTT
GAGAATTTGCAGTTGATGAATTCCAGTGCAAACCAGTCTGCTGCAGCAATCAATATAAATCAGAATGCAGCAGATTCTTGCGTTTACTGTGGAGAGGATCACAAGTTTGA
ATTTTGCCCTAGAAATCCAGCCTCAGTGTGTTATGTAGGAAACCAAAATGCTCCTAGGAATAATCCCTATTCCAATACCTATAATCCGGGTTGGAGAAATCATCCAAATT
TTGCTTGGGGAGGTGATAACTGCCCAAAAGATTATGTTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGAAGCAGAAACTGCCACATCACAGCTCG
TTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGGGATAAAGGATCAAGGAGAGCCTTACACGAAATGGTTGATCGGAGCTATAATGCAATATCAGAGTTAAT
CGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAATTGGAGAAAAGTCAAATCTCGGTCAACAGCAGATTAGCGTCGAGACGCTAGCTCTTGAGCGTC
TCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGTGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTCAGAACGCGCGTAT
AAGAGGCAGCGTCGCGACGCTGTTCATGAAGGTCCAACAGCAGCAAACCCCCAGCAGAACTCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAA
TCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCC
AAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTA
GGAGTTAGTGATTCTTTTGTAATTCAGGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGAGTGAAATAGTAGG
GTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTCAAGGAGCTTTTGCGAAAGTGTACCCACCATGGTTTACCTCATTGTATTCAAATGGAAACAT
TTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAGCCTATGAAATGTTAGAAAGAATATCT
ATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCAAAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCCGATCTTGCTATGATTGCTAA
CGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCCATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGATGAAGCATGTGTCTATTGCGGTGAAGACC
ACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAAT
TTCTCATGGGGAGGTCAAGGAAGTAACGTACAAGCGCAACAGAAGATGAACCAGTCGGGATTTGCTAAAACGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCA
GCAAAATTCGGGAAATTCTCTCGAGGGGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGG
GTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGT
GGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTATTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGTAGCAA
AGAGAATGCTGGAGCATCTGGTTCGGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATC
AGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGTTAAATTTCTTAAGGAT
ATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAAGGTC
ATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTG
GTGAAGCTAGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCGTCCATTTTGGCTACTGGTAGGACATTAATAGATGTTCAGAAAGGAGAATTAACAATGAGAGTC
TGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGAAAGCAAC
ACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAA
AGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCT
ATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCGTCCTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAA
CCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTA
AGAAAGGAGGTGTCACTGTGGTGTGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACT
CGAAAGGATCATTTCCCTCTACCATTTATCGATCAGATGTTGGATCGATTGGCTGGTCAGACCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTAT
TGCTCCTGAGGATCAGGAAAAAACCACTTTCACTTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCAGCCTTTGCAATGCTCAAGCAACATTTCAGCGGTGTA
TGTTAACAATTTTTTCTGATATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTGGAGGGTCTTTTCAGAGTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGAATTCCCTGACTTAGAATTCGGTTTGGACCCTGAAATTGAAAGGACTTTTCATAGGAGGAGAAGGGAACAAAGGGAGAGAAACAATAGGATGGATCCACCACC
TCTTTTACCCCCTGTACCTCCTAGAAATCAAGAAAACCAAAACAACCAGCAGCCTAATCTTGAGGTATATCAGCAACCTAGGGTAGAAAATCAAGCTGAAAACCCTGTCC
TTATAGCCAATGATAGGGGAAGAGCTATTAGAGCCTATGCCGTCCCTACCTTCAATGAGTTGTACCCTGGGATAGCTAGACCCGAGATACAGGCACACAATTTTGAAATG
AAGCCTGTTATGTTCCAAATGTTACAAACGGTGGGACAATTTCATGGATTACCTTCTGAGGACCCACATCTACACCTTAAATCCTTTTTGGGAGTGAGTGACTCATTTAA
AATTCAAGGAGTTTCTCAAGATGCTTTGAGGTTAACCTTGTTTCCTTATTCTTTGAGAGATGGTGCCAAAACTTGGTTAAATAACTTTGCTCCTGGCTCCATCACTACGT
GGAATGAGTTAGCAGAGAAATTCCTCATTAAGTACTTCCCTCCCACAAGAAATGCCAAGCTTAGATCGGATATAGTGTCTTTTAGACAATTTGATGATGAATCTTTTAGT
GAAGCTTGGGAGAGATTTAAGGAATTATTGAGAAAATGCCCTCACCATGGACTCCCCCATTGCATTCAAATGGAGACTTTTTATAATGGGTTGAATTGGGCTACTCAAAA
CATGGTAGATGCTTCAGCTAATGGTGCTTTATTATCCAAGACTTATGATGAAGCTTATGCCATTTTGGAGAGAATTTCCACAAATAGTTGCCAATGGTCTGATTCTAGAA
GTGTGGCTGGGAAGAAAAATAAGGGTATGCTTGAGGTTGACACTGTTACATCCTTTAATGCAAGGTTTGATTCAATGGAAAGCATGATGAAAAATTTAAATTCAAGCTTT
GAGAATTTGCAGTTGATGAATTCCAGTGCAAACCAGTCTGCTGCAGCAATCAATATAAATCAGAATGCAGCAGATTCTTGCGTTTACTGTGGAGAGGATCACAAGTTTGA
ATTTTGCCCTAGAAATCCAGCCTCAGTGTGTTATGTAGGAAACCAAAATGCTCCTAGGAATAATCCCTATTCCAATACCTATAATCCGGGTTGGAGAAATCATCCAAATT
TTGCTTGGGGAGGTGATAACTGCCCAAAAGATTATGTTGCTGAGCGACTGGAAGGAGCAAATTCTATGCTGCAGCAAAACTGGAAGCAGAAACTGCCACATCACAGCTCG
TTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGGGATAAAGGATCAAGGAGAGCCTTACACGAAATGGTTGATCGGAGCTATAATGCAATATCAGAGTTAAT
CGGGTGCTTGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAATTGGAGAAAAGTCAAATCTCGGTCAACAGCAGATTAGCGTCGAGACGCTAGCTCTTGAGCGTC
TCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGTGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTTCCGTTTTTCCTTATTCAGAACGCGCGTAT
AAGAGGCAGCGTCGCGACGCTGTTCATGAAGGTCCAACAGCAGCAAACCCCCAGCAGAACTCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAATAA
TCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCC
AAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTA
GGAGTTAGTGATTCTTTTGTAATTCAGGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGAGTGAAATAGTAGG
GTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTCAAGGAGCTTTTGCGAAAGTGTACCCACCATGGTTTACCTCATTGTATTCAAATGGAAACAT
TTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAACCTTTTGATGAAGCCTATGAAATGTTAGAAAGAATATCT
ATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCAAAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCCGATCTTGCTATGATTGCTAA
CGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCCATGGAGCCTGCAGCAGTGGTGAACCAAGTCACGGATGAAGCATGTGTCTATTGCGGTGAAGACC
ACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAAT
TTCTCATGGGGAGGTCAAGGAAGTAACGTACAAGCGCAACAGAAGATGAACCAGTCGGGATTTGCTAAAACGCAGGTAATGCCCCAGCAAAATAAGCAGGCTTTGCCCCA
GCAAAATTCGGGAAATTCTCTCGAGGGGATGATGAAAGAATTTATGGCTCGCACAGACGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGG
GTCAGCTAGCTAATGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCCGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGT
GGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAGTGATAATATTATTGTTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGTAGCAA
AGAGAATGCTGGAGCATCTGGTTCGGTGCCAGATGTAGAACCACCATATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATC
AGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCACATAAATATCCCTTTAGTAGAAGCTATTGAGCAAATGCCTAATTATGTTAAATTTCTTAAGGAT
ATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGTTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAAGGTC
ATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTG
GTGAAGCTAGGCCTACCACAGTTACACTCCAATTAGCTGATAGGTCGTCCATTTTGGCTACTGGTAGGACATTAATAGATGTTCAGAAAGGAGAATTAACAATGAGAGTC
TGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGAAAGCAAC
ACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGGCTCCCCCAATTAAGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAA
AGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGCCGGAGCATGAGGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCT
ATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCGTCCTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAA
CCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTA
AGAAAGGAGGTGTCACTGTGGTGTGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACT
CGAAAGGATCATTTCCCTCTACCATTTATCGATCAGATGTTGGATCGATTGGCTGGTCAGACCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTAT
TGCTCCTGAGGATCAGGAAAAAACCACTTTCACTTGCCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCAGCCTTTGCAATGCTCAAGCAACATTTCAGCGGTGTA
TGTTAACAATTTTTTCTGATATGATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTGGAGGGTCTTTTCAGAGTTGTTTAG
Protein sequenceShow/hide protein sequence
MNEFPDLEFGLDPEIERTFHRRRREQRERNNRMDPPPLLPPVPPRNQENQNNQQPNLEVYQQPRVENQAENPVLIANDRGRAIRAYAVPTFNELYPGIARPEIQAHNFEM
KPVMFQMLQTVGQFHGLPSEDPHLHLKSFLGVSDSFKIQGVSQDALRLTLFPYSLRDGAKTWLNNFAPGSITTWNELAEKFLIKYFPPTRNAKLRSDIVSFRQFDDESFS
EAWERFKELLRKCPHHGLPHCIQMETFYNGLNWATQNMVDASANGALLSKTYDEAYAILERISTNSCQWSDSRSVAGKKNKGMLEVDTVTSFNARFDSMESMMKNLNSSF
ENLQLMNSSANQSAAAININQNAADSCVYCGEDHKFEFCPRNPASVCYVGNQNAPRNNPYSNTYNPGWRNHPNFAWGGDNCPKDYVAERLEGANSMLQQNWKQKLPHHSS
LANFMNRLLLRYFRDKGSRRALHEMVDRSYNAISELIGCLGREKMQRNEKSKIGEKSNLGQQQISVETLALERLDAHIPYQIRRVKLTVSRRYDRKRPDASVFPYSERAY
KRQRRDAVHEGPTAANPQQNSLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFL
GVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWSEIVGFRQLEDETFSEAWERFKELLRKCTHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKPFDEAYEMLERIS
INSCQWSDVRGKNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPN
FSWGGQGSNVQAQQKMNQSGFAKTQVMPQQNKQALPQQNSGNSLEGMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS
GKPLEESRKTQDLNSNSDNIIVIEKELESGQGVGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYVKFLKD
ILTKKKRLGEFETVSLTEECSVILKNGLPPKAKDPRSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSSILATGRTLIDVQKGELTMRV
CNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEKATQDSADKHSEKHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVALDLMPEHEEALIKLLQQYRKA
IGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVCNKDNELIPTRTVTGWRVCMDYRRLNKAT
RKDHFPLPFIDQMLDRLAGQTYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFSLCNAQATFQRCMLTIFSDMIESTVEVFMDDFSVLEGLFRVV