; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028477 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028477
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:22881789..22887067
RNA-Seq ExpressionLag0028477
SyntenyLag0028477
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.5e-22144.68Show/hide
Query:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS
        D EIERTF+ RRR+L  +  + V      EN I+I  D      +R  A+P   E    +  P++  A  +++  M +M+Q   QF GL+ E+P+ H  +
Subjt:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS

Query:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------
        FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW +L  +F+SK+F P + A LR+EI+ FRQ   ET  EAW R+ +          
Subjt:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------

Query:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP
                               L     L+ T  E + +L  +  N  +    R T  K   V+EVD V+ + A +  +  ++KN  V   Q  P    
Subjt:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP

Query:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS
                   C  CGE H  + CP +  S+ FV N R   N+PYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q  + 
Subjt:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS

Query:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELE
        SLE  + +FMA       S  A+ + +E Q+GQLA+ + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+   E+ K     K        EKE+ 
Subjt:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELE

Query:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR
        S +   G    A       +  P YV                  +   D+L+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GR
Subjt:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR

Query:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRV
        ALCDLGASINLMP S+YR LG+GEA+PT++TLQLAD+S+TYP G IED+LVKVDKFIFP DF++LD E D +VPI LGRPFLAT R LIDVQKGELTMRV
Subjt:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRV

Query:  CNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCEDVFESLDLDQRKAPP
         ++++ FNVFKAMK+ +E ++C  + + +               +E A+ D  D+ +E       ED EV  ++  +  + F+   V ESL   +R AP 
Subjt:  CNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCEDVFESLDLDQRKAPP

Query:  --IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMK
          +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIG T+ADI+GISPS C+HKI LE+G   S+E QRRLNP MK
Subjt:  --IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMK

Query:  EVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFP
        EVVKKE+IK LDAGIIYPI+DS+WVSPVQCV KKGG TV                          H     + P       +  +DYR+LNKATRKDHFP
Subjt:  EVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFP

Query:  LPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        LPFIDQMLDRLAG+ +YCFLDGY GYNQI I  E+QEKTTFTCPYGTF FR+M FGLCNAP T
Subjt:  LPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]3.8e-23044.97Show/hide
Query:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF
        LD EIE+T +  RR+ + ++  +      A  P      R   ++ Y +P F+ ++  IARP I A NF +     Q ++   +F+GL++EDP+ H ++F
Subjt:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF

Query:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------
        L + D+F + GVP + +RL LF  SL   A+ WL+S    SI TW++L  KFL+KYF P +  +L  EI  F+Q + E+  EA+ER+   L         
Subjt:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------

Query:  ----------------------GSRGLLAKTF-DEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNAL-KNVTVISHQQPPTMEP
                               +RG L   + ++A+EILE I+ N+C+  D R ++KKV  V EVD +++  A +     AL K +  +S  +   ++ 
Subjt:  ----------------------GSRGLLAKTF-DEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNAL-KNVTVISHQQPPTMEP

Query:  TTLVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP
           V  +    C  CGE H  + CP      +  +SV +VG   NQ+N+P+SN YNPGWRNHPNFSW    NNV    + N P     P   QQN +  P
Subjt:  TTLVNQVTDEACVYCGEDHNYEFCP------SNPASVFFVG---NQRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP

Query:  QQ-NSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKND
        Q+   + + E ++ ++M +TDA IQS  ASMRALEMQVGQLAS +  RP G LPS+TE +P+ + +E  KA+TLRSGK +E       T+ +D  GD   
Subjt:  QQ-NSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKND

Query:  IVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYH---------LYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTEECS
        ++ +E         SN  A AS     V PP   P  L            + +F K                 ++   D+L++K+RL EFETV+LTEECS
Subjt:  IVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYH---------LYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTEECS

Query:  AILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD
        AIL+  LPPK KDPGSFTIP +IG +  G+ALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLAD+S+ YP G +EDVLVKVDKFIFP DFI+LD E D
Subjt:  AILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRES-----TVIETAIQDSADKHSEKHGEVSIEDFEVCSLERKN
         D+P+ LGRPFLAT R LIDVQKGELTMRV +++V FNVF AMK+ ++ E C  +           ++E    D  +    + G+ S E+   C  E   
Subjt:  KDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRES-----TVIETAIQDSADKHSEKHGEVSIEDFEVCSLERKN

Query:  EKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKI
             R    FES ++   K+   KPS+ E P L+LK LP HLKY +LGE  TLP+I++S L  EHEE L+++L++Y++AIG  +ADI+GISPS C+HKI
Subjt:  EKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKI

Query:  TLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEAR
        ++E+    +IE QRRLNP MKEVVKKE+IK LDAGIIYPI+DS+WVSP+QCV KKGG TV      K EL     PT          G R          
Subjt:  TLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEAR

Query:  NMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
            +DYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGY GY+QI IA E+QEKTTFTCP+GTFAFR++ FGLCNAP+T
Subjt:  NMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]1.9e-21342.97Show/hide
Query:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF
        N   R++  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH H  +FL + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF

Query:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWER+     +C                                 L++K+ +EAY
Subjt:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ +    A +D+ +K    + +           VN V+    +C  CG  H    C      V FV N
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL
             Q+N+PYSN YNPGWRNHPNFSW  QGN   +    + PGF   P  P ++KQ+   + +   L     E   R +A +    +S R +E+Q+GQL
Subjt:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL

Query:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL
        A+ + +R QG LPS TE      KE  KAVTLRSGK L    ++S   ++    D  ++ +K  E  + +      A     +P VE PYVPP      L
Subjt:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL

Query:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS
                F K     + L I +                 ++++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS
Subjt:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS

Query:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN
        ++LMPLSV RKLG+ E +PTT++LQLAD+S+ YP G +E+VL+KV KFI PVDFI+L+ E D ++PI LGRPFLAT+ A+IDV+ G LT++V  +EV+FN
Subjt:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN

Query:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP
        +F+A KY    +    + + + +  E    ++  +  E      G    ++ E+    C+LE    K +K     ED+        +  PP  PS ++AP
Subjt:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP

Query:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL
         L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAIG T++D++GISPSLC+H+I +E+     +E QRRLNP MKEVV+ EV+K L
Subjt:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL

Query:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL
        DAGIIYPI+DS W+SPVQ V KKGG TV                          H +   + P       +  +DYR+LN  TRKDHFPLPF+DQ+L+RL
Subjt:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL

Query:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        AG AYYCFLDGY GYNQI+I+ E+QEKTTFTCPYGTFAFRRM FGLCNAP T
Subjt:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]1.9e-21342.97Show/hide
Query:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF
        N   R++  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH H  +FL + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF

Query:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWER+     +C                                 L++K+ +EAY
Subjt:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ +    A +D+ +K    + +           VN V+    +C  CG  H    C      V FV N
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL
             Q+N+PYSN YNPGWRNHPNFSW  QGN   +    + PGF   P  P ++KQ+   + +   L     E   R +A +    +S R +E+Q+GQL
Subjt:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL

Query:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL
        A+ + +R QG LPS TE      KE  KAVTLRSGK L    ++S   ++    D  ++ +K  E  + +      A     +P VE PYVPP      L
Subjt:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL

Query:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS
                F K     + L I +                 ++++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS
Subjt:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS

Query:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN
        ++LMPLSV RKLG+ E +PTT++LQLAD+S+ YP G +E+VL+KV KFI PVDFI+L+ E D ++PI LGRPFLAT+ A+IDV+ G LT++V  +EV+FN
Subjt:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN

Query:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP
        +F+A KY    +    + + + +  E    ++  +  E      G    ++ E+    C+LE    K +K     ED+        +  PP  PS ++AP
Subjt:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP

Query:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL
         L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAIG T++D++GISPSLC+H+I +E+     +E QRRLNP MKEVV+ EV+K L
Subjt:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL

Query:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL
        DAGIIYPI+DS W+SPVQ V KKGG TV                          H +   + P       +  +DYR+LN  TRKDHFPLPF+DQ+L+RL
Subjt:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL

Query:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        AG AYYCFLDGY GYNQI+I+ E+QEKTTFTCPYGTFAFRRM FGLCNAP T
Subjt:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]8.5e-21442.97Show/hide
Query:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF
        N   R++  YAVP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH H  +FL + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSF

Query:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWER+     +C                                 L++K+ +EAY
Subjt:  APGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ +    A +D+ +K  + + +           VN V+    +C  CG  H    C      V FV N
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDE--ACVYCGEDHNYEFCPSNPASVFFVGN

Query:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL
             Q+N+PYSN YNPGWRNHPNFSW  QGN   +    + PGF   P  P ++KQ+   + +   L     E   R +A +    +S R +E+Q+GQL
Subjt:  -----QRNDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQL

Query:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL
        A+ + +R QG LPS TE      KE  KAVTLRSGK L    ++S   ++    D  ++ +K  E  + +      A     +P VE PYVPP      L
Subjt:  ASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHL

Query:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS
                F K     + L I +                 ++++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS
Subjt:  ------YLFHK----GKSLRIRM-----------------DLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGAS

Query:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN
        ++LMPLSV RKLG+ E +PTT++LQLAD+S+ YP G +E+VL+KV KFI PVDFI+L+ E D ++PI LGRPFLAT+ A+IDV+ G LT++V  +EV+FN
Subjt:  INLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFN

Query:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP
        +F+A KY    +    + + + +  E    ++  +  E      G    ++ E+    C+LE    K +K     ED+        +  PP  PS ++AP
Subjt:  VFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSE----KHGEVSIEDFEV----CSLER---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP

Query:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL
         L+LKPLP HL Y +LGE+ TLP+IV+  L  E  + LI++L+  +KAIG T++D++GISPSLC+H+I +E+     +E QRRLNP MKEVV+ EV+K L
Subjt:  TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLL

Query:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL
        DAGIIYPI+DS W+SPVQ V KKGG TV                          H +   + P       +  +DYR+LN  TRKDHFPLPF+DQ+L+RL
Subjt:  DAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRL

Query:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        AG AYYCFLDGY GYNQI+I+ E+QEKTTFTCPYGTFAFRRM FGLCNAP T
Subjt:  AGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase1.2e-22144.68Show/hide
Query:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS
        D EIERTF+ RRR+L  +  + V      EN I+I  D      +R  A+P   E    +  P++  A  +++  M +M+Q   QF GL+ E+P+ H  +
Subjt:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS

Query:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------
        FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW +L  +F+SK+F P + A LR+EI+ FRQ   ET  EAW R+ +          
Subjt:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------

Query:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP
                               L     L+ T  E + +L  +  N  +    R T  K   V+EVD V+ + A +  +  ++KN  V   Q  P    
Subjt:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP

Query:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS
                   C  CGE H  + CP +  S+ FV N R   N+PYSN YNPGWR HPNFSW    NN Q Q      G A       Q +   P Q  + 
Subjt:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS

Query:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELE
        SLE  + +FMA       S  A+ + +E Q+GQLA+ + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+   E+ K     K        EKE+ 
Subjt:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELE

Query:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR
        S +   G    A       +  P YV                  +   D+L+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GR
Subjt:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGR

Query:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRV
        ALCDLGASINLMP S+YR LG+GEA+PT++TLQLAD+S+TYP G IED+LVKVDKFIFP DF++LD E D +VPI LGRPFLAT R LIDVQKGELTMRV
Subjt:  ALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRV

Query:  CNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCEDVFESLDLDQRKAPP
         ++++ FNVFKAMK+ +E ++C  + + +               +E A+ D  D+ +E       ED EV  ++  +  + F+   V ESL   +R AP 
Subjt:  CNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCEDVFESLDLDQRKAPP

Query:  --IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMK
          +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIG T+ADI+GISPS C+HKI LE+G   S+E QRRLNP MK
Subjt:  --IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMK

Query:  EVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFP
        EVVKKE+IK LDAGIIYPI+DS+WVSPVQCV KKGG TV                          H     + P       +  +DYR+LNKATRKDHFP
Subjt:  EVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFP

Query:  LPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        LPFIDQMLDRLAG+ +YCFLDGY GYNQI I  E+QEKTTFTCPYGTF FR+M FGLCNAP T
Subjt:  LPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

A0A2G9HWF8 Reverse transcriptase4.2e-19841.08Show/hide
Query:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS
        DPEIERTFR RRR+L  +  + V      EN I++  D      +R  A+P   E    +  P++  A  +++  M QM+Q   QF GL+ E+P+ H  +
Subjt:  DPEIERTFRNRRRELRRNQMENVPQLPQAENPILIAND--RTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKS

Query:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------
        FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW +L  +F+SK+F P + A LR+EI+ FRQ   ET  EAW R+ +          
Subjt:  FLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRC---------

Query:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP
                               L     L+ T  E + +L  + +N  +    R T  K   V+EVD V+ + A +  +  ++KN              
Subjt:  -----------------------LGSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEP

Query:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS
                      CGE H  + CP +  S+ FV N R   N+PYSN YNPGWR HPNFSW             N+ G   AP   Q       Q N++ 
Subjt:  TTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRS

Query:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL-EVRIELSKTQVIDKNGDKNDIVEKELE
                                                           +PR++GK Q +AVTLR+G+ L EV  E +K++  +   ++    EKE+E
Subjt:  SLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL-EVRIELSKTQVIDKNGDKNDIVEKELE

Query:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGS
        +   V   +K        P            +  L +F K                 ++   D+L+KK+RLG++ETV+LTEECSAI++N LPPK KDPG 
Subjt:  SGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGS

Query:  FTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSR
                     RALCDLGASINLMP S+YR LG+ EA+PT++TLQLAD+S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPI LGRPFLAT R
Subjt:  FTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSR

Query:  ALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCED
         LIDVQKGELTMRV ++++ FNVFKAMK+ +E ++C  + + ++              +E A+ D  D+ +E+  EV         ++  +  + F+   
Subjt:  ALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTV------------IETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCED

Query:  VFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSF
        V ESL   +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++ AIG T+ADI+GISPS C+HKI LE+   
Subjt:  VFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSF

Query:  RSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDY
         S+E QRRLNP MKEVVKKE+IK LDAGIIYPI+D +W+SPVQCV KKGG TV         +   + PT          G R              +DY
Subjt:  RSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDY

Query:  RRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        R+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGY GYNQI IA E+QEKTTFTCPYGTFAFRR+ F LCNAP T
Subjt:  RRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

A0A6P6XAQ1 Reverse transcriptase4.7e-20241.12Show/hide
Query:  RVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGS
        R++R +A+P        I RP + A NFE+KP + QM+Q   Q+ G  +EDP+ H  +FL + D+    GV  DA++L LF +SLRD AK WL S  P +
Subjt:  RVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSFLGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGS

Query:  IRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAYEILE
          TWDELA+ FL+K+FPP + AKLR +I  F Q E ET  EAWERY     RC                                 L+ KT +EA +++E
Subjt:  IRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERY----GRC----------------------------LGSRGLLAKTFDEAYEILE

Query:  RISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QR
         ++ N+ QW++ RG +++   +LEVD ++ + A +   DN +K   +++ Q   +     +V      +C  CG DH+ +F  S+   V ++ N     +
Subjt:  RISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN----QR

Query:  NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQ--------NKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQ
        N+PYSN YNPGWRNHPNF W  QGN    Q   N PGF +   L +          K A    +    L +   +   R +  +       R +E+Q+GQ
Subjt:  NDPYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQ--------NKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQ

Query:  LASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMY
        +A+ +  R QG LPS TE +PR    E VKA+TLRSGK L     +   +  +K  +K   + +  E  +   G  K       + D  P  +PP    Y
Subjt:  LASELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMY

Query:  HLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
          +L           +++TKK++L + ET++LTEECSAI++N LPPK KDPGSFT+P +IG  E  +ALCDLGAS++L+PL+V R+LG+ E + T ++LQ
Subjt:  HLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ

Query:  LADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVI
        LAD+SI +P G +E+VL+KV KFI PVDF++LD E D +VPI LGRPFLAT+  +IDV++G+   ++  +EV+F++ K  KY    +    + I +   +
Subjt:  LADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVI

Query:  ETAIQDSADKHSEK--HGEVSIEDFEVCSLERKNEKEL-FRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPE
        E + Q + D  S +     + I++ ++  +    + ++ ++  + +E+L L +   P   PS  +AP L+LKPLP HLKY +LGE ETLP+IV S L  E
Subjt:  ETAIQDSADKHSEK--HGEVSIEDFEVCSLERKNEKEL-FRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPE

Query:  HEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQ
          + L+++L+++ KAIG T++DI+GISP++C+H+I LEE S   +E QRRLNP MKEVV+ E++K LDAGII+PI+DS W+SP+  V KKGG T+     
Subjt:  HEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQ

Query:  GKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCP
                             HGK   + P       +  +DYR+LN ATRKDHFPLPF+DQM++RLAG  +YCFLDG+ GYNQI IA E+QEKTTFTCP
Subjt:  GKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKT-LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCP

Query:  YGTFAFRRMHFGLCNAPTT
        YGTFAFRRM FGLCNAP T
Subjt:  YGTFAFRRMHFGLCNAPTT

A0A6P8DD93 uncharacterized protein LOC1162064533.0e-20441.9Show/hide
Query:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF
        LDPEIERT    RRE RR +   V ++   +    I     R +R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H   F
Subjt:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF

Query:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------
        L   ++  +  V  D +RL LF +SLRD A+AW NS    SI TW +L+ KFL ++FPP R A+LR+EI  F +   E+  EAWER+   +         
Subjt:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------

Query:  -----------------------GSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPT
                                   L+ K +DEA  ++E ++ ++  W + R +  +V SV ++D ++ +   ++ +   +  +         T   +
Subjt:  -----------------------GSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPT

Query:  TLVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRND--PYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP
           NQV    C  C   H+   C S NP++      V FV N QR++  PYSN YNPGWRNHPNFSW  + N ++       PGF K     Q   Q  P
Subjt:  TLVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRND--PYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP

Query:  QQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDK-NDI
         Q S+S +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    E  + V A+ LRSGK LE+    ++TQ      DK    
Subjt:  QQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDK-NDI

Query:  VEKELESGQGVGGSNKNAGASGSVPDVEPPYVPP-------------SHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTE
        VE+  +   GV                  PYVPP             +     L +F K                  R   DLLTKK++    E V LT 
Subjt:  VEKELESGQGVGGSNKNAGASGSVPDVEPPYVPP-------------SHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTE

Query:  ECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFII
        ECS IL+     LP K +D GSFT+P +IG       L D GASINLMPLS++RKLG+GE + T VTLQLAD+SI YP+G +E+VLVKVDKFIFPVDFI+
Subjt:  ECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFII

Query:  LDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSEKHGEVSIEDFEVCSLERK
        L+ E D++VP+ LGRPFLAT +ALIDV++G+LT+RV N+++ FNV+ A+K  D+ + C  I I +  + E+      +K      E  + D +  S + +
Subjt:  LDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSEKHGEVSIEDFEVCSLERK

Query:  NEKELFRCEDVFESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCV
        +E+E        ++   ++      KP  SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E+ L+ +L+++++AIG T+ADI+GISP +C 
Subjt:  NEKELFRCEDVFESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCV

Query:  HKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLE
        H+I LE      ++ QRRLNP +KEVVKKEV+KLLDAGIIYPI+DS WVSPVQ V KKGG TV      K E+ ++  PT          G R  +    
Subjt:  HKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLE

Query:  EARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
                DYR+LN ATRKDH PLPFIDQML++LAG  YYCFLDGY GYNQI IA E+QEKTTFTCPYGTFAFRRM FGLCNAP T
Subjt:  EARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

A0A6P8DKJ2 uncharacterized protein LOC1162042311.3e-20441.9Show/hide
Query:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF
        LDPEIERT    RRE RR +   V ++   +    I     R +R YAVP    +   I RP I A NFE+KP + QM+Q+  QF G  +E P  H   F
Subjt:  LDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF

Query:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------
        L   ++  +  V  D +RL LF +SLRD A+AW NS    SI TW +L+ KFL ++FPP R A+LR+EI  F +   E+  EAWER+   +         
Subjt:  LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCL---------

Query:  -----------------------GSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPT
                                   L+ K +DEA  ++E ++ ++  W + R +  +V SV ++D ++ +   ++ +   +  +         T   +
Subjt:  -----------------------GSRGLLAKTFDEAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPT

Query:  TLVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRND--PYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP
           NQV    C  C   H+   C S NP++      V FV N QR++  PYSN YNPGWRNHPNFSW  + N ++       PGF K     Q   Q  P
Subjt:  TLVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVGN-QRND--PYSNFYNPGWRNHPNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALP

Query:  QQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDK-NDI
         Q S+S +E +M  +M +TD  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+TE    E  + V A+ LRSGK LE+    ++TQ      DK    
Subjt:  QQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEVRIELSKTQVIDKNGDK-NDI

Query:  VEKELESGQGVGGSNKNAGASGSVPDVEPPYVPP-------------SHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTE
        VE+      GV                  PYVPP             +     L +F K                  R   DLLTKK++    E V LT 
Subjt:  VEKELESGQGVGGSNKNAGASGSVPDVEPPYVPP-------------SHLMYHLYLFHK--------------GKSLRIRMDLLTKKKRLGEFETVSLTE

Query:  ECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFII
        ECS IL+     LP K +D GSFT+P +IG       L D GASINLMPLS++RKLG+GE + T +TLQLAD+SI YP+G +E+VLVKVDKFIFPVDFI+
Subjt:  ECSAILKN---GLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFII

Query:  LDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSEKHGEVSIEDFEVCSLERK
        L+ E D++VP+ LGRPFLAT +ALIDV++G+LT+RV N+++ FNV+ A+K  D+ + C  I I +  + E+      +K      E  + D +  S + +
Subjt:  LDYEADKDVPINLGRPFLATSRALIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSEKHGEVSIEDFEVCSLERK

Query:  NEKELFRCEDVFESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCV
        +E+E        ++   ++      KP  SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E+ L+ +L+++++AIG T+ADI+GISP +C 
Subjt:  NEKELFRCEDVFESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCV

Query:  HKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLE
        H+I LE      ++ QRRLNP +KEVVKKEV+KLLDAGIIYPI+DS WVSPVQ V KKGG TV      K E+ ++  PT          G R  +    
Subjt:  HKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLE

Query:  EARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
                DYR+LN ATRKDHFPLPFIDQML++LAG  YYCFLDGY GYNQI IA E+QEKTTFTCPYGTFAFRRM FGLCNAP T
Subjt:  EARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4123.1e-0939.24Show/hide
Query:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        +DYR++NK    D FPLP ID +LD+L    Y+  LD   G++QI +    ++ T+F+   G++ F R+ FGL  AP +
Subjt:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

P20825 Retrovirus-related Pol polyprotein from transposon 2971.4e-0939.24Show/hide
Query:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++QI +  E   KT F+   G + + RM FGL NAP T
Subjt:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

P31843 RNA-directed DNA polymerase homolog1.5e-1145.57Show/hide
Query:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT
        +DYR L K T K+ +P+P +D + DRLA   ++  LD   GY Q+ IA  ++ KTT    YG+F FR M FGL NA  T
Subjt:  LDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTFAFRRMHFGLCNAPTT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.5e-1431.63Show/hide
Query:  LLQQYRKAIGLTL----ADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKG
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V KLLD   I P + S   SPV  V KK G           
Subjt:  LLQQYRKAIGLTL----ADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKG

Query:  ELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTF
         LC                                 +DYR LNKAT  D FPLP ID +L R+     +  LD + GY+QI +  +++ KT F  P G +
Subjt:  ELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTF

Query:  AFRRMHFGLCNAPTT
         +  M FGL NAP+T
Subjt:  AFRRMHFGLCNAPTT

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.5e-1431.63Show/hide
Query:  LLQQYRKAIGLTL----ADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKG
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V KLLD   I P + S   SPV  V KK G           
Subjt:  LLQQYRKAIGLTL----ADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIADSNWVSPVQCVHKKGGATVRWTRQGKG

Query:  ELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTF
         LC                                 +DYR LNKAT  D FPLP ID +L R+     +  LD + GY+QI +  +++ KT F  P G +
Subjt:  ELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIASEEQEKTTFTCPYGTF

Query:  AFRRMHFGLCNAPTT
         +  M FGL NAP+T
Subjt:  AFRRMHFGLCNAPTT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCACCTGGGGTATGGTTTGAGCTTGATCCGGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCTGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
TCCGCAGGCTGAGAATCCTATCCTGATAGCGAATGATAGGACTAGAGTCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAA
TCCAAGCAGCAAATTTTGAAATGAAACCGGTAATGTTTCAAATGTTGCAAACTGTGGGACAATTCCATGGTTTGACATCTGAGGACCCTCATTTACATCATAAGTCTTTT
CTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCCCTTAGATTAACTTTGTTCATGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTT
TGCTCCAGGATCAATTAGGACGTGGGATGAGTTAGCTAGAAAATTTTTGAGTAAATATTTCCCGCCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGC
AACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTATGGTCGATGCCTCGGCAGTAGGGGCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAGAGA
ATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGAT
TGATAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCGACTATGGAGCCTACAACATTGGTGAACCAAGTCACAGACGAAGCATGTGTCTATTGTGGTG
AAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGAAATCAGAGGAACGATCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCAC
CCAAACTTCTCATGGGGAGGTCAGGGAAATAACGTACAAGCACAAGAAAAGACGAACCACCCGGGATTTGCTAAAGCGCCGGTATTGCCCCAACAAAACAAGCAGGCTTT
GCCCCAGCAAAATTCGAGAAGTTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAAATGC
AAGTGGGTCAGCTAGCTAGTGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTT
AGGAGTGGTAAGCCATTAGAAGTAAGAATTGAGCTTAGTAAAACCCAGGTTATAGATAAAAATGGTGATAAAAATGATATTGTTGAAAAAGAGTTGGAGTCTGGTCAGGG
TGTTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCCCCCTCCCACCTTATGTACCACCTCTACCTATTCCACAAAG
GCAAAAGCCTAAGAATCAGGATGGATCTTTTAACTAAAAAGAAAAGGTTAGGTGAATTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTA
CCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTATGTGATTTAGGCGCAAGCATTAACCTTATGCCTCT
TTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAAGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAG
TAAAAGTGGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTAATCTAGGTCGTCCATTTTTGGCTACTAGTAGGGCA
TTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATAAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCAAGACGAAATGGAGGATTGCTCCTT
CATCAGGATTCGGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTATAGAGGATTTTGAAGTTTGTTCAT
TAGAAAGAAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCT
ACTTTAGATTTGAAACCCTTGCCGGATCATCTAAAGTACGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCAGAGCATGAAGAGGC
CTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTTGACATTGGCTGATATTCAGGGAATTAGCCCATCTCTTTGTGTGCACAAAATCACTCTAGAGGAGGGAT
CATTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATTGTTGGATGCTGGTATTATTTATCCAATTGCCGAT
AGCAATTGGGTGAGCCCTGTCCAATGTGTTCATAAGAAAGGAGGTGCCACCGTGAGGTGGACAAGACAGGGAAAAGGAGAGCTCTGTCAGATGTATTTTCCCACAGCAGT
CGCGCCTCTTCAAGAGGCGCGACACGGAAAACGACAAAGTGTCGCGCCTTTAGAAGAGGCGCGAAACATGAAAACCCTGGATTACAGGAGGCTTAATAAAGCTACCCGTA
AGGACCATTTCCCTCTACCATTCATCGACCAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTAGATGGTTATTTTGGGTACAACCAGATTACTATTGCT
TCTGAGGAACAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGAGAATGCATTTCGGCCTCTGCAATGCTCCAACAACAGTTCTGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCACCTGGGGTATGGTTTGAGCTTGATCCGGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCTGCGCAGAAACCAGATGGAGAACGTGCCGCAACT
TCCGCAGGCTGAGAATCCTATCCTGATAGCGAATGATAGGACTAGAGTCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAA
TCCAAGCAGCAAATTTTGAAATGAAACCGGTAATGTTTCAAATGTTGCAAACTGTGGGACAATTCCATGGTTTGACATCTGAGGACCCTCATTTACATCATAAGTCTTTT
CTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCCCTTAGATTAACTTTGTTCATGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTT
TGCTCCAGGATCAATTAGGACGTGGGATGAGTTAGCTAGAAAATTTTTGAGTAAATATTTCCCGCCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGC
AACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTATGGTCGATGCCTCGGCAGTAGGGGCCTTTTGGCAAAAACTTTTGATGAAGCTTATGAAATTTTAGAGAGA
ATATCTATTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGAT
TGATAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCGACTATGGAGCCTACAACATTGGTGAACCAAGTCACAGACGAAGCATGTGTCTATTGTGGTG
AAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGAAATCAGAGGAACGATCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCAC
CCAAACTTCTCATGGGGAGGTCAGGGAAATAACGTACAAGCACAAGAAAAGACGAACCACCCGGGATTTGCTAAAGCGCCGGTATTGCCCCAACAAAACAAGCAGGCTTT
GCCCCAGCAAAATTCGAGAAGTTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAAATGC
AAGTGGGTCAGCTAGCTAGTGAGCTAAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTT
AGGAGTGGTAAGCCATTAGAAGTAAGAATTGAGCTTAGTAAAACCCAGGTTATAGATAAAAATGGTGATAAAAATGATATTGTTGAAAAAGAGTTGGAGTCTGGTCAGGG
TGTTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTACCCCCCTCCCACCTTATGTACCACCTCTACCTATTCCACAAAG
GCAAAAGCCTAAGAATCAGGATGGATCTTTTAACTAAAAAGAAAAGGTTAGGTGAATTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTA
CCCCCCAAGGCTAAGGATCCAGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTATGTGATTTAGGCGCAAGCATTAACCTTATGCCTCT
TTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAAGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAG
TAAAAGTGGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTAATCTAGGTCGTCCATTTTTGGCTACTAGTAGGGCA
TTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATAAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCAAGACGAAATGGAGGATTGCTCCTT
CATCAGGATTCGGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTATAGAGGATTTTGAAGTTTGTTCAT
TAGAAAGAAAAAATGAAAAAGAGTTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCT
ACTTTAGATTTGAAACCCTTGCCGGATCATCTAAAGTACGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGCCAGAGCATGAAGAGGC
CTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTTGACATTGGCTGATATTCAGGGAATTAGCCCATCTCTTTGTGTGCACAAAATCACTCTAGAGGAGGGAT
CATTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATTGTTGGATGCTGGTATTATTTATCCAATTGCCGAT
AGCAATTGGGTGAGCCCTGTCCAATGTGTTCATAAGAAAGGAGGTGCCACCGTGAGGTGGACAAGACAGGGAAAAGGAGAGCTCTGTCAGATGTATTTTCCCACAGCAGT
CGCGCCTCTTCAAGAGGCGCGACACGGAAAACGACAAAGTGTCGCGCCTTTAGAAGAGGCGCGAAACATGAAAACCCTGGATTACAGGAGGCTTAATAAAGCTACCCGTA
AGGACCATTTCCCTCTACCATTCATCGACCAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTAGATGGTTATTTTGGGTACAACCAGATTACTATTGCT
TCTGAGGAACAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGAGAATGCATTTCGGCCTCTGCAATGCTCCAACAACAGTTCTGATATGA
Protein sequenceShow/hide protein sequence
MSDPPGVWFELDPEIERTFRNRRRELRRNQMENVPQLPQAENPILIANDRTRVIRAYAVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHHKSF
LGVSDSFVIQGVPRDALRLTLFMYSLRDGAKAWLNSFAPGSIRTWDELARKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERYGRCLGSRGLLAKTFDEAYEILER
ISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIDNALKNVTVISHQQPPTMEPTTLVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNDPYSNFYNPGWRNH
PNFSWGGQGNNVQAQEKTNHPGFAKAPVLPQQNKQALPQQNSRSSLEAMMKEFMARTDAAIQSNQASMRALEMQVGQLASELKARPQGKLPSDTEHPRREGKEQVKAVTL
RSGKPLEVRIELSKTQVIDKNGDKNDIVEKELESGQGVGGSNKNAGASGSVPDVEPPYVPPSHLMYHLYLFHKGKSLRIRMDLLTKKKRLGEFETVSLTEECSAILKNGL
PPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADKSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPINLGRPFLATSRA
LIDVQKGELTMRVCNKEVKFNVFKAMKYQDEMEDCSFIRIRESTVIETAIQDSADKHSEKHGEVSIEDFEVCSLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAP
TLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIGLTLADIQGISPSLCVHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKLLDAGIIYPIAD
SNWVSPVQCVHKKGGATVRWTRQGKGELCQMYFPTAVAPLQEARHGKRQSVAPLEEARNMKTLDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYFGYNQITIA
SEEQEKTTFTCPYGTFAFRRMHFGLCNAPTTVLI