; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041653 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041653
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:23097850..23100552
RNA-Seq ExpressionLag0041653
SyntenyLag0041653
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]1.5e-19543.73Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD--SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR
        RSGGL L+WK +++VS++S++  HIDA +  +  S+ WRFTG YG+P+   R+ +W LL+RL + +   WV  GDFN  +   EKEGG      Q+ NF 
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD--SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR

Query:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS
         A++ C L+DL Y+G  FTW+ R G      ERLDR + +  +   FP   + H   ++SDHC ++L+   D  S    K   + FRFE +W  +  C  
Subjt:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS

Query:  LITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQ-AHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKN
        +++     G      S L  CL+     L  W K    ++ + I+  +  L+     K  P   + I      L+K LE EE+ W Q+S  +WLK GDKN
Subjt:  LITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQ-AHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKN

Query:  TQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPD
        T +FH  AS + +RNTI  I   NGEW  D   I + F EYF ++F+++NP +   D  L  +Q KVT +MN   +  F   E+ RA+KQM P+ APGPD
Subjt:  TQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPD

Query:  GFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDN
        G P +FYQ YW  V        LD LN       +NET+I L+PKV  PK V+DFRP SLCNV+YKI +K I +R+K IL  ++ ENQSAFV  R I DN
Subjt:  GFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDN

Query:  IIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSE
        +++  E +H I  ++KG+ G +A+KLDMSKAYDRVEW  ++++ML L F  RWV LIM CV++  +++ ING P   ITP RGLRQGDPLSPYLFL C+E
Subjt:  IIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSE

Query:  ALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGR
         LSA+   A+ R+ L GI A +  PK+SHLFFADDSL+F QA+ ++  E+R +L  YE +SGQ +N  K++++FS N   EV++ IK + G QV++    
Subjt:  ALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGR

Query:  YLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLCAP
        YLG+ S   RS+   F  +K++V + L GWK K  S  GKE LIK+VA+A+PT+ MSCF +P+++CD++  +V++FWWG  + +RKM W  W+ LC P
Subjt:  YLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLCAP

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]3.8e-19143.04Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD-SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS
        RSGGL L+W ++I++ I++FT  HIDA I  D +  WR TG YG P    +  +W+LL+ LH++    W+  GDFN  L  EEK+GG+P   + + NFR 
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD-SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS

Query:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL
        AL  CGL DL Y G+ FTWTN  G  D   ERLDR  A  E+   F    VTHL  + SDH PI++            K  H   RFEE WAT P+C ++
Subjt:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL

Query:  ITRTGQWGS---HHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDK
        I     W S   + +  ++L   +K     L  W +      +  + + + +L+    + R  + ++I+ ++  +   + Q+E++W+Q+S   WL  GDK
Subjt:  ITRTGQWGS---HHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDK

Query:  NTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGP
        NT++FH  AS ++R+N ISG+   +  W T   QI +    YF  +FS+ +P     +  LQ +QRKVT  MN+     +   E+  A+ QMHPSK+PGP
Subjt:  NTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGP

Query:  DGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFD
        DG    F+QKYW  +G       L  L     +K  N T+I L+PK   PK ++D+RPISL NV  +II+KVI NR+K IL ++IS++QSAFVP R I D
Subjt:  DGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFD

Query:  NIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCS
        N  + +E LH +R+R++G+ G +A+KLD+SKAYDRVEW F+  +M  L F PRWV L M+ VTTA +S+ ING P   IT  RG+RQGDPLSPYLFLLC+
Subjt:  NIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCS

Query:  EALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLG
        E L+ALL+ A+  +++ GI + +    ISHL FADDSL+FC+A+V + ++L S+L +YE ASGQ +N  K+++FFS N   E+R  I+  MG +V+ N  
Subjt:  EALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLG

Query:  RYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
        +YLG+  +  +S+   FK +++++ + + GWK KF S  G+E LIK+VAQAIPT+ MS F LP T+CD I+ L+A++WWG  + +RK+HW  W+ LC
Subjt:  RYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]6.5e-19142.61Show/hide
Query:  GGLCLMWKDDINVSIRSFTFYHIDAAIKWDSK-SWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSAL
        GGL L+WK+D+ + + SF+ YHIDA +   S+ +WR TG YG P    R   W +LR L ++ +  W   GDFN  L   +K GGVP   +Q+Q+FR AL
Subjt:  GGLCLMWKDDINVSIRSFTFYHIDAAIKWDSK-SWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSAL

Query:  DDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKH---KHRIFRFEEVWATQPECHS
        D CG  DL + G  FTW  R+   ++  ERLDR +AN E+   FP   V HLN   SDH P++L   S+      GK    + + FRFE +W + P C  
Subjt:  DDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKH---KHRIFRFEEVWATQPECHS

Query:  LITRTGQWGSHHTGFSRLELC--LKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGD
         +T    W     G   L     +K     LK+W K    N++  I   K  L  A     +  D  ++  ++  L   LE+EE  W Q+S   WL+ GD
Subjt:  LITRTGQWGSHHTGFSRLELC--LKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGD

Query:  KNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPG
        +NT++FH +A+ +KR+N I G+  +NG W ++        T+++  +F S+NP     D  +  +Q+ VT+ MN    + +   E+ RA+K M P KAPG
Subjt:  KNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPG

Query:  PDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIF
        PDG P LFYQ YW++V        L  LN    +K+ N T ITL+PKV  P++V++FRPISLCNV YKI++K I NR+K +L  IIS+ QSAF+  R I 
Subjt:  PDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIF

Query:  DNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLC
        DN++I  E LH +++   G+ G++A+KLDMSKAYDRVEW F+ K++L L F   WV+LIM+C+TT  +S+L+NG P   ITP RGLRQGDPLSPYLFL C
Subjt:  DNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLC

Query:  SEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENL
        +E L+A+   A     + G    +  PK++HLFFADD L+FC++S+++ E+++ +L  YE ASGQ+VN  K+T+FFS N    V++ IKN +G+  + + 
Subjt:  SEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENL

Query:  GRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
         +YLG+ S   R+++  F  +K+R+W  +QGWK K  S  GKE +IK+V Q+IPT+ MS F LP  LC DI  ++ +FWWG  E  RK+HW  W +LC
Subjt:  GRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]3.1e-20144.93Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI-KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS
        RSGGL L+WKDDIN+ I +++ +HI A+I   D   W  TG+YGH ++G R   W+LL+ L       W++ GDFN  L + EK GG    + Q++ FR 
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI-KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS

Query:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL
         L DC L+DL Y+G  FTW+NR+GE D   ERLDRF+AN  +  +FPN  VTH   A SDH P+ L    D +   + +   R+FRFE +W  + EC S+
Subjt:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL

Query:  ITRTGQWGSHHTGFSRLELC--LKNVSTGLKQWGKGPLANIRRDISKYKSMLQA---HYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWG
        I R   WG  H   S  ++   + + +T L +W K    +++++++  K  LQ    + S     +      +E  + K LE++E+ WKQ+S   WL+ G
Subjt:  ITRTGQWGSHHTGFSRLELC--LKNVSTGLKQWGKGPLANIRRDISKYKSMLQA---HYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWG

Query:  DKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAP
        D N+++FH  AS ++R+N+I  +  ++G W     Q++   TEYF  +F++ +  +   DV L  ++ +VT EMN+  ++ +   E+  A+KQMHPSKAP
Subjt:  DKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAP

Query:  GPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSI
        GPDG P LF+QKYW  +G +     L  LN   F    N T ITL+PK   P +V+DFRPISLCNV YKI++KVI NR+K +L DIIS +QSAFVPGR I
Subjt:  GPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSI

Query:  FDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLL
         DN++I +E LH +R+++KGRKG++++KLDMSKAYDRV+W F+ K+M +L F  + ++LIM CV T  FS+L+NGSP   I P RGLRQGDPLSPYLFLL
Subjt:  FDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLL

Query:  CSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVEN
        C+E L +LL    SR+ + GI+  +  P+I+HL FADDS++FC+A V    +++S+LNKYE ASGQ +N  K++M FS NV ++++ +I  + G    + 
Subjt:  CSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVEN

Query:  LGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
          +YLG      RS+++ F  +K+RVWQ LQ WKG   S GG+E LIK+VA +IPT+ MSCF  P TLC ++  ++ARFWWG    + K+HW +WE LC
Subjt:  LGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis]8.5e-19142.91Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI---KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNF
        R GGL L WK +++++I  ++  HI A I   + +S+ W  TGIYG P    R  TW L+R L N D   W++ GDFN  +   EK GGV   E Q++NF
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI---KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNF

Query:  RSALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECH
        R A+DDCG++DL Y G  +TW+NR+GES+  + RLDR +ANE +    P  SV H ++A SDH P+ +  T      +  +  HR FRFE +W+ +  C 
Subjt:  RSALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECH

Query:  SLITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDK
         LI    +  S   G   L    + V   LK W K    N++  +++ K  L Q     P   + Q        ++K L +EE  W Q+S   W++ GD+
Subjt:  SLITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDK

Query:  NTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGP
        N+++FH  AS +K++NTI  +  +  +W   R  +E+    YF  +FSS+     + + A + ++ KVT  MN++  + F   E+  A+ QMHP+KAPGP
Subjt:  NTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGP

Query:  DGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFD
        DG P LFYQKYW+ +G +     L+ LN   F K  N + I L+PK   P +V DFRPISLCNV YK+++K I NR+K +L  +IS +QSAFVPGR I D
Subjt:  DGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFD

Query:  NIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCS
        N+++ +E +H +R R+ G+KG++++KLDMSKAYDRVEW F+ ++M+ + F  RW+ LIM CVT+  FS+++NG PT  I P RGLRQGDPLSPYLFLLC+
Subjt:  NIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCS

Query:  EALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLG
        E L ++L  A    ++ GI+  +  P I+HL FADDS++FC+A V    EL+ +L +YELASGQ +N+ K++M FS NV   +++ I+N+ G   ++   
Subjt:  EALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLG

Query:  RYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
        +YLG+    +R +   F  +K RVW+ LQ WK K  S GGKE L+K+VA AIPT+ MSCF LP  L  ++  L+ARFWWG TE  +++HW  W++LC
Subjt:  RYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein2.2e-20044.01Show/hide
Query:  DRSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD-SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR
        +  GGLCL WK  +N+ ++SF+  HIDA +  + + +WR TG YG P    R+ +W LLRRL +     W   GDFN     EEK+G +   E+Q+Q FR
Subjt:  DRSGGLCLMWKDDINVSIRSFTFYHIDAAIKWD-SKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR

Query:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS
         A+DDCG  DL Y G +FTWTN +G  D T ERLDR +A  E+  LFP   V HL+   SDH PI++          L     ++FRFEE+W     C  
Subjt:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS

Query:  LITRTGQWGSHHTG------FSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWL
          T    W S   G      + ++  C K    GL++W      ++++ I + ++ L +A  +  R  D    ++++  L   L +EE  W+Q+S   WL
Subjt:  LITRTGQWGSHHTG------FSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSML-QAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWL

Query:  KWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPS
        + GDKNT++FH  A+ ++RRN I  +    G W+  + Q+ Q F  ++ ++F+S NP+    +  ++ I R VT EMN    + F   E++ AVKQM P 
Subjt:  KWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPS

Query:  KAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPG
        K+PGPDGFP +FYQKYW  +GE      L  LN  + +KA N T+ITL+PKV  P+ V DFRPISLCNV YKII+KV+ NR+K IL  I+SE+QSAFVPG
Subjt:  KAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPG

Query:  RSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYL
        R I DNI++  E LH +  +++G+ G VA+KLDMSKAYDRVEWK++ ++M  + FH +WV ++M+C++T  +S+LING P   I P RGLRQGDPLSPYL
Subjt:  RSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYL

Query:  FLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQV
        FL C+E L +LL  A +   + G+   +C PK++HLFFADDSL+FC+A+ +++  ++ +L  YE ASGQ +N  K+T+FFS +    ++++I+ ++G+ V
Subjt:  FLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQV

Query:  VENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWES
        +    +YLG+ S   R++   F  +K+RVW  L+GWK K  S  GKE LIKSVAQAIPT+ MSCF LP  L  +I  L+ RFWWG    + KMHW  W+S
Subjt:  VENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWES

Query:  LC
        LC
Subjt:  LC

A0A2N9EWI8 Uncharacterized protein4.0e-19442.75Show/hide
Query:  GGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSAL
        GGL L+W D ++V ++S++ +HID+ +   S + WRFTG YGHP    R ++W+LLRRL    +  W++ GDFN  +  +EK+G +    +Q+  FR AL
Subjt:  GGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSAL

Query:  DDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSLIT
        +DC L DL + G  FTWTN +   +  +ERLDR +A E++  LFP   + H+  A SDH  ++L + + +   +    K R F FE  W  +  C   I+
Subjt:  DDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSLIT

Query:  RTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANI-RRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQW
        +  +     T   RL   +K    GL  W K  L  I +  + K K + + +       ++   R + G L   L++EEIYW+Q+S   WL+ GD+NT +
Subjt:  RTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANI-RRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQW

Query:  FHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFP
        FH  AS +K+ NTI GI      W  +  +I      YF  I+++T+P     D  ++++ + V+ +MN + ++ F R E+  A+ QM PSKAPGPDG  
Subjt:  FHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFP

Query:  TLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIII
         LF+QK+W  VG       LD LN+   +K+ N T+I L+PKV  P+ ++ FRPISLCNV YKII+KV+VNRMK IL  ++S++QSAFVPGR I DNI+I
Subjt:  TLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIII

Query:  GHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALS
          E +H +++++ G+   +A KLDMSKAY+RVEW ++ K+ML L FH +WVALIM+CVT+  +S+L+NG P   + P RGLRQGDPLSPYLFL+C+E LS
Subjt:  GHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALS

Query:  ALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLG
        ALL  A   + + GI   +  P++SHLFFADDSL+FC+A+    + L+ +L  YE ASGQ +N  K+ +FFS N    ++  I  + G        +YLG
Subjt:  ALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLG

Query:  VSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
        +     RS+++ F  +K R+W+ LQGWK KF S  GKE LIK+V QAIPT+ MSCF LP+ LCD+I  +  RFWWG    +RK+HW   + LC
Subjt:  VSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

A0A2N9GLU2 Reverse transcriptase domain-containing protein6.8e-19441.79Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS
        +SGGL + W   + VSI S++ +HIDA + + S  +WRFTG YG P    ++  W +LR L +     W+  GDFN  L   EK G  P  E Q+  FR 
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS

Query:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL
         +DDCG  DL + G  +TW N Q    +  ERLDR +A  ++   FPNC + HL++  SDH  +  +      S+   + + R FRFEE+W     C   
Subjt:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL

Query:  ITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDIS-KYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNT
        I +  +     T   R+   +K     L  W K    ++R  I  K + + +   + P   + Q IR I   L     +EE  WKQ+S   WL+ GD+NT
Subjt:  ITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDIS-KYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNT

Query:  QWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDG
        ++FH  A+ ++RRN I GI+ ++G W  +  +IE T   Y+ ++F+S NP  G  D  L  + R V++EMND+ +  F   E+ +A+ QM P KAPGPDG
Subjt:  QWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDG

Query:  FPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNI
           +FYQKYW  VG       L  L     ++  N TNI L+PK+  P    DFRPISLCNV YKI+AKV+ NR+K +L  +ISE QSAFVPGR I DNI
Subjt:  FPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNI

Query:  IIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEA
        +I  E LH +   ++G++G++A+KLDMSKAYDRVEW F+ K+M  + FH +WVAL+M+CV +  +S+LING P     P RGLRQGDP+SPYLFLLC+E 
Subjt:  IIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEA

Query:  LSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRY
        L ALLS A   + L G+   +  PK++HLFFADDS++FC+A++ +   +  +L++YE ASGQ +N  K+T+FFS +     RD IK  + + V+++   Y
Subjt:  LSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRY

Query:  LGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLCAPDNC
        LG+ S   RS+   F  +K+ +W+ +QGWK K  +  GKE LIK+V QAIPT+ M CF LP  LC D+  ++  FWWG  +  RK+HW KW SLC P  C
Subjt:  LGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLCAPDNC

Query:  PKDYAAERLEGANSVLQQNWEQKLPHHS
                L   N  L      +L H++
Subjt:  PKDYAAERLEGANSVLQQNWEQKLPHHS

A0A2N9HYE3 Reverse transcriptase domain-containing protein2.6e-19342.77Show/hide
Query:  DRSGGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR
        ++ GGLCL WK D+ +S++SF+  HIDA +  +   +WRFTG YG P    R+ +W LLRRL+ Q +  W   GDFN  +  EEK+G     ESQ+Q FR
Subjt:  DRSGGLCLMWKDDINVSIRSFTFYHIDAAIKWDS-KSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFR

Query:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS
          LD+CG  DL + G  FTWTN +   D T ERLDR +A  ++   FP+  V+HL    SDH PI +   + +        K + FRFEEVW +   C +
Subjt:  SALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHS

Query:  LITRTGQWGSHHTG------FSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQ-AHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWL
        +I     W    TG      + ++  C +    GL+ W +    NI   I + + +L+ A  +  +  D   +  ++  L   L +EE  W+Q+S   WL
Subjt:  LITRTGQWGSHHTG------FSRLELCLKNVSTGLKQWGKGPLANIRRDISKYKSMLQ-AHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWL

Query:  KWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPS
          GD+NT++FH  A+ +KR+N ++ +   +G+W   + Q+   F EY+ ++F + NP     +  ++DIQ  VT EMN + +  F   E+  A+KQM P 
Subjt:  KWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPS

Query:  KAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPG
        KAPGPD  P +FYQKYW  +G       L  LN  + +KA N T+ITL+PKV  P++V +FRPISLCNV YK+I+KV+ NR+K +L  I+ E+QSAF+PG
Subjt:  KAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPG

Query:  RSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYL
        R I DNI++  E LH ++ ++ G+ G +A+KLDMSKAYDRVEW+++  +M  + FH +WV L+M+C++T  +S+L+NG P   I P RGLRQGDPLSPYL
Subjt:  RSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYL

Query:  FLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQV
        FLLC+E L +L+        L G+   +  PKI+HLFFADDSL+FC+A+ D +  ++ +L++YE ASGQ VN  K+T+FFS +     + +I+N++G+  
Subjt:  FLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQV

Query:  VENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWES
        ++   RYLG+ S   R++   F  +K+RVW  L+GWK K  S  G+E LIKSVAQAIP + MSCF LP+ L  +I  L+ RFWWG    K KMHW  W +
Subjt:  VENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWES

Query:  LC
        LC
Subjt:  LC

A0A2N9J7Z5 Reverse transcriptase domain-containing protein7.7e-19843.52Show/hide
Query:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI-KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS
        R GGLCL W+D++N+SIRSF+  HIDA I   D+  WRFTG YG P+  HR+ +W LLR L++Q    W+  GDFN      EK+G +P  E Q++ FR 
Subjt:  RSGGLCLMWKDDINVSIRSFTFYHIDAAI-KWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRS

Query:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL
        ALD+C L DL Y G  +TW N + ++     RLDR +A+ ++   F    V HL   +SDHCP+++   +     ++     + FRFE++W     C   
Subjt:  ALDDCGLQDLDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSL

Query:  ITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDI-SKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNT
        +  T +      G  RL    K     L  W K    ++RR++ +K   + QA     +  D    + ++  + + ++++E  W+Q+S   WLK GD+N+
Subjt:  ITRTGQWGSHHTGFSRLELCLKNVSTGLKQWGKGPLANIRRDI-SKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNT

Query:  QWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDG
        ++FH  A+ ++RRN I  I    G   TD   I   F  YF N+F ++NP+   F+  L  + R +T+EMND  +  F   E+  A+ QM P KAPGPDG
Subjt:  QWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDG

Query:  FPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNI
         P LFY +YW  +GE  I   L  L+  R     N T++TL+PKV  P+++S+FRPISLCNV YKII+KVI NR+K IL  IISE QSAFVPGR I DN+
Subjt:  FPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNI

Query:  IIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEA
        ++  E LH +++ Q GR   +A+KLDMSKAYDRVEW F+ K+M  + F+ +W+ L+M+CV T  +S+L+NG P   I P RGLRQGDPLSPYLFL+C+E 
Subjt:  IIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEA

Query:  LSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRY
        L AL++ A     + G+   +  PKI+HLFFADDSL+FC+A+ ++  +++++L+ YE ASGQ +N  K+T+FFS N   E ++ +KN++G+  +    +Y
Subjt:  LSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRY

Query:  LGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC
        LG+ S   RS++  F  +K+RVWQ LQGWK K  S  GKE LIK+V QA+PT+ M CF LP +LC DI  ++ +F+WG T  KR++HW KWE LC
Subjt:  LGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESLC

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-4322.83Show/hide
Query:  PNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSALDDCGLQDL-DYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQ
        PN G      ++L  L    +S  +I GDFN  L   ++     V +   Q   SAL    L D+   L    T          T  ++D  + ++    
Subjt:  PNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSALDDCGLQDL-DYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQ

Query:  LFPNCSVTHLNLAN--SDHCPIMLQ-ATSDLDSIELGKHKHRIFRFEEVWATQPECHSLITRTGQWGSHHTGFSRLELCLKNVSTG--------LKQWGK
        L   C  T + + N  SDH  I L+    +L        K       + W        +        +  T +  L    K V  G         ++  +
Subjt:  LFPNCSVTHLNLAN--SDHCPIMLQ-ATSDLDSIELGKHKHRIFRFEEVWATQPECHSLITRTGQWGSHHTGFSRLELCLKNVSTG--------LKQWGK

Query:  GPLANIRRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENW-LKWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQI
          +  +   + + +   Q H    R  +   IR       K +E ++   K     +W  +  +K  +   +    K+ +N I  I  D G+  TD  +I
Subjt:  GPLANIRRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENW-LKWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQI

Query:  EQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDR----GELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKR
        + T  EY+ +++++   +L   D  L          +N +++ES +R     E++  +  +   K+PGPDGF   FYQ+Y  E+    +     I  +  
Subjt:  EQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDR----GELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKR

Query:  FVKAWNETNITLLPKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQKGR-KGWVAMKLDM
           ++ E +I L+PK  +   +  +FRPISL N+  KI+ K++ NR++  ++ +I  +Q  F+PG   + NI    + ++ I+   + + K  V + +D 
Subjt:  FVKAWNETNITLLPKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQKGR-KGWVAMKLDM

Query:  SKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKIS
         KA+D+++  F+ K +  L     ++ +I         ++++NG    +     G RQG PLSP LF +  E L+  +      K + GI+ GK   K+S
Subjt:  SKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKIS

Query:  HLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGVSSSFTRSQ----REDFKGVKQRVW
           FADD +V+ +  +   + L  +++ +   SG  +NV KS  F   N   +    I   +   +     +YLG+    TR      +E++K + + + 
Subjt:  HLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGVSSSFTRSQ----REDFKGVKQRVW

Query:  QTLQGWKGKFFSMGGKETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVK
        +    WK    S  G+  ++K   + + I  F      LP T   ++ K   +F W     +
Subjt:  QTLQGWKGKFFSMGGKETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVK

P08548 LINE-1 reverse transcriptase homolog1.1e-4225.51Show/hide
Query:  KSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQWFHKSASMKKR-RNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFS
        K + +  +S P+P   + I  I   L++ +E + I  +    ++W             + + KKR ++ IS I   N E  TD  +I++   EY+  ++S
Subjt:  KSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQWFHKSASMKKR-RNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFS

Query:  STNPHLGYFDVALQDIQRKVTDEMNDKQMESFDR----GELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLL
            +L   D   Q ++      ++ K++E  +R     E+   ++ +   K+PGPDGF + FYQ +  E+    +    +I  +      + E NITL+
Subjt:  STNPHLGYFDVALQDIQRKVTDEMNDKQMESFDR----GELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLL

Query:  PKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQK-GRKGWVAMKLDMSKAYDRVEWKFIN
        PK  + P +  ++RPISL N+  KI+ K++ NR++  ++ II  +Q  F+PG   + NI    + ++ I+   K   K  + + +D  KA+D ++  F+ 
Subjt:  PKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQK-GRKGWVAMKLDMSKAYDRVEWKFIN

Query:  KLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQ
        + +  +     ++ LI    +    ++++NG    S     G RQG PLSP LF +  E L+  +      K + GI  G    K+S   FADD +V+ +
Subjt:  KLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQ

Query:  ASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGV--SSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGG
         + D   +L  V+ +Y   SG  +N  KS  F   N   +    +K+ +   VV    +YLGV  +       +E+++ +++ + + +  WK    S  G
Subjt:  ASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGV--SSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGG

Query:  KETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWW
        +  ++K   + +AI  F       P +   D+ K++  F W
Subjt:  KETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWW

P11369 LINE-1 retrotransposable element ORF2 protein6.4e-4027Show/hide
Query:  ISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQ-RKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVG
        I+ I  + G+  TD  +I+ T   ++  ++S+   +L   D  L   Q  K+  +  D         E+   +  +   K+PGPDGF   FYQ +  ++ 
Subjt:  ISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQ-RKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVG

Query:  ETTILNCL--DILNQKRFVKAWNETNITLLPKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIR
           IL+ L   I  +     ++ E  ITL+PK  + P ++ +FRPISL N+  KI+ K++ NR++  ++ II  +Q  F+PG   + NI      +H I 
Subjt:  ETTILNCL--DILNQKRFVKAWNETNITLLPKVNQ-PKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIR

Query:  SRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISR
          +   K  + + LD  KA+D+++  F+ K++        ++ +I    +    ++ +NG    +I    G RQG PLSPYLF +  E L+  +     +
Subjt:  SRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISR

Query:  KILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGVSSSFTRSQ
        K + GI+ GK   KIS L  ADD +V+     +   EL +++N +    G  +N  KS M F     ++    I+      +V N  +YLGV  + T+  
Subjt:  KILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGVSSSFTRSQ

Query:  RE----DFKGVKQRVWQTLQGWKGKFFSMGGKETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGS
        ++    +FK +K+ + + L+ WK    S  G+  ++K   + +AI  F      +P+   +++   + +F W +
Subjt:  RE----DFKGVKQRVWQTLQGWKGKFFSMGGKETLIKS--VAQAIPTFIMSCFHLPSTLCDDIHKLVARFWWGS

P14381 Transposon TX1 uncharacterized 149 kDa protein6.6e-3726.39Show/hide
Query:  DKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAP
        D+ +++F+     K  R  I+ + A++G  + D   I      ++ N+F S +P        L D    V++   ++        EL +A++ M  +K+P
Subjt:  DKNTQWFHKSASMKKRRNTISGIIADNGEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAP

Query:  GPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSI
        G DG    F+Q +W  +G        +   +     +     ++LLPK    + + ++RP+SL +  YKI+AK I  R+K +L ++I  +QS  VPGR+I
Subjt:  GPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSI

Query:  FDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLL
        FDN+ +  + LH  R   +       + LD  KA+DRV+ +++   +    F P++V  +     +A+  + IN S T+ +   RG+RQG PLS  L+ L
Subjt:  FDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLL

Query:  CSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEE-VRDNIKNI-MGMQVV
          E    LL     RK LTG+   +   ++    +ADD ++  Q  VD +E  +     Y  AS   +N +KS+     ++  + +    ++I    +++
Subjt:  CSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELASGQVVNVTKSTMFFSPNVGEE-VRDNIKNI-MGMQVV

Query:  ENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKG--KFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWW
        + LG YL  +  +  SQ  +F  +++ V   L  WKG  K  SM G+  +I  +  +   + + C          I + +  F W
Subjt:  ENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKG--KFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHKLVARFWW

P92555 Uncharacterized mitochondrial protein AtMg012504.6e-1455.88Show/hide
Query:  LINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDS
        +ING+P   +TP RGLRQGDPLSPYLF+LC+E LS L   A  +  L GI+     P+I+HL FADD+
Subjt:  LINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.0e-3225.78Show/hide
Query:  SWRFTGIYGHPNAGHRDHTW--KLLRRLHNQDESAWVIGGDFNATLL----YEEKEGGVPVRESQIQNFRSALDDCGLQDLDYLGDTFTWTNRQGESDQT
        SWR    Y     G     W   +   +  + +   ++ GDF+        Y   +  +P+R   ++ F++ L D  L D+   G  +TW+N Q + +  
Subjt:  SWRFTGIYGHPNAGHRDHTW--KLLRRLHNQDESAWVIGGDFNATLL----YEEKEGGVPVRESQIQNFRSALDDCGLQDLDYLGDTFTWTNRQGESDQT

Query:  NERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSLITRTGQW------GSHHTGFSRLELCLKN
          +LDR IAN +++  FP+          SDH P ++   +      L K   + FR+    +T P    L++ T  W      GSH           K 
Subjt:  NERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSLITRTGQW------GSHHTGFSRLELCLKN

Query:  VSTGLKQWGKGPLANIRRD-ISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQWFHKSASMKKRRNTISGIIADN
            L + G G + +  ++ +   +S+     + P    F+   +     +      E +++Q+S   WL+ GD NT++FHK     + +N I  +  D+
Subjt:  VSTGLKQWGKGPLANIRRD-ISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQWFHKSASMKKRRNTISGIIADN

Query:  GEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDV-ALQDIQR-KVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNC
           V +  Q+++    Y+T++  S +  L    V  ++DI   +  D +  +        E+  AV  M  +KAPGPD F   F+ + W  V ++TI   
Subjt:  GEWVTDRVQIEQTFTEYFTNIFSSTNPHLGYFDV-ALQDIQR-KVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNC

Query:  LDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKII
         +       +K +N T ITL+PKV    Q+S FRP+S C V YKII
Subjt:  LDILNQKRFVKAWNETNITLLPKVNQPKQVSDFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-1431.72Show/hide
Query:  IVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLIN
        +V R+K ++ ++I   Q++F+PGR   DNI+   E +H++R R+KG KGW+ +KLD+ KAYDR+ W ++   +++  F   W+  I      A+      
Subjt:  IVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHPRWVALIMDCVTTAKFSLLIN

Query:  GSPTSS----ITPHR-GLRQGDPLSPYL--FLLCSEALSALLSGA
        G   +S    ++ HR G R  D  +P+    + C+E L  +  G+
Subjt:  GSPTSS----ITPHR-GLRQGDPLSPYL--FLLCSEALSALLSGA

AT4G29090.1 Ribonuclease H-like superfamily protein5.1e-0845.65Show/hide
Query:  AIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESL
        A+PT+ M+CF LP T+C  I  ++A FWW + +  + MHWK W+ L
Subjt:  AIPTFIMSCFHLPSTLCDDIHKLVARFWWGSTEVKRKMHWKKWESL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.3e-1555.88Show/hide
Query:  LINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDS
        +ING+P   +TP RGLRQGDPLSPYLF+LC+E LS L   A  +  L GI+     P+I+HL FADD+
Subjt:  LINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGAAGCGGGGGATTATGTCTGATGTGGAAAGATGATATCAATGTTTCTATTCGTTCGTTCACTTTTTACCACATTGACGCGGCAATAAAATGGGACTCTAAATC
TTGGCGATTTACTGGGATATATGGCCATCCGAATGCTGGTCATAGAGATCACACTTGGAAATTACTTCGAAGGTTACATAACCAAGATGAGTCGGCTTGGGTCATAGGTG
GAGATTTCAATGCGACACTATTATATGAGGAAAAAGAAGGAGGCGTGCCGGTTAGGGAATCTCAAATTCAGAATTTTAGATCGGCTCTAGATGATTGTGGATTGCAAGAT
CTTGATTACTTAGGCGATACATTCACGTGGACAAATCGACAAGGGGAATCAGACCAAACTAATGAAAGGCTGGATAGGTTTATTGCAAACGAGGAATATTGGCAGTTATT
CCCTAATTGCTCAGTCACGCACTTAAATTTGGCTAATTCTGACCATTGCCCAATAATGTTACAAGCAACATCAGATTTGGATTCTATTGAACTTGGCAAGCATAAACATA
GAATTTTCCGCTTTGAAGAAGTTTGGGCCACTCAACCAGAGTGTCACAGTCTAATCACCCGGACCGGCCAGTGGGGGTCTCATCATACAGGTTTTTCTCGTCTAGAACTC
TGTTTAAAAAATGTGTCGACAGGTTTGAAACAATGGGGCAAAGGACCTTTGGCTAACATTCGTAGGGACATCTCGAAGTATAAATCCATGCTTCAGGCACACTATAGCAA
ACCAAGACCGTGGGATTTTCAATCAATCCGTATCATAGAGGGTCTTCTAGATAAGGCGTTGGAACAGGAGGAAATATATTGGAAGCAGCAGTCGCACGAAAATTGGCTGA
AATGGGGTGATAAAAATACCCAATGGTTCCATAAAAGTGCTTCAATGAAAAAACGCAGAAATACAATCTCGGGAATCATTGCTGATAATGGAGAATGGGTTACAGACAGA
GTACAAATAGAACAAACGTTCACAGAATACTTCACAAATATCTTTTCCTCCACTAATCCCCATTTAGGCTATTTTGATGTTGCATTACAGGACATACAACGGAAGGTGAC
AGATGAGATGAACGATAAACAGATGGAGTCGTTTGACAGAGGAGAGTTGATTAGAGCAGTAAAGCAAATGCATCCATCTAAGGCTCCTGGACCAGATGGATTCCCAACTC
TTTTCTACCAAAAATATTGGACTGAGGTTGGTGAAACAACTATTTTAAATTGCCTTGATATTTTGAATCAAAAAAGGTTTGTCAAAGCTTGGAATGAGACGAATATTACC
CTCTTACCAAAAGTAAACCAACCGAAACAGGTGTCGGATTTCAGACCCATAAGCCTGTGTAATGTCTCATATAAAATAATTGCCAAGGTGATAGTTAATCGCATGAAATG
GATTCTTCAGGATATAATATCAGAAAATCAATCAGCTTTTGTTCCAGGGAGATCTATTTTTGATAACATTATCATTGGCCATGAATGTCTTCATACAATTAGATCGAGAC
AAAAAGGTCGAAAAGGATGGGTTGCTATGAAATTGGATATGAGTAAAGCATATGATAGGGTTGAGTGGAAATTCATTAATAAGCTTATGCTGAATTTAGTTTTCCATCCT
AGATGGGTTGCCCTCATTATGGATTGTGTCACTACTGCAAAATTTTCTCTATTGATTAATGGCTCACCCACTAGCTCTATCACACCTCATCGTGGTCTGCGTCAAGGTGA
CCCTCTTTCTCCTTATTTATTCCTGCTTTGCTCAGAAGCTTTATCTGCGTTATTATCTGGGGCGATTTCTCGCAAAATTCTTACAGGTATTAAAGCGGGTAAATGTTGCC
CAAAAATCTCTCACCTGTTTTTTGCGGACGACAGTCTTGTTTTCTGTCAAGCGTCTGTGGACCAAATTGAAGAGTTGCGATCTGTGCTAAACAAATATGAATTAGCCTCG
GGGCAAGTTGTCAACGTAACTAAATCAACGATGTTCTTTTCGCCTAATGTGGGAGAGGAGGTCCGTGATAATATAAAAAATATTATGGGCATGCAAGTTGTTGAGAATTT
AGGACGATATCTAGGGGTATCGTCATCCTTCACAAGAAGCCAGAGAGAGGATTTTAAGGGGGTTAAACAAAGAGTGTGGCAAACCTTGCAAGGTTGGAAAGGGAAATTTT
TCTCTATGGGTGGAAAGGAGACTCTCATTAAAAGTGTAGCTCAAGCTATACCCACATTTATAATGAGTTGTTTCCACCTTCCAAGTACTCTCTGTGATGACATTCATAAA
TTGGTGGCGAGATTCTGGTGGGGCTCTACAGAAGTTAAAAGGAAAATGCATTGGAAAAAATGGGAGAGCCTGTGTGCGCCTGATAACTGCCCAAAAGATTATGCTGCTGA
GCGACTGGAGGGAGCAAATTCTGTGCTGCAACAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTGATGAACCGACTTCTGTTGAGTAATTTTCGTG
ATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAAGTTGACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGAAGCGGGGGATTATGTCTGATGTGGAAAGATGATATCAATGTTTCTATTCGTTCGTTCACTTTTTACCACATTGACGCGGCAATAAAATGGGACTCTAAATC
TTGGCGATTTACTGGGATATATGGCCATCCGAATGCTGGTCATAGAGATCACACTTGGAAATTACTTCGAAGGTTACATAACCAAGATGAGTCGGCTTGGGTCATAGGTG
GAGATTTCAATGCGACACTATTATATGAGGAAAAAGAAGGAGGCGTGCCGGTTAGGGAATCTCAAATTCAGAATTTTAGATCGGCTCTAGATGATTGTGGATTGCAAGAT
CTTGATTACTTAGGCGATACATTCACGTGGACAAATCGACAAGGGGAATCAGACCAAACTAATGAAAGGCTGGATAGGTTTATTGCAAACGAGGAATATTGGCAGTTATT
CCCTAATTGCTCAGTCACGCACTTAAATTTGGCTAATTCTGACCATTGCCCAATAATGTTACAAGCAACATCAGATTTGGATTCTATTGAACTTGGCAAGCATAAACATA
GAATTTTCCGCTTTGAAGAAGTTTGGGCCACTCAACCAGAGTGTCACAGTCTAATCACCCGGACCGGCCAGTGGGGGTCTCATCATACAGGTTTTTCTCGTCTAGAACTC
TGTTTAAAAAATGTGTCGACAGGTTTGAAACAATGGGGCAAAGGACCTTTGGCTAACATTCGTAGGGACATCTCGAAGTATAAATCCATGCTTCAGGCACACTATAGCAA
ACCAAGACCGTGGGATTTTCAATCAATCCGTATCATAGAGGGTCTTCTAGATAAGGCGTTGGAACAGGAGGAAATATATTGGAAGCAGCAGTCGCACGAAAATTGGCTGA
AATGGGGTGATAAAAATACCCAATGGTTCCATAAAAGTGCTTCAATGAAAAAACGCAGAAATACAATCTCGGGAATCATTGCTGATAATGGAGAATGGGTTACAGACAGA
GTACAAATAGAACAAACGTTCACAGAATACTTCACAAATATCTTTTCCTCCACTAATCCCCATTTAGGCTATTTTGATGTTGCATTACAGGACATACAACGGAAGGTGAC
AGATGAGATGAACGATAAACAGATGGAGTCGTTTGACAGAGGAGAGTTGATTAGAGCAGTAAAGCAAATGCATCCATCTAAGGCTCCTGGACCAGATGGATTCCCAACTC
TTTTCTACCAAAAATATTGGACTGAGGTTGGTGAAACAACTATTTTAAATTGCCTTGATATTTTGAATCAAAAAAGGTTTGTCAAAGCTTGGAATGAGACGAATATTACC
CTCTTACCAAAAGTAAACCAACCGAAACAGGTGTCGGATTTCAGACCCATAAGCCTGTGTAATGTCTCATATAAAATAATTGCCAAGGTGATAGTTAATCGCATGAAATG
GATTCTTCAGGATATAATATCAGAAAATCAATCAGCTTTTGTTCCAGGGAGATCTATTTTTGATAACATTATCATTGGCCATGAATGTCTTCATACAATTAGATCGAGAC
AAAAAGGTCGAAAAGGATGGGTTGCTATGAAATTGGATATGAGTAAAGCATATGATAGGGTTGAGTGGAAATTCATTAATAAGCTTATGCTGAATTTAGTTTTCCATCCT
AGATGGGTTGCCCTCATTATGGATTGTGTCACTACTGCAAAATTTTCTCTATTGATTAATGGCTCACCCACTAGCTCTATCACACCTCATCGTGGTCTGCGTCAAGGTGA
CCCTCTTTCTCCTTATTTATTCCTGCTTTGCTCAGAAGCTTTATCTGCGTTATTATCTGGGGCGATTTCTCGCAAAATTCTTACAGGTATTAAAGCGGGTAAATGTTGCC
CAAAAATCTCTCACCTGTTTTTTGCGGACGACAGTCTTGTTTTCTGTCAAGCGTCTGTGGACCAAATTGAAGAGTTGCGATCTGTGCTAAACAAATATGAATTAGCCTCG
GGGCAAGTTGTCAACGTAACTAAATCAACGATGTTCTTTTCGCCTAATGTGGGAGAGGAGGTCCGTGATAATATAAAAAATATTATGGGCATGCAAGTTGTTGAGAATTT
AGGACGATATCTAGGGGTATCGTCATCCTTCACAAGAAGCCAGAGAGAGGATTTTAAGGGGGTTAAACAAAGAGTGTGGCAAACCTTGCAAGGTTGGAAAGGGAAATTTT
TCTCTATGGGTGGAAAGGAGACTCTCATTAAAAGTGTAGCTCAAGCTATACCCACATTTATAATGAGTTGTTTCCACCTTCCAAGTACTCTCTGTGATGACATTCATAAA
TTGGTGGCGAGATTCTGGTGGGGCTCTACAGAAGTTAAAAGGAAAATGCATTGGAAAAAATGGGAGAGCCTGTGTGCGCCTGATAACTGCCCAAAAGATTATGCTGCTGA
GCGACTGGAGGGAGCAAATTCTGTGCTGCAACAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTGATGAACCGACTTCTGTTGAGTAATTTTCGTG
ATAAAGGAGCAAGGAGAGCCCTACACGTGTCCAAGTTGACCTAA
Protein sequenceShow/hide protein sequence
MDRSGGLCLMWKDDINVSIRSFTFYHIDAAIKWDSKSWRFTGIYGHPNAGHRDHTWKLLRRLHNQDESAWVIGGDFNATLLYEEKEGGVPVRESQIQNFRSALDDCGLQD
LDYLGDTFTWTNRQGESDQTNERLDRFIANEEYWQLFPNCSVTHLNLANSDHCPIMLQATSDLDSIELGKHKHRIFRFEEVWATQPECHSLITRTGQWGSHHTGFSRLEL
CLKNVSTGLKQWGKGPLANIRRDISKYKSMLQAHYSKPRPWDFQSIRIIEGLLDKALEQEEIYWKQQSHENWLKWGDKNTQWFHKSASMKKRRNTISGIIADNGEWVTDR
VQIEQTFTEYFTNIFSSTNPHLGYFDVALQDIQRKVTDEMNDKQMESFDRGELIRAVKQMHPSKAPGPDGFPTLFYQKYWTEVGETTILNCLDILNQKRFVKAWNETNIT
LLPKVNQPKQVSDFRPISLCNVSYKIIAKVIVNRMKWILQDIISENQSAFVPGRSIFDNIIIGHECLHTIRSRQKGRKGWVAMKLDMSKAYDRVEWKFINKLMLNLVFHP
RWVALIMDCVTTAKFSLLINGSPTSSITPHRGLRQGDPLSPYLFLLCSEALSALLSGAISRKILTGIKAGKCCPKISHLFFADDSLVFCQASVDQIEELRSVLNKYELAS
GQVVNVTKSTMFFSPNVGEEVRDNIKNIMGMQVVENLGRYLGVSSSFTRSQREDFKGVKQRVWQTLQGWKGKFFSMGGKETLIKSVAQAIPTFIMSCFHLPSTLCDDIHK
LVARFWWGSTEVKRKMHWKKWESLCAPDNCPKDYAAERLEGANSVLQQNWEQKLPHHSSLANLMNRLLLSNFRDKGARRALHVSKLT