; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032358 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032358
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:31238178..31240190
RNA-Seq ExpressionLag0032358
SyntenyLag0032358
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.8e-11138.73Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M+C+ SV +SI VNG    S+ P RGLRQGDP+SPY+FL+CA+G SSLLN        +G+ I   CP I+HLFFADD+L+FC+A+ ++C+++ +IL++Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN++KS    S N   EK   +  +LG +       YLG+PS   +SK  +F ++KE+V + L GWK    S GG+E  IKA+ QAIPTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF++PK  C++I+    R                                 +  FN A+L KQ W+++ NPNSL+++I + RY+ HGD   A +G + S 
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIG-EDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV
        TWRSI  G ++  +G RWRVGNG+ I I ++ WL    +  ++  P  F +  +V +LI  E   WK+  +R+ FL  +AR IL+I L  N   D+IIWV
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIG-EDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV

Query:  ENSIGFFTVKSAYHLA---ISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF
         N  G F+VKSAY++A   I + E  E+SS DS  +  LW++ W LN+ P+V+I++WK   N  PT +NL+ KG++I  +C  C  + E+  +I   C+ 
Subjt:  ENSIGFFTVKSAYHLA---ISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF

Query:  SKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKD
        +K +W  +  N     DL   + D++    KI +     DL++  ++ W +W  RN+I  +    V +
Subjt:  SKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKD

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]1.6e-10339.39Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        MNCV SV +S+ +NG    ++ P RG+RQGDPLSP LFL+CAEGLS+L++        NGI I   CP I+HLFFADD+L+FC+A +++C ++  IL  Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN +KS    S N   E  + + NILG +     S YLG+PS   +SK+ +F ++K++V K L GWK    S GG+E  IKA+ QA+PTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDID-----------------------RAC----------ARISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF+LPK  C D++                       + C            I  FN A+L KQ W+IL NPNSL++++ + +YF   D L++  G N S 
Subjt:  CFKLPKKFCDDID-----------------------RAC----------ARISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV
         WRSI    D+  KG RWRVGNG+ I I  + WL    +  ++   V++ +   V SLI  D   WK   I  +FL  +A  IL I L  N   D +IW+
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV

Query:  ENSIGFFTVKSAYHLA---ISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF
         N  G FTVKSAY++A   + S E  E++S +S  +  LWKR W+L + P++KI++W+   N  PT  NL  +G+  +  C  C K  ET  + L  C+ 
Subjt:  ENSIGFFTVKSAYHLA---ISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF

Query:  SKDIWLSFFPN----LHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNR
        +K  W +F+ N    L TS D+   + D+I+       SL   DL++   + W +W  RN+
Subjt:  SKDIWLSFFPN----LHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNR

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]7.1e-10739.1Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        + C++SV +S  +NG+    V P RG+RQGDPLSPYLFLICAEGLS LL  EE      G++I+   PS+SHLFFADD+++FCRA+++  R+I   L  Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
          ASGQ IN EK +   S+N    +     ++LG+        YLG+PS + ++K  LF  + +K+ K L  WK   FS GGKE  +KA+VQAIPTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF+LP   C  I+   AR                                    FNQALL KQ W+IL+ PNSLLS +LR RYF++G++L A +G N SL
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVEN
        TWRS++WG++L LKG RWRVG+G+ I    +SWL    +           N  V  LI E   W    +  +F   D   +L+I L      D +IW ++
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVEN

Query:  SIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW
          G + VKS YH A+S +E  +++ S+S+E    W  FWKL L P+V+I+ WK  +   P    L  + +  +P C  C   +ET  + L+ C  +K +W
Subjt:  SIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW

Query:  -LSFFPNLHTSVDLFRASR-DVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRI
         LS F     S+D     R     T   +S SL   +L++ ++L W +W  RN I
Subjt:  -LSFFPNLHTSVDLFRASR-DVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRI

XP_030922958.1 uncharacterized protein LOC115949824 [Quercus lobata]1.3e-10340.99Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M C+ SV +S+ +NGS    + P RGLRQGDPLSPYLFL+CAEG ++L+N        NGI I    P ISHLFFADDNL+FC+A++ +C  + EIL++Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN EKS    S N   E+   + +ILG ++    + YLG+PS   RSK  +F +++EKVGK L  WK    S GGKE  IKA+ QAIP+Y MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDR-----------------------ACARISL----------FNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF LP   C+D++R                        C   SL          FN ALL KQ+W+I  NP+SL ++IL+ +YF + D L+  +G+N S 
Subjt:  CFKLPKKFCDDIDR-----------------------ACARISL----------FNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV
         WRSI    ++   G RWRVGNG  I I K+ WL    +  ++    +F +   V SLI +D   WK   +R  FL  +A  IL I L      D +IW+
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV

Query:  ENSIGFFTVKSAYHLAI---SSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF
         N  G FTVKSAYH+A    S  +  E+SS DS  +  +WK+ WK+ L P++KI++W+   N  P  V +  +G+ +   C  C K  E+  + L  C F
Subjt:  ENSIGFFTVKSAYHLAI---SSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF

Query:  SKDIW
        +  +W
Subjt:  SKDIW

XP_030958588.1 uncharacterized protein LOC115980487 [Quercus lobata]6.5e-10840.61Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        MNC+ SV +S+ +N     ++ P RG+RQGDPLSPYLFL+CAEGLS+L ++       NGI I   CPS++HLFFADD+L FC+AS ++C+ +  IL  Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN +KS    S N   E+ + + NILG +       YLG+PS   +SK+ +F ++KE+V K L GWK    S G +E  IKA+ QA+PTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDID---RACARISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHK
        CF+L K  C D++   R    I  FN A+L KQ W+IL NPN+L+++I + +YF H + L++  G+N S  WRSI    ++  +G RWRVGNG+ I I +
Subjt:  CFKLPKKFCDDID---RACARISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHK

Query:  ESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLA---ISSSEAKEASS
        + WL    S  ++    +F N   V SLI  D   WK   +++ FL  DA  IL I L  N   D +IW+ N  G FTVKSAYH+A   + S+E +E SS
Subjt:  ESWLNREGSRSILVTPVEFLN-SKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLA---ISSSEAKEASS

Query:  SDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIWLSFFPNLHTSVDLFRASRDVISTWE
        S+S   T L KR W   +  ++KI++W+F  N  PT  NL  +G+  +  C  C    E+T + L+ C  +K  W  +    +  VD+  ++RD +    
Subjt:  SDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIWLSFFPNLHTSVDLFRASRDVISTWE

Query:  KISNSLKDEDLKVVVILIWKLWEFRNR
           +     DL++   + W +W  RN+
Subjt:  KISNSLKDEDLKVVVILIWKLWEFRNR

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein1.7e-10635.85Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M CV S  +S+ VNG     +KP RGLRQGDPLSPYLFLICAEGLSSL+ + E    F G+ I    P ISHLFFADD+++FCRAS  DC  I+ +L +Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ +N +K+    SKN  + K   + ++ G   T     YLG+P    RSK   F  +KE++ K LQGWK    SQ G+ET IKA+VQAIPTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CFK P   C +I     R                                 + LFN+ALL +Q W++L++P SL+S+ L+ +YF H  FLD  I +N S 
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSK--VCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW
         WRS+   R +   G RWRVG G  I + K++WL    S  + ++PV  L+ +  V  LI +D  CW    +   FL +D   I  I L     +D +IW
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSK--VCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW

Query:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSL-EVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFS
        V    G FTV+SAYHL +        SSS  L E  ++W   W   + P+++++ W+   +I PT   L  +G+     C +C +  ETT ++LW C F+
Subjt:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSL-EVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFS

Query:  KDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKD-------QCISE-IETAISDHSRRDFTSTLDFS
        + +W +    +        +  D IST     + L + +L+++    W+LW  RNR+      +  D       + +S+ +E  + DH      S  +FS
Subjt:  KDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKD-------QCISE-IETAISDHSRRDFTSTLDFS

Query:  ----PE---------------SLQSQGVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAI
            PE               S+Q+ GVG+ IRD+   ++ A  + I R    L L+A+ +
Subjt:  ----PE---------------SLQSQGVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAI

A0A2N9F647 Uncharacterized protein2.4e-10839.18Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M CV SV +SI VNG P   VKP RGLRQGDPLSPYLFLICAEGL++LL + E  S   GI I    P +SHLFFADD+L+FCRA+  +C+++++IL +Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN  K+    S+N        + N  G   T     YLG+P    RSK   F ++K+++ + LQGWK    SQ GK   IKA++QAIPTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CFK P   C++I     R                                 +  FNQALL +Q W++LKNP SL+ + L+ +YF H  F++A I  N S 
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFL--NSKVCSLIG-EDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW
         WRSI   +++   G RWRVG+G+ I I K+ W+    +  I+ +P+  L  N+ V SLI      W    ++  FL +D   I  I L      D +IW
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFL--NSKVCSLIG-EDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW

Query:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSK
             G FTVKSAY++ +  S A EA SS S E++  WK  W   + P+VK+++W+   NI PT   L  KGL     C +C ++ ETT ++LW C+F++
Subjt:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSK

Query:  DIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILI---WKLWEFRNRI
         +W +      +SV L       +S  + ++   KD    +V I I   W LW+ RN +
Subjt:  DIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILI---WKLWEFRNRI

A0A2N9J936 Uncharacterized protein5.0e-10637.38Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M CV S  +S+ VNG P   +KP RGLRQGDPLSPYLFL+CAEGLS+L+ + E      GI I    P +SHLFFADD+++FCRAS+ D  ++  ILK+Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ IN EK+    SKN        + ++ G   +     YLG+P    RSK   F ++K+++ K LQGWK    SQ G+E  IKA+VQAIP Y MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CFKLP   CD+I     +                                 + LFN+ALL +Q W++L+ P+SL+ +IL+ +YF H  FL+A +  N S 
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSK--VCSLIGE-DGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW
         WRSI   R +   G RWRVGNG  I I K++WL    +  + ++P+   NS+  V SLI E D  W E ++   FL +D   I  I L      D++IW
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSK--VCSLIGE-DGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIW

Query:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTK-LWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFS
             G FTV+SAY L +  S     SSS+ L   + LW   W   + P+V+++ W+   +I PT   L  KGL  +  C++C  + ET+ ++LW C FS
Subjt:  VENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTK-LWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFS

Query:  KDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCISEIETAISDHSRRDFTSTLDFSPESLQSQG
        + IW++   N+ +SV +  + +D +     +   L   D+  +  + W++W  RNR         +++ +S     + D  R+  +S LDF    L   G
Subjt:  KDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCISEIETAISDHSRRDFTSTLDFSPESLQSQG

Query:  VG
        VG
Subjt:  VG

A0A7N2L6Z9 Reverse transcriptase domain-containing protein9.1e-10836.92Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        M C+ SV +S+ +NG    ++ P RGLRQGDPLSPYLFL+CAEGLS+LL+        NGI +   CP I+HLFFADD+L+FC+A++++C  ++EIL+ Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
        + ASGQ +N +KS    S N   E  + + NILG +     + YLG+PS   RSK  +F ++KE+VG  L GWK    S GGKE  IKA+ QAIPTY MS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDID-----------------------RACARISL----------FNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF LPK  CD+++                       + C   SL          FN ALL KQ W+IL NP SL ++IL+ +YF +GD L+A++G N S 
Subjt:  CFKLPKKFCDDID-----------------------RACARISL----------FNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTP-VEFLNSKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV
        TWRSI    ++  KG RWRVGNG+ I I  + WL    +  ++  P +      V SLI  D   WK   IR  FL  DA  IL I L  N   D IIW+
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTP-VEFLNSKVCSLIGED-GCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWV

Query:  ENSIGFFTVKSAYHLAIS---SSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF
         N  G F+VKSAY +A++   S+E  E SS D      LWK  WKL+L  +VKI++W+   N  PT  N+  +GL+ N  C  C ++ E   + L  C F
Subjt:  ENSIGFFTVKSAYHLAIS---SSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKF

Query:  SKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCISEIET-AISDHSRRDFTST--LDFSP---
        +  +W  +       + L   SRD+ +       +     L +   + W +W  RN       +   + C+S ++T  ++     D+     LDF P   
Subjt:  SKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCISEIET-AISDHSRRDFTST--LDFSP---

Query:  -----------------------ESLQSQGVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAIKRDLV
                               + + S GVG  IRD    ++ A  K +    P    E  AI++ L+
Subjt:  -----------------------ESLQSQGVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAIKRDLV

A0A803Q9W0 Uncharacterized protein7.7e-10735.08Show/hide
Query:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY
        MNC++S+ FSI +NG  S  + P RGLRQGDPLSPY+FL+C+EGLS L+   E  +  +G+R       +SHLFFADD+ +F  A+  DC+S++ IL  Y
Subjt:  MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVY

Query:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS
         + SGQ IN +KS     K I       L+ ILG+      + YLG+P+   + K  +F+ ++ K+   LQGWK S FSQ G+E  +KAI+QAIPTYIMS
Subjt:  KVASGQTINLEKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMS

Query:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL
        CF+LPK+   DI    AR                                 + LFNQ+LL KQ WKI+ NP+S+L+++L+  Y+ + +FL+A +G   S 
Subjt:  CFKLPKKFCDDIDRACAR---------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSL

Query:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVEN
         WRSILWGR +  KG RWRV  G+ + I+++ WL R  + S+         + + ++  EDG W   +I   F   D   IL IT GS +  D++IW   
Subjt:  TWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGCWKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVEN

Query:  SIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW
          G +TV S Y +  ++ +   A +S+  +  K W+  WK    P+V+ + W+  N   P N+NL ++G+D+NP+C +C +++ET  + LW C  +K+IW
Subjt:  SIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW

Query:  LSFFPNLHTSVDLFRASRD-VISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCIS----EIETAISDHSRRDFTSTLDFSPESLQSQ
         +F     T     + SR+  +     I   +  +D    +I+ W +W  RN+  +       ++ I     E    +S H   D  +T      +  S 
Subjt:  LSFFPNLHTSVDLFRASRD-VISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCIS----EIETAISDHSRRDFTSTLDFSPESLQSQ

Query:  GVGWTIRDSD
         +GW     D
Subjt:  GVGWTIRDSD

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.2e-1722.54Show/hide
Query:  MPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMSCFKLPKKFCDDID-----------------------RACA------
        MP    R     F ++ E+V   + GW+    S  G+ T  KA++ ++P + MS   LP+   + +D                       + C+      
Subjt:  MPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMSCFKLPKKFCDDID-----------------------RACA------

Query:  ----RISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDAN---IGHNHSLTWRSILWG-RDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSI
                 N+AL+ K  W++L+  NSL + +L+ +Y + G+  D+       + S TWRSI  G RD+   G  W  G+G+ I      W +R  S   
Subjt:  ----RISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDAN---IGHNHSLTWRSILWG-RDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSI

Query:  LVTPVEFLNSKVCSLIGEDGCWKEGE-----------IRNSFLDQDARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLAISSSEAKEASSSDSLEV
        L+          C  +     W  G              N+ L+  A  +L++  G+    D + W  +  G F+V+SAY +       +   +S     
Subjt:  LVTPVEFLNSKVCSLIGEDGCWKEGE-----------IRNSFLDQDARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLAISSSEAKEASSSDSLEV

Query:  TKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSL
           +   WK+ +  RVK + W   N    T      + L  + +C  C+   E+  ++L DC     IW+   P         +      S +E + ++L
Subjt:  TKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSL

Query:  KD----EDL---KVVVILIWKLWEFR
         D    ED+    +  ++IW  W++R
Subjt:  KD----EDL---KVVVILIWKLWEFR

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-1130.37Show/hide
Query:  SIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVYKVASGQTIN
        +I VNG   E++  + G RQG PLSPYLF I  E L+  + +++ +    GI+I      IS L  ADD +V+    K   R +  ++  +    G  IN
Subjt:  SIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVYKVASGQTIN

Query:  LEKSI-FMTSKNIGAEKLKGLSNILGIVHTKSISHYLG--MPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIV--QAIPTYIMSCFKL
          KS+ F+ +KN  AEK    +    IV T +I  YLG  +  +        FK LK+++ + L+ WK    S  G+   +K  +  +AI  +     K+
Subjt:  LEKSI-FMTSKNIGAEKLKGLSNILGIVHTKSISHYLG--MPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIV--QAIPTYIMSCFKL

Query:  PKKFCDDIDRACAR
        P +F ++++ A  +
Subjt:  PKKFCDDIDRACAR

P92555 Uncharacterized mitochondrial protein AtMg012505.1e-1556.06Show/hide
Query:  VNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADD
        +NG+P   V P RGLRQGDPLSPYLF++C E LS L  R +      GIR++N+ P I+HL FADD
Subjt:  VNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADD

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-1532.41Show/hide
Query:  AIPTYIMSCFKLPKKFCDDIDRACAR----------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLD
        A+P Y MSCF+L K  C  +  A                                     +  FNQALL KQ ++I+  P++LLS++LR RYF H   ++
Subjt:  AIPTYIMSCFKLPKKFCDDIDRACAR----------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLD

Query:  ANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNR
         ++G   S  WRSI+ GR+L  +G    +G+G    IH + WL+R
Subjt:  ANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNR

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein3.0e-1024.53Show/hide
Query:  VKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW------
        ++S Y +A      +E +       T++ +  WKL++ P++K + W+ +     TN  L S+ +D +PIC  C  ++ET  +I+++C +++ +W      
Subjt:  VKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCRKKDETTPYILWDCKFSKDIW------

Query:  --------LSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRN
                 SF  NL+  + L +          + +NSL   D  +   ++W+LW+ RN
Subjt:  --------LSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRN

AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-2327.21Show/hide
Query:  LRGRYFNHGDFLDANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGC---WKEGEIRNSFLD
        ++ RYF     LDA +    S  W S+L G  L  KG R  +G+G+ I I  ++ ++    R  L T   +    + +L    G    W + +I + F+D
Subjt:  LRGRYFNHGDFLDANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGC---WKEGEIRNSFLD

Query:  Q-DARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINP
        Q D   I  I L  +   D+IIW  N+ G +TV+S Y L          + +       L  R W L +MP++K + W+ L+    T   L ++G+ I+P
Subjt:  Q-DARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINP

Query:  ICVFCRKKDETTPYILWDCKFSKDIW----LSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRI
         C  C +++E+  + L+ C F+   W     S   N   S D      ++++  +    ++ D    + V LIW++W+ RN +
Subjt:  ICVFCRKKDETTPYILWDCKFSKDIW----LSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRI

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-3624.8Show/hide
Query:  AIPTYIMSCFKLPKKFCDDIDRACA---------------------------------RISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDA
        A+PTY M+CF LPK  C  I    A                                  I  FN ALL KQ+W++L  P SL++K+ + RYF+  D L+A
Subjt:  AIPTYIMSCFKLPKKFCDDIDRACA---------------------------------RISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDA

Query:  NIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSIL----VTPVEFLN----SKVCSLIGEDGC-WKEGEIRNSFLDQDARDILN
         +G   S  W+SI   +++  +G R  VGNG+ I I +  WL+ + + + L    V P E+ +     KV  LI E G  W++  I   F + + + I  
Subjt:  NIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSIL----VTPVEFLN----SKVCSLIGEDGC-WKEGEIRNSFLDQDARDILN

Query:  ITLGSNSTADEIIWVENSIGFFTVKSAYHL---AISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCR
        +  G     D   W   S G +TVKS Y +    I+   + +  S  SL    ++++ WK    P+++ + WK L+N  P    L  + L     C+ C 
Subjt:  ITLGSNSTADEIIWVENSIGFFTVKSAYHL---AISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDINPICVFCR

Query:  KKDETTPYILWDCKFSKDIWL--------------SFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCI
           ET  ++L+ C F++  W               S + NL+   +L   +      WEK S        ++V  L+W+LW+ RN +  +  +    + +
Subjt:  KKDETTPYILWDCKFSKDIWL--------------SFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCI

Query:  SEIETAISDHSRRDFTSTLDFSPESLQSQ---------------------------GVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAIK
           E  + +   R    +    P+  +S                            G+GW +R+    +   GA+++ + + +L  E  A++
Subjt:  SEIETAISDHSRRDFTSTLDFSPESLQSQ---------------------------GVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAIK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-1632.41Show/hide
Query:  AIPTYIMSCFKLPKKFCDDIDRACAR----------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLD
        A+P Y MSCF+L K  C  +  A                                     +  FNQALL KQ ++I+  P++LLS++LR RYF H   ++
Subjt:  AIPTYIMSCFKLPKKFCDDIDRACAR----------------------------------ISLFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLD

Query:  ANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNR
         ++G   S  WRSI+ GR+L  +G    +G+G    IH + WL+R
Subjt:  ANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNR

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.6e-1656.06Show/hide
Query:  VNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADD
        +NG+P   V P RGLRQGDPLSPYLF++C E LS L  R +      GIR++N+ P I+HL FADD
Subjt:  VNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGCGTGGAGTCAGTAGAGTTCTCGATCTTCGTGAATGGCTCCCCGAGCGAATCTGTCAAACCTGAAAGAGGTTTAAGGCAAGGCGACCCGCTGTCCCCCTACCT
TTTCCTTATTTGCGCAGAAGGCCTCTCCAGCTTATTGAACAGGGAAGAATCTCTCTCGCATTTTAACGGTATCCGTATTAATAATCACTGCCCCTCTATTTCTCACTTAT
TCTTTGCTGATGACAATTTGGTTTTTTGTAGGGCATCAAAGAAAGATTGCAGGAGCATTAGAGAGATCCTAAAGGTGTATAAAGTGGCTTCGGGGCAAACTATCAATCTT
GAGAAATCTATCTTCATGACAAGCAAAAACATTGGGGCTGAAAAGCTCAAAGGGCTCTCGAATATTTTGGGAATTGTGCATACTAAGTCTATCAGTCACTACCTTGGTAT
GCCCTCCCAAAACGCTCGAAGCAAGAGCTCCCTATTCAAAAAGCTAAAGGAGAAAGTGGGGAAAGCGCTTCAAGGATGGAAGGTGTCGTTCTTTTCTCAAGGAGGCAAAG
AAACCTTTATTAAGGCCATAGTTCAAGCGATTCCTACTTATATTATGTCTTGCTTTAAGCTCCCTAAAAAATTTTGTGATGATATTGATAGGGCATGTGCTCGGATTAGC
CTTTTCAATCAGGCTTTGTTGGTCAAGCAAATCTGGAAAATTTTGAAGAATCCAAATAGCCTTCTTTCTAAGATTTTGAGAGGCAGGTATTTCAATCATGGAGACTTCCT
TGATGCCAATATAGGCCACAACCACTCACTTACATGGAGAAGCATTCTTTGGGGGCGTGATCTCTTCTTAAAAGGCTACAGATGGAGGGTAGGAAATGGCAAATACATCC
CTATCCACAAAGAGTCTTGGCTCAATAGGGAAGGCAGCAGATCAATCTTGGTCACCCCTGTAGAGTTTTTGAACAGCAAGGTTTGTAGCCTTATAGGGGAGGATGGTTGT
TGGAAGGAGGGAGAGATAAGGAACTCTTTCTTGGACCAAGATGCCAGAGACATTCTTAATATTACTCTTGGCTCCAATTCAACTGCAGATGAGATTATCTGGGTTGAAAA
CTCTATAGGCTTTTTCACTGTTAAATCTGCATATCATCTTGCTATCTCTTCTTCCGAAGCCAAGGAGGCATCCAGTTCTGACTCTCTAGAAGTTACAAAATTGTGGAAGA
GATTTTGGAAGCTCAATCTTATGCCTAGAGTGAAAATTTATTCCTGGAAGTTCCTAAACAATATCACCCCAACGAATGTTAATCTTATCTCTAAGGGTTTGGATATTAAT
CCCATTTGTGTGTTTTGCAGGAAAAAAGATGAAACTACCCCCTACATCCTCTGGGATTGTAAGTTCTCCAAGGACATATGGTTAAGTTTTTTTCCAAACTTACACACTTC
AGTTGATCTTTTCAGGGCTAGCAGAGATGTTATTTCTACATGGGAAAAGATATCGAATTCTCTCAAAGATGAGGACCTTAAAGTAGTTGTGATCCTAATCTGGAAATTGT
GGGAGTTCCGTAACAGAATCTCAATCAAATGTGCAAAAGCAGTCAAAGACCAATGCATTTCAGAAATTGAAACGGCCATTTCGGATCATAGTAGAAGAGACTTTACCTCC
ACCCTGGATTTCAGCCCGGAGAGCCTCCAGAGTCAGGGGGTGGGATGGACGATCCGCGACTCCGACAGATCTCTGTTGCTGGCAGGAGCTAAATCCATCAGAAGGGAAAG
GCCAATTTTAATGCTTGAAGCCATGGCAATTAAACGGGACCTGGTGCAGTTCTTCTCCTCGCATATTGTGGGTGGAATTGGATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGCGTGGAGTCAGTAGAGTTCTCGATCTTCGTGAATGGCTCCCCGAGCGAATCTGTCAAACCTGAAAGAGGTTTAAGGCAAGGCGACCCGCTGTCCCCCTACCT
TTTCCTTATTTGCGCAGAAGGCCTCTCCAGCTTATTGAACAGGGAAGAATCTCTCTCGCATTTTAACGGTATCCGTATTAATAATCACTGCCCCTCTATTTCTCACTTAT
TCTTTGCTGATGACAATTTGGTTTTTTGTAGGGCATCAAAGAAAGATTGCAGGAGCATTAGAGAGATCCTAAAGGTGTATAAAGTGGCTTCGGGGCAAACTATCAATCTT
GAGAAATCTATCTTCATGACAAGCAAAAACATTGGGGCTGAAAAGCTCAAAGGGCTCTCGAATATTTTGGGAATTGTGCATACTAAGTCTATCAGTCACTACCTTGGTAT
GCCCTCCCAAAACGCTCGAAGCAAGAGCTCCCTATTCAAAAAGCTAAAGGAGAAAGTGGGGAAAGCGCTTCAAGGATGGAAGGTGTCGTTCTTTTCTCAAGGAGGCAAAG
AAACCTTTATTAAGGCCATAGTTCAAGCGATTCCTACTTATATTATGTCTTGCTTTAAGCTCCCTAAAAAATTTTGTGATGATATTGATAGGGCATGTGCTCGGATTAGC
CTTTTCAATCAGGCTTTGTTGGTCAAGCAAATCTGGAAAATTTTGAAGAATCCAAATAGCCTTCTTTCTAAGATTTTGAGAGGCAGGTATTTCAATCATGGAGACTTCCT
TGATGCCAATATAGGCCACAACCACTCACTTACATGGAGAAGCATTCTTTGGGGGCGTGATCTCTTCTTAAAAGGCTACAGATGGAGGGTAGGAAATGGCAAATACATCC
CTATCCACAAAGAGTCTTGGCTCAATAGGGAAGGCAGCAGATCAATCTTGGTCACCCCTGTAGAGTTTTTGAACAGCAAGGTTTGTAGCCTTATAGGGGAGGATGGTTGT
TGGAAGGAGGGAGAGATAAGGAACTCTTTCTTGGACCAAGATGCCAGAGACATTCTTAATATTACTCTTGGCTCCAATTCAACTGCAGATGAGATTATCTGGGTTGAAAA
CTCTATAGGCTTTTTCACTGTTAAATCTGCATATCATCTTGCTATCTCTTCTTCCGAAGCCAAGGAGGCATCCAGTTCTGACTCTCTAGAAGTTACAAAATTGTGGAAGA
GATTTTGGAAGCTCAATCTTATGCCTAGAGTGAAAATTTATTCCTGGAAGTTCCTAAACAATATCACCCCAACGAATGTTAATCTTATCTCTAAGGGTTTGGATATTAAT
CCCATTTGTGTGTTTTGCAGGAAAAAAGATGAAACTACCCCCTACATCCTCTGGGATTGTAAGTTCTCCAAGGACATATGGTTAAGTTTTTTTCCAAACTTACACACTTC
AGTTGATCTTTTCAGGGCTAGCAGAGATGTTATTTCTACATGGGAAAAGATATCGAATTCTCTCAAAGATGAGGACCTTAAAGTAGTTGTGATCCTAATCTGGAAATTGT
GGGAGTTCCGTAACAGAATCTCAATCAAATGTGCAAAAGCAGTCAAAGACCAATGCATTTCAGAAATTGAAACGGCCATTTCGGATCATAGTAGAAGAGACTTTACCTCC
ACCCTGGATTTCAGCCCGGAGAGCCTCCAGAGTCAGGGGGTGGGATGGACGATCCGCGACTCCGACAGATCTCTGTTGCTGGCAGGAGCTAAATCCATCAGAAGGGAAAG
GCCAATTTTAATGCTTGAAGCCATGGCAATTAAACGGGACCTGGTGCAGTTCTTCTCCTCGCATATTGTGGGTGGAATTGGATTCTGA
Protein sequenceShow/hide protein sequence
MNCVESVEFSIFVNGSPSESVKPERGLRQGDPLSPYLFLICAEGLSSLLNREESLSHFNGIRINNHCPSISHLFFADDNLVFCRASKKDCRSIREILKVYKVASGQTINL
EKSIFMTSKNIGAEKLKGLSNILGIVHTKSISHYLGMPSQNARSKSSLFKKLKEKVGKALQGWKVSFFSQGGKETFIKAIVQAIPTYIMSCFKLPKKFCDDIDRACARIS
LFNQALLVKQIWKILKNPNSLLSKILRGRYFNHGDFLDANIGHNHSLTWRSILWGRDLFLKGYRWRVGNGKYIPIHKESWLNREGSRSILVTPVEFLNSKVCSLIGEDGC
WKEGEIRNSFLDQDARDILNITLGSNSTADEIIWVENSIGFFTVKSAYHLAISSSEAKEASSSDSLEVTKLWKRFWKLNLMPRVKIYSWKFLNNITPTNVNLISKGLDIN
PICVFCRKKDETTPYILWDCKFSKDIWLSFFPNLHTSVDLFRASRDVISTWEKISNSLKDEDLKVVVILIWKLWEFRNRISIKCAKAVKDQCISEIETAISDHSRRDFTS
TLDFSPESLQSQGVGWTIRDSDRSLLLAGAKSIRRERPILMLEAMAIKRDLVQFFSSHIVGGIGF