CuGenDBv2

Lag0001768 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001768
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:35190302..35191635
RNA-Seq Expression: Lag0001768
Synteny: Lag0001768
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber] | e value: 2.1e-82 | % identity: 47.29
Query:  FDEIWLRYADLPEVVSSSWG-VNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEE-AIADLRFSSSREKLVRVETQLADILLEEELYW
        F+ +W R     EV+  +W  +   P     L+ +   C E+L  W RS  G   + +R    ++++  I D     + E +  ++ ++ + L +EE+ W
Subjt:  FDEIWLRYADLPEVVSSSWG-VNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEE-AIADLRFSSSREKLVRVETQLADILLEEELYW

Query:  KQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVR
        KQRSR LWL+ GDRNT++FH+ AS R+RKNRI GLL+   VW EDQ  +  I+ DYFS IF S  PS   FD+ L  +S RV ++MN  LL  F A+EVR
Subjt:  KQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVR

Query:  LALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIIS
        +AL+Q+HP K+PG DG+S +FY+++W++V  DVI+C L VLN GV P  +N+T I LIPKV SPQ+IT+FRPISLCNV YKIISKVL NR+K  L  +I 
Subjt:  LALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIIS

Query:  PIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
          QSAF+PGR ++DN ++ FE +H +  R  G     A+KLDM KAYDRVE
Subjt:  PIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

XP_027118368.1 uncharacterized protein LOC113735569 [Coffea arabica] | e value: 5.0e-76 | % identity: 41.64
Query:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEEL
        K   F+ +W++  D   ++  SW  + +    + L +KT+ C   L  W +   G + +      QR+      +   S + ++  + TQL ++L ++++
Subjt:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEEL

Query:  YWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKE
         WKQRS+  W +EGDRNT +FH  AS R++ N+I GL D +  W + Q  + SI+ ++F  IF++S  + L  D V+  V   V  +MN  LLR +   E
Subjt:  YWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKE

Query:  VRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHI
        V  AL Q+HP KSPG DG+S +FY+++W +VG+DV +  L VLN G  P SLN T +VLIPK  SP  IT FRPISLCNV YK+ SKVL NR++  L  +
Subjt:  VRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHI

Query:  ISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        ISP QSAFIPGRC+ DN ++ +E  H+L+L+  G  G  ++KLDM KA+D+VE
Subjt:  ISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

XP_027118730.1 uncharacterized protein LOC113735973 [Coffea arabica] | e value: 2.5e-75 | % identity: 41.83
Query:  FDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEELYWKQ
        F+ +W+R  D   ++ +SW  + +      L +KT+ C   L  W +   G + +      QR+      +   S + ++  + TQL ++L ++++ WKQ
Subjt:  FDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEELYWKQ

Query:  RSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRLA
        R +  W +E DRNT +FH  AS R++ N+I GL D +  W + Q  + SI+ ++F  IF++S P+ L  D V+  V   V  +MN  LLR + A EV  A
Subjt:  RSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRLA

Query:  LKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPI
        L Q+HP KSPG DG+S +FY+++W +VG+DV +  L VLN G  P SLN T +VLIPK  SP  IT FRPISLCNV YK+ SK+L NR++  L  +ISP 
Subjt:  LKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPI

Query:  QSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        QSAFIPGRC+ DN ++ +E  H+L+L+  G  G  ++KLDM KA+D+VE
Subjt:  QSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] | e value: 1.2e-74 | % identity: 44.23
Query:  RFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSR-CMETLASWGRSKLGAYPRRIRLASQRVE----EAIADLRFSSSREKLVRVETQLADILLEE
        RF++ WL+  +  E++S+SW ++ S  DP+   + + R C   L  W   K G   + I+LA + V      A +D  FS+   K+   E+ L ++L  E
Subjt:  RFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSR-CMETLASWGRSKLGAYPRRIRLASQRVE----EAIADLRFSSSREKLVRVETQLADILLEE

Query:  ELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCA
        E YW+QRSR  WLQ GDRNT++FHS AS R   NRI+ L D H      +  +  +++DYF  +FT+S         VL  +   + D+ N  L + F A
Subjt:  ELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCA

Query:  KEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLH
         EV  ALK I  +KSPG DG+S +FY  +W++VG+ V Q  L VLN+G +P S N T+I LIPK++ P+ + DFRPISLCNVTYKIISK+L  R KE LH
Subjt:  KEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLH

Query:  HIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
         +IS  QSAF+  R + DN ++ FE +H+L+ R  G  G+AALKLDM KA+DRVE
Subjt:  HIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa] | e value: 1.6e-74 | % identity: 43.02
Query:  RFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSS-REKLVRVETQLADILLEEELYW
        RF+++WL+  +  E++SSSW  N      + L    S C   L  W   K G   + I+ A Q V      +      + K+   ET L D+L  EE YW
Subjt:  RFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSS-REKLVRVETQLADILLEEELYW

Query:  KQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVR
        +QRSR  WLQ GDRNT++FHS AS R   NRI+ L+D H      +  +  +++DYF  +FT+S         VL  +   + D+ N+ L + F A +V 
Subjt:  KQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVR

Query:  LALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIIS
          LK I  +KSPG DG+S +FY  +WD+VG  V +  L VLNHG +P + N T+I LIPK++ P+ + DFRPISLCNVTYKIISK+L  R KE LH +IS
Subjt:  LALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIIS

Query:  PIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
          QSAF+  R + DN ++ FE +H+L+ R  G  G+ A KLDM KA+DRVE
Subjt:  PIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EDY7 Reverse transcriptase domain-containing protein | e value: 1.1e-76 | % identity: 43.92
Query:  VFCFV-KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVL---AIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQL
        V CF  K  RF+E+W   A   E + ++W   +S V  S +   A K   C   L  W R K G+  R++R   +   EA  +     S +KL ++++++
Subjt:  VFCFV-KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVL---AIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQL

Query:  ADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQS
          +L +EE  W+QRSR  WL+EGDRNTR+FH  AS R+R+NRI GL D   VW+E++S    ++  +F  IF +S P  +  +  +  V   +  ++N S
Subjt:  ADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQS

Query:  LLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVN
        L   F A+EV LALKQ+ P K+PG DG+  LF++++W LVG +V Q  L  LN G    ++N T I LIPKV++P+RIT+FRPISLCNVTYK+ISKV+ N
Subjt:  LLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVN

Query:  RMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        R+K  L  IIS  QSAF+PGR + DN ++ FE +H +    +G  G  A+KLDM KAYDRVE
Subjt:  RMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

A0A2N9GPZ7 Reverse transcriptase domain-containing protein | e value: 6.8e-79 | % identity: 42.66
Query:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAI-KTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEE
        K+ RF+ +W++     EV+  +WG   +   P  + + K   C  +L  W R + G+    I+   ++++  I +   S     ++ ++  L  +L +EE
Subjt:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAI-KTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEE

Query:  LYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAK
        ++W+QRSR  W+ EGD+NT++FH+  + R+R N I GL D+  VWQ +++++  I  DYF GIFTSS PS      VL  +   V + MN  L   F   
Subjt:  LYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAK

Query:  EVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHH
        EV LALKQ++P K+PG DG+S +FY+ +WD+VG +V Q  L +L+ G     +N T I LIPKV++P+ ITDFRPISLCNV YKI+SKVL NR+K+ L  
Subjt:  EVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHH

Query:  IISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        +IS  QSAF+PGR + DN ++ FE +H++ L+  G  G  ALKLDM KAYDRVE
Subjt:  IISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

A0A2N9IPS8 Reverse transcriptase domain-containing protein | e value: 6.8e-79 | % identity: 42.66
Query:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAI-KTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEE
        K+ RF+ +W++     EV+  +WG   +   P  + + K   C  +L  W R + G+    I+   ++++  I +   S     ++ ++  L  +L +EE
Subjt:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAI-KTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEE

Query:  LYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAK
        ++W+QRSR  W+ EGD+NT++FH+  + R+R N I GL D+  VWQ +++++  I  DYF GIFTSS PS      VL  +   V + MN  L   F   
Subjt:  LYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAK

Query:  EVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHH
        EV LALKQ++P K+PG DG+S +FY+ +WD+VG +V Q  L +L+ G     +N T I LIPKV++P+ ITDFRPISLCNV YKI+SKVL NR+K+ L  
Subjt:  EVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHH

Query:  IISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        +IS  QSAF+PGR + DN ++ FE +H++ L+  G  G  ALKLDM KAYDRVE
Subjt:  IISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

A0A6P6WSG1 uncharacterized protein LOC113735569 | e value: 2.4e-76 | % identity: 41.64
Query:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEEL
        K   F+ +W++  D   ++  SW  + +    + L +KT+ C   L  W +   G + +      QR+      +   S + ++  + TQL ++L ++++
Subjt:  KICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEEL

Query:  YWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKE
         WKQRS+  W +EGDRNT +FH  AS R++ N+I GL D +  W + Q  + SI+ ++F  IF++S  + L  D V+  V   V  +MN  LLR +   E
Subjt:  YWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKE

Query:  VRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHI
        V  AL Q+HP KSPG DG+S +FY+++W +VG+DV +  L VLN G  P SLN T +VLIPK  SP  IT FRPISLCNV YK+ SKVL NR++  L  +
Subjt:  VRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHI

Query:  ISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        ISP QSAFIPGRC+ DN ++ +E  H+L+L+  G  G  ++KLDM KA+D+VE
Subjt:  ISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

A0A7N2LIH6 Uncharacterized protein | e value: 2.6e-78 | % identity: 43.43
Query:  FDEIWLRYADLPEVVSSSWG-VNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEELYWK
        F+E+W R  +  E+V  +W    E    P  +  +  RC + L  W ++  G   + I+    R+++  +      + E++  ++ ++ ++   EE+ WK
Subjt:  FDEIWLRYADLPEVVSSSWG-VNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEELYWK

Query:  QRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRL
        QRSR  WLQ GD+N+++FH+ AS R++KNRI GL+D   VW EDQ     ++ DYF  I++S+ P+   FD  L  +  RV  +MN  L + F A EV  
Subjt:  QRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRL

Query:  ALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISP
        AL+Q+HP K+PG DG+S +FY+++WD+VG  V  C L  LN GV P  +N T I LIPK ++PQ+IT+FRPISLCNV YKIISKVL NR+K+ LH +I  
Subjt:  ALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISP

Query:  IQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
         QSAF+PGR + DN ++ FE +H++  R  G  G  A+KLDM KAYDRVE
Subjt:  IQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 6.4e-18 | % identity: 24.15
Query:  SSSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVL
        +S R+++ ++  +L +I  ++ L     SR  + +  ++  R        ++ KN+I  + +       D + + + + +Y+  ++ +   +  + D  L
Subjt:  SSSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVL

Query:  CDVS-PRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKV-QSPQRITDFRPI
           + PR++ +  +SL RP    E+   +  +   KSPG DG +  FY+++ + +   +++    +   G+ P S  +  I+LIPK  +   +  +FRPI
Subjt:  CDVS-PRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKV-QSPQRITDFRPI

Query:  SLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPG
        SL N+  KI++K+L NR+++ +  +I   Q  FIPG
Subjt:  SLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPG

P08548 LINE-1 reverse transcriptase homolog | e value: 3.2e-17 | % identity: 23.38
Query:  SSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVL-
        S R+++ ++  +L +I  +  +    +S+  + ++ ++  +   +    ++ K+ I  + + ++    D S +  IL++Y+  +++    +  + D+ L 
Subjt:  SSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVL-

Query:  -CDVSPRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKV-QSPQRITDFRPI
         C + PR+  +  + L RP  + E+   ++ +   KSPG DG +  FY+   + +   ++     +   G+ P +  +  I LIPK  + P R  ++RPI
Subjt:  -CDVSPRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKV-QSPQRITDFRPI

Query:  SLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        SL N+  KI++K+L NR+++ +  II   Q  FIPG     N       I    +  L       L +D  KA+D ++
Subjt:  SLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.0e-15 | % identity: 23.1
Query:  SSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLC
        S R++++++  ++  +     +    ++R  + ++ ++  +         + K  I  + ++      D   + + +  ++  ++++   +  + D+ L 
Subjt:  SSREKLVRVETQLADILLEEELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLC

Query:  DVS-PRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQ-SPQRITDFRPIS
            P+++      L  P   KE+   +  +   KSPG DG S  FY+   + +   + +    +   G  P S  +  I LIPK Q  P +I +FRPIS
Subjt:  DVS-PRVDDQMNQSLLRPFCAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQ-SPQRITDFRPIS

Query:  LCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        L N+  KI++K+L NR++E +  II P Q  FIPG     N       IH   +  L       + LD  KA+D+++
Subjt:  LCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 7.6e-27 | % identity: 32.13
Query:  RSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRLA
        RSR   L + DR +R+F++    +  + +I  L  +     ED   +      ++  +F S  P   D    L D  P V ++  + L  P    E+  A
Subjt:  RSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRLA

Query:  LKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPI
        L+ +  NKSPG DGL+  F++  WD +G D  +        G  PLS    ++ L+PK    + I ++RP+SL +  YKI++K +  R+K  L  +I P 
Subjt:  LKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPI

Query:  QSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE
        QS  +PGR + DN  L  + +H  R   L L   A L LD  KA+DRV+
Subjt:  QSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRVE

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM | e value: 4.3e-06 | % identity: 25.11
Query:  SHRQRKNRIQGLLDQH---------NVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCA-KEVRLALKQIHPNKSPG
        S RQ+   +Q   D+H         N   E       I+  Y+  + T   PS    + +          QM+ SL R + A  E  L   ++  + SPG
Subjt:  SHRQRKNRIQGLLDQH---------NVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCA-KEVRLALKQIHPNKSPG

Query:  SDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVV
         DG++    K   ++    +++   ++L  G  P S+     V IPK  + +R  DFRPIS+ +V  + ++ +L  R+   ++    P Q  F+P     
Subjt:  SDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVV

Query:  DNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYD
        DNA +       LR  +          LD+ KA+D
Subjt:  DNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYD

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 6.0e-19 | % identity: 32.63
Query:  ELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTS-SGPSELDFDRVLCDVSP-RVDDQMNQSLLRPF
        E +++Q+SR  WLQ+GD NTR+FH      Q KN I+ L    +V  E+ ++V  ++  Y++ +  S S     D  + + D+ P R +D +   L    
Subjt:  ELYWKQRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTS-SGPSELDFDRVLCDVSP-RVDDQMNQSLLRPF

Query:  CAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIIS
          KE+  A+  +  NK+PG D  +  F+ + W +V    I         G      N T I LIPKV    +++ FRP+S C V YKII+
Subjt:  CAKEVRLALKQIHPNKSPGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 1.0e-10 | % identity: 46.88
Query:  LVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRV
        +V R+K  + ++I P Q++FIPGR   DN V   E +H++R R  G+ GW  LKLD+ KAYDR+
Subjt:  LVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFECIHALRLRYLGLTGWAALKLDMFKAYDRV


Sequences
CDS sequence
ATGTGTGTATTTTGTTTTGTTAAGATATGCCGTTTTGATGAGATATGGCTACGATATGCTGACCTTCCAGAGGTTGTTAGTTCTTCGTGGGGAGTTAATGAGTCTCCTGT
GGATCCTTCCGTTTTGGCAATCAAAACTTCTCGTTGTATGGAGACATTGGCTTCGTGGGGGCGTTCGAAATTAGGAGCCTATCCTCGCCGTATTCGTTTGGCTTCTCAGC
GGGTGGAGGAGGCCATTGCTGATTTGCGCTTTTCTTCTTCTCGGGAGAAGTTGGTTCGGGTTGAGACTCAGTTGGCAGATATTCTCCTTGAGGAGGAACTATATTGGAAG
CAACGCTCCAGGGAACTATGGTTGCAGGAAGGAGACCGTAATACCAGATGGTTTCATAGTTGTGCATCTCATCGACAACGGAAAAATCGAATTCAGGGTCTGTTAGATCA
GCACAATGTTTGGCAAGAGGATCAATCACGGGTTCTATCAATTCTTTCGGACTATTTCTCTGGCATTTTCACTAGTTCTGGGCCTTCTGAATTGGATTTCGATAGAGTTC
TTTGTGATGTCAGTCCTCGTGTGGATGATCAGATGAATCAGTCGTTGTTACGACCGTTTTGTGCTAAGGAGGTTCGATTGGCTCTTAAGCAAATTCACCCTAATAAATCC
CCTGGGTCGGATGGGCTTTCCGGTTTATTCTACAAACAGCACTGGGATCTAGTTGGTAAGGATGTGATTCAGTGTTGTCTGGTTGTTCTGAATCATGGAGTTTCTCCGCT
GTCGTTAAATGATACCATGATAGTACTGATTCCGAAAGTGCAGTCGCCTCAGCGGATTACAGACTTCAGGCCCATCTCTTTATGTAATGTCACTTATAAGATTATCTCCA
AGGTCCTCGTGAATCGGATGAAGGAGTTTTTGCATCATATTATTTCCCCAATTCAGAGTGCTTTCATTCCGGGGCGATGTGTGGTGGATAACGCCGTATTGGGTTTTGAA
TGTATCCACGCACTTCGCTTGAGATATTTGGGGCTGACAGGATGGGCGGCCCTGAAGCTGGATATGTTTAAAGCGTATGATCGTGTGGAATGA
mRNA sequence
ATGTGTGTATTTTGTTTTGTTAAGATATGCCGTTTTGATGAGATATGGCTACGATATGCTGACCTTCCAGAGGTTGTTAGTTCTTCGTGGGGAGTTAATGAGTCTCCTGT
GGATCCTTCCGTTTTGGCAATCAAAACTTCTCGTTGTATGGAGACATTGGCTTCGTGGGGGCGTTCGAAATTAGGAGCCTATCCTCGCCGTATTCGTTTGGCTTCTCAGC
GGGTGGAGGAGGCCATTGCTGATTTGCGCTTTTCTTCTTCTCGGGAGAAGTTGGTTCGGGTTGAGACTCAGTTGGCAGATATTCTCCTTGAGGAGGAACTATATTGGAAG
CAACGCTCCAGGGAACTATGGTTGCAGGAAGGAGACCGTAATACCAGATGGTTTCATAGTTGTGCATCTCATCGACAACGGAAAAATCGAATTCAGGGTCTGTTAGATCA
GCACAATGTTTGGCAAGAGGATCAATCACGGGTTCTATCAATTCTTTCGGACTATTTCTCTGGCATTTTCACTAGTTCTGGGCCTTCTGAATTGGATTTCGATAGAGTTC
TTTGTGATGTCAGTCCTCGTGTGGATGATCAGATGAATCAGTCGTTGTTACGACCGTTTTGTGCTAAGGAGGTTCGATTGGCTCTTAAGCAAATTCACCCTAATAAATCC
CCTGGGTCGGATGGGCTTTCCGGTTTATTCTACAAACAGCACTGGGATCTAGTTGGTAAGGATGTGATTCAGTGTTGTCTGGTTGTTCTGAATCATGGAGTTTCTCCGCT
GTCGTTAAATGATACCATGATAGTACTGATTCCGAAAGTGCAGTCGCCTCAGCGGATTACAGACTTCAGGCCCATCTCTTTATGTAATGTCACTTATAAGATTATCTCCA
AGGTCCTCGTGAATCGGATGAAGGAGTTTTTGCATCATATTATTTCCCCAATTCAGAGTGCTTTCATTCCGGGGCGATGTGTGGTGGATAACGCCGTATTGGGTTTTGAA
TGTATCCACGCACTTCGCTTGAGATATTTGGGGCTGACAGGATGGGCGGCCCTGAAGCTGGATATGTTTAAAGCGTATGATCGTGTGGAATGA
Protein sequence
MCVFCFVKICRFDEIWLRYADLPEVVSSSWGVNESPVDPSVLAIKTSRCMETLASWGRSKLGAYPRRIRLASQRVEEAIADLRFSSSREKLVRVETQLADILLEEELYWK
QRSRELWLQEGDRNTRWFHSCASHRQRKNRIQGLLDQHNVWQEDQSRVLSILSDYFSGIFTSSGPSELDFDRVLCDVSPRVDDQMNQSLLRPFCAKEVRLALKQIHPNKS
PGSDGLSGLFYKQHWDLVGKDVIQCCLVVLNHGVSPLSLNDTMIVLIPKVQSPQRITDFRPISLCNVTYKIISKVLVNRMKEFLHHIISPIQSAFIPGRCVVDNAVLGFE
CIHALRLRYLGLTGWAALKLDMFKAYDRVE