CuGenDBv2

Lag0042089 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0042089
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr13:36295977..36305096
RNA-Seq Expression: Lag0042089
Synteny: Lag0042089
Gene Ontology terms:
GO:0050789 - regulation of biological process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN78803.1 hypothetical protein VITISV_032700 [Vitis vinifera] (e value: 7.6e-89; identity: 46.77%)
Query:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS---------------------------------LD
        + D +E+ G     L  +R + + +L E++ RE   WRQKAKV+W KE  CNS FYH+    RR+                                 LD
Subjt:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS---------------------------------LD

Query:  THFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRS
        + F+ EEI + IF+ D  K+PG D F++  FQE W++IK+DL RVF EF   G+I+   N +F+ LIPKK   +R  D R ISL+TS+YKIIAKVL+ R 
Subjt:  THFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRS

Query:  KKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEIT
        + +L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKIDFEKAYDHV WD LD VL  KGF   WR W+  C+  V Y IL+NG  KG + 
Subjt:  KKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEIT

Query:  ASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        ASRGLRQGDPLSPFLF +V DVLSR++ R+ ER +++GF V  +   +SHLQFADDT+FF +  E+    L  +L  F  +SGLK+N
Subjt:  ASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

RVW16109.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 3.1e-90; identity: 47.41%)
Query:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS--------------------------------LDT
        + D +E+ G     L  +R++ + +L E++ RE   WRQKAKV+W KE DCNS FYH+    RR+                                L++
Subjt:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS--------------------------------LDT

Query:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK
         F+ EEI + IF+ D +K+PG D F++  FQE W++IK+DL RVF EF   GII+   N +F+ LIPKK   +R  D R ISL+TS+YKIIAKVL+ R +
Subjt:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK

Query:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA
         +L   I  +QGAF++ RQILD  LIANE +++ RR ++E V+FKIDFEKAYDHV WD LD VL  KGF   WR W+  C+  V + IL+NG  KG + A
Subjt:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA

Query:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        SRGLRQGDPLSPFLF +V DVLSR++ R+ ER +L+GF V R+   +SHLQFADDT+FF +  E+    L  +L  F  +SGLK+N
Subjt:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

RVW39477.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 3.6e-91; identity: 46.63%)
Query:  ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYH------------------------------QERRSLDT
        +++  + D LE+ G     L  +R   + +L E++ RE   WRQKA+V+W KE DCNS F+H                              +E   L++
Subjt:  ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYH------------------------------QERRSLDT

Query:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK
         F+ EEI + IF+ D  K+PG D F++  FQ+ W++IK+DL RVF EF   GII+   N +F+ L+PKK +  R  D R ISL+TS+YKIIAKVLA R +
Subjt:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK

Query:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA
          L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKIDFEKAYDHV WD LD VL MKGF   WR W+  C+  V Y +L+NG  KG + A
Subjt:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA

Query:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        SRGLRQGDPLSPFLF +V DVLSR++ ++ ER +L+GF V R+   +SHLQFADDT+FF S  E+  + L  +L  F  + GLK+N
Subjt:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

RVX11537.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 1.2e-89; identity: 48.38%)
Query:  IDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQERRS--------------------LDTHFSLEEIRRVIFESDG
        ID +E+ G+    L  ER   R +L +++ +E   WRQK+KV+W KE DCNS F+H+                        LD  F+ EE+RR +F+ + 
Subjt:  IDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQERRS--------------------LDTHFSLEEIRRVIFESDG

Query:  SKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIE
         K+PG D F++  +QE W++IK+DL RVF EF   G+I+   N TF+ L+PKK +  +  D R ISLVTS+YKIIAKVL+ R +K+L   I +SQGAF+E
Subjt:  SKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIE

Query:  WRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFL
         R ILD  LIANE +++ RR  +E ++FKIDFEKAYDHVDW  LD VL  KGF   WRSW+  C+    + IL+NG  KG + ASRGLRQGDPLSPFLF 
Subjt:  WRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFL

Query:  VVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        +V DVLSR++ R+ E G+ +GF V RD   +S LQFADDT+FF     +    L  IL  F  +SGLKIN
Subjt:  VVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 9.9e-89; identity: 42.5%)
Query:  LKLDCMIYVKATLMSLS------ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS-------
        +K    ++ KA+   LS      +S  ++ D LE+ G     L  +R   + +L E++ RE   WRQKA+V+W KE DCNS F+H+    RR+       
Subjt:  LKLDCMIYVKATLMSLS------ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS-------

Query:  ---------------------------------------------------LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFK
                                                           L++ F+ EEI + IF+ D  K+PG D F++  FQ+ WE+IK+DL +VF 
Subjt:  ---------------------------------------------------LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFK

Query:  EFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKI
        EF   GII+   N +F+ L+PKK    R  D R ISL+TS+YKIIAKVLA R +++L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKI
Subjt:  EFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKI

Query:  DFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVA
        DFEKAYDHV WD LD V+ MKGFG  WR W+  C+  V + +L+NG  KG + ASRGLRQGDPLSPFLF +V DVLSR++ ++ ER +L+GF+V R+   
Subjt:  DFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVA

Query:  LSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        +SHLQFADDT+FF S  E+  + L ++L  F  +SGLK+N
Subjt:  LSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

TrEMBL top hits (e value, % identity, alignment)
A0A438BYN0 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.5e-90; identity: 47.41%)
Query:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS--------------------------------LDT
        + D +E+ G     L  +R++ + +L E++ RE   WRQKAKV+W KE DCNS FYH+    RR+                                L++
Subjt:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS--------------------------------LDT

Query:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK
         F+ EEI + IF+ D +K+PG D F++  FQE W++IK+DL RVF EF   GII+   N +F+ LIPKK   +R  D R ISL+TS+YKIIAKVL+ R +
Subjt:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK

Query:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA
         +L   I  +QGAF++ RQILD  LIANE +++ RR ++E V+FKIDFEKAYDHV WD LD VL  KGF   WR W+  C+  V + IL+NG  KG + A
Subjt:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA

Query:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        SRGLRQGDPLSPFLF +V DVLSR++ R+ ER +L+GF V R+   +SHLQFADDT+FF +  E+    L  +L  F  +SGLK+N
Subjt:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

A0A438DVH3 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.8e-91; identity: 46.63%)
Query:  ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYH------------------------------QERRSLDT
        +++  + D LE+ G     L  +R   + +L E++ RE   WRQKA+V+W KE DCNS F+H                              +E   L++
Subjt:  ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYH------------------------------QERRSLDT

Query:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK
         F+ EEI + IF+ D  K+PG D F++  FQ+ W++IK+DL RVF EF   GII+   N +F+ L+PKK +  R  D R ISL+TS+YKIIAKVLA R +
Subjt:  HFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSK

Query:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA
          L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKIDFEKAYDHV WD LD VL MKGF   WR W+  C+  V Y +L+NG  KG + A
Subjt:  KILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITA

Query:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        SRGLRQGDPLSPFLF +V DVLSR++ ++ ER +L+GF V R+   +SHLQFADDT+FF S  E+  + L  +L  F  + GLK+N
Subjt:  SRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

A0A438JRF4 LINE-1 retrotransposable element ORF2 protein (e value: 5.7e-90; identity: 48.38%)
Query:  IDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQERRS--------------------LDTHFSLEEIRRVIFESDG
        ID +E+ G+    L  ER   R +L +++ +E   WRQK+KV+W KE DCNS F+H+                        LD  F+ EE+RR +F+ + 
Subjt:  IDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQERRS--------------------LDTHFSLEEIRRVIFESDG

Query:  SKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIE
         K+PG D F++  +QE W++IK+DL RVF EF   G+I+   N TF+ L+PKK +  +  D R ISLVTS+YKIIAKVL+ R +K+L   I +SQGAF+E
Subjt:  SKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIE

Query:  WRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFL
         R ILD  LIANE +++ RR  +E ++FKIDFEKAYDHVDW  LD VL  KGF   WRSW+  C+    + IL+NG  KG + ASRGLRQGDPLSPFLF 
Subjt:  WRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFL

Query:  VVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        +V DVLSR++ R+ E G+ +GF V RD   +S LQFADDT+FF     +    L  IL  F  +SGLKIN
Subjt:  VVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

A5BA26 Reverse transcriptase domain-containing protein (e value: 3.7e-89; identity: 46.77%)
Query:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS---------------------------------LD
        + D +E+ G     L  +R + + +L E++ RE   WRQKAKV+W KE  CNS FYH+    RR+                                 LD
Subjt:  SIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS---------------------------------LD

Query:  THFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRS
        + F+ EEI + IF+ D  K+PG D F++  FQE W++IK+DL RVF EF   G+I+   N +F+ LIPKK   +R  D R ISL+TS+YKIIAKVL+ R 
Subjt:  THFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRS

Query:  KKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEIT
        + +L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKIDFEKAYDHV WD LD VL  KGF   WR W+  C+  V Y IL+NG  KG + 
Subjt:  KKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEIT

Query:  ASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        ASRGLRQGDPLSPFLF +V DVLSR++ R+ ER +++GF V  +   +SHLQFADDT+FF +  E+    L  +L  F  +SGLK+N
Subjt:  ASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

A5CAA2 Reverse transcriptase domain-containing protein (e value: 4.8e-89; identity: 42.5%)
Query:  LKLDCMIYVKATLMSLS------ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS-------
        +K    ++ KA+   LS      +S  ++ D LE+ G     L  +R   + +L E++ RE   WRQKA+V+W KE DCNS F+H+    RR+       
Subjt:  LKLDCMIYVKATLMSLS------ISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQ---ERRS-------

Query:  ---------------------------------------------------LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFK
                                                           L++ F+ EEI + IF+ D  K+PG D F++  FQ+ WE+IK+DL +VF 
Subjt:  ---------------------------------------------------LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFK

Query:  EFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKI
        EF   GII+   N +F+ L+PKK    R  D R ISL+TS+YKIIAKVLA R +++L   I  +QGAF++ RQILD  LIANE +++ RR  +E V+FKI
Subjt:  EFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKI

Query:  DFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVA
        DFEKAYDHV WD LD V+ MKGFG  WR W+  C+  V + +L+NG  KG + ASRGLRQGDPLSPFLF +V DVLSR++ ++ ER +L+GF+V R+   
Subjt:  DFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVA

Query:  LSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        +SHLQFADDT+FF S  E+  + L ++L  F  +SGLK+N
Subjt:  LSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 4.0e-24; identity: 30.74%)
Query:  QERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKII
        +E  SL+   +  EI  +I      KSPG D F+  F+Q   E +   L ++F+    +GI+ N   E  + LIPK  +   + ++ R ISL+    KI+
Subjt:  QERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKII

Query:  AKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILI
         K+LANR ++ +  +I   Q  FI   Q       +   I+   R K K  V+  ID EKA+D +    + K L   G    +   +          I++
Subjt:  AKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILI

Query:  NGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        NG          G RQG PLSP LF +VL+VL+R + +  E   +KG ++ ++ V LS   FADD + +      S   L  ++  F  +SG KIN
Subjt:  NGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

P08548 LINE-1 reverse transcriptase homolog (e value: 1.7e-22; identity: 29.05%)
Query:  QERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKII
        +E   L+   S  EI   I      KSPG D F+  F+Q   E +   L  +F+    +GI+ N   E  + LIPK  K   R ++ R ISL+    KI+
Subjt:  QERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKII

Query:  AKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILI
         K+L NR ++ +  +I   Q  FI   Q       +   I+   + K K+ ++  ID EKA+D++    + + L   G    +   +          I++
Subjt:  AKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILI

Query:  NGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
        NG          G RQG PLSP LF +V++VL+  +    E   +KG  +  + + LS   FADD + +     DS   L  +++ + ++SG KIN
Subjt:  NGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.0e-24; identity: 30.93%)
Query:  LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKIIAKVLA
        L++  S +EI  VI      KSPG D FS  F+Q   E +   L ++F +   +G + N   E  + LIPK  K   + ++ R ISL+    KI+ K+LA
Subjt:  LDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDK-VERGKDCRLISLVTSVYKIIAKVLA

Query:  NRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPK
        NR ++ + ++I   Q  FI   Q       +   I    + K K  ++  +D EKA+D +    + KVL   G    + + +          I +NG   
Subjt:  NRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK-KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPK

Query:  GEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN
          I    G RQG PLSP+LF +VL+VL+R + +  E   +KG ++ ++ V +S L  ADD + + S  ++S   L +++  F  + G KIN
Subjt:  GEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLFFCSREEDSFLLLSHILEFFESMSGLKIN

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.8e-24; identity: 28.1%)
Query:  WDCNSSFYHQERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISL
        WD       + +  L+T  +L+E+ + +     +KSPGLD  ++ FFQ  W+ +  D  RV  E F KG +        + L+PKK  +   K+ R +SL
Subjt:  WDCNSSFYHQERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIKQDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISL

Query:  VTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRL
        +++ YKI+AK ++ R K +L  +I   Q   +  R I D   +  + +   RR         +D EKA+D VD   L   L    FG  +  ++      
Subjt:  VTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRL

Query:  VRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTL
            + IN      +   RG+RQG PLS  L+ + ++    L+ + +   +LK  ++    V LS   +ADD +
Subjt:  VRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTL

Q9LTY1 Mitotic spindle checkpoint protein MAD1 (e value: 3.9e-19; identity: 47.54%)
Query:  MIVRTPPPKKQRSDVRALPDSSSAGASDLPLVIYEDPLTLVPATTQPESSHEPSEHMLCTYQCRQMVKSDFLDALSNAEKQVDDYELKLGVLNENLSKVV
        MI+RTP PK+ RSD    P  + A  S   L+IYED     PA  Q    H   +H LCTYQCRQMVK+D LDALS AEKQV++ + KL  LN N ++  
Subjt:  MIVRTPPPKKQRSDVRALPDSSSAGASDLPLVIYEDPLTLVPATTQPESSHEPSEHMLCTYQCRQMVKSDFLDALSNAEKQVDDYELKLGVLNENLSKVV

Query:  ALWSLRFFFWDSFSHGKKEVAA
        A    R  F D F + ++E+AA
Subjt:  ALWSLRFFFWDSFSHGKKEVAA

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 9.1e-08; identity: 35.8%)
Query:  LANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK--KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHW
        +  R K ++ ++I  +Q +FI  R   D  +   EA+   RR+K  K  +L K+D EKAYD + WD L+  L   GF   W
Subjt:  LANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRK--KEEVLFKIDFEKAYDHVDWDVLDKVLAMKGFGHHW

AT5G49880.1 mitotic checkpoint family protein (e value: 2.7e-20; identity: 47.54%)
Query:  MIVRTPPPKKQRSDVRALPDSSSAGASDLPLVIYEDPLTLVPATTQPESSHEPSEHMLCTYQCRQMVKSDFLDALSNAEKQVDDYELKLGVLNENLSKVV
        MI+RTP PK+ RSD    P  + A  S   L+IYED     PA  Q    H   +H LCTYQCRQMVK+D LDALS AEKQV++ + KL  LN N ++  
Subjt:  MIVRTPPPKKQRSDVRALPDSSSAGASDLPLVIYEDPLTLVPATTQPESSHEPSEHMLCTYQCRQMVKSDFLDALSNAEKQVDDYELKLGVLNENLSKVV

Query:  ALWSLRFFFWDSFSHGKKEVAA
        A    R  F D F + ++E+AA
Subjt:  ALWSLRFFFWDSFSHGKKEVAA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 2.0e-15; identity: 54.41%)
Query:  LINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDT
        +ING P+G +T SRGLRQGDPLSP+LF++  +VLS L  R+ E+G L G  V  +   ++HL FADDT
Subjt:  LINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDT


Sequences
CDS sequence
ATGATTGTGAGGACTCCTCCACCAAAGAAGCAACGATCCGATGTTAGGGCTCTGCCTGATTCCTCATCGGCAGGTGCCTCAGATCTGCCGCTCGTTATCTATGAAGATCC
TCTTACCTTGGTGCCAGCAACGACCCAACCAGAGTCGTCGCACGAGCCCTCTGAGCACATGCTCTGCACTTACCAATGCCGGCAAATGGTGAAGTCAGACTTTTTAGACG
CACTAAGCAATGCAGAGAAGCAAGTTGATGATTATGAATTGAAGTTAGGCGTGTTGAATGAGAACCTCTCCAAAGTTGTGGCTCTTTGGTCTTTGCGGTTTTTCTTCTGG
GATAGTTTCTCCCATGGAAAAAAAGAAGTGGCTGCCAATGTGAAAGATTTTAGGCCAATCAGCTTAACTTCCTCAATCTATAAAATCCTTGCTAAGGTGCTTGCCGAAAG
GGTAAAGAAAGTAATGCCCTCGACAATTTCTTGCTCCCAAAGTGCCTTCTTAGATGGAAGACAGATTTTAGATCCAATTCTCATTGCGAATGAAGTGCCGGAGGAATATA
GGATGGGTTCTTTAGAAGACGCTAGAAGATGGTCCCTTGATTCCTGGGGTGTTTTCTCTATCAAGTCTTTATCATGTTACTTGGCTTCCGCTTCTCCTATGGATAATGAA
GTTTATTTTGCTTTATGGAAATCTAATAGTGCTAAACATTTCTCTATCACTCGCCTGCTACCACTCGTCCGTCGCCGCTCAGCCACTGTCGATCCTCCGCTAGGGTTATT
GTGGGTCTTAAGAATCATAGTTGATTTTGGGTATTCTGATGGTCTCGGTAGTCTGAATTACTCTATGACTCTAGATGTTGGAGAGAATTTTCTATTGGTCAAAATGGCAA
AAGAGATACGGGATGCTGCCTGTGACACCTACTCTAGTTTTGAGAATTCATCTGCTATATTGGCATTGAAACTCGATTGTATGATCTATGTCAAGGCGACCTTAATGTCA
CTCAGTATTTCAATCTTCTTATCCATTGACGATTTGGAAGAGTCTGGGAGCTCATTTGAGGGCCTTAAGGAGGAGCGCATGGCCTTAAGACTTCAGTTGGTAGAGATAGT
GAGGAGAGAATCTTGTAGTTGGAGGCAAAAGGCGAAAGTTCAATGGGCTAAGGAGTGGGATTGTAACTCTTCTTTTTATCATCAGGAGAGGAGGAGTTTGGACACCCATT
TTTCATTGGAGGAAATTAGGAGGGTTATTTTCGAGAGTGATGGTAGCAAATCCCCCGGTCTTGATGAATTTTCGATGGGTTTCTTCCAGGAGAATTGGGAGATTATTAAA
CAGGACTTGGAGAGAGTGTTCAAGGAGTTTTTTGACAAGGGTATTATTGATAATGGGATGAATGAAACTTTTGTATGCCTTATTCCAAAGAAGGACAAGGTGGAAAGAGG
GAAGGATTGTAGACTTATTAGTCTAGTTACAAGCGTTTATAAGATCATTGCTAAAGTGCTAGCCAATAGATCGAAAAAGATCCTCCCTTCAATGATATTCGAGTCTCAAG
GAGCTTTTATTGAATGGAGACAAATCCTTGATCAAACGCTGATAGCTAATGAGGCTATAGAGGATTATAGGAGGAGAAAGAAGGAGGAAGTGTTATTTAAGATTGATTTT
GAGAAAGCGTATGACCACGTAGATTGGGATGTCCTCGATAAGGTTCTAGCTATGAAAGGTTTTGGGCACCATTGGAGATCGTGGGTCTGGAGATGTGTGAGATTGGTTAG
GTATTTTATTCTCATTAATGGTTGCCCTAAAGGGGAAATTACTGCTTCCAGAGGCCTTAGACAAGGTGATCCGTTGTCCCCTTTCCTTTTCTTAGTAGTTCTGGATGTCT
TGAGCAGGTTGGTCTCTAGAAGTGTGGAGAGAGGTATTTTGAAGGGGTTTGAGGTGGAGAGGGATGGAGTGGCCTTATCTCACCTTCAGTTTGCAGACGACACTTTATTC
TTTTGTTCGAGGGAGGAGGATTCCTTTCTCTTATTGAGTCATATCTTAGAGTTCTTCGAGTCGATGTCGGGGCTCAAGATAAATAGAAGGAGTGTCAGATGTTAA
mRNA sequence
ATGATTGTGAGGACTCCTCCACCAAAGAAGCAACGATCCGATGTTAGGGCTCTGCCTGATTCCTCATCGGCAGGTGCCTCAGATCTGCCGCTCGTTATCTATGAAGATCC
TCTTACCTTGGTGCCAGCAACGACCCAACCAGAGTCGTCGCACGAGCCCTCTGAGCACATGCTCTGCACTTACCAATGCCGGCAAATGGTGAAGTCAGACTTTTTAGACG
CACTAAGCAATGCAGAGAAGCAAGTTGATGATTATGAATTGAAGTTAGGCGTGTTGAATGAGAACCTCTCCAAAGTTGTGGCTCTTTGGTCTTTGCGGTTTTTCTTCTGG
GATAGTTTCTCCCATGGAAAAAAAGAAGTGGCTGCCAATGTGAAAGATTTTAGGCCAATCAGCTTAACTTCCTCAATCTATAAAATCCTTGCTAAGGTGCTTGCCGAAAG
GGTAAAGAAAGTAATGCCCTCGACAATTTCTTGCTCCCAAAGTGCCTTCTTAGATGGAAGACAGATTTTAGATCCAATTCTCATTGCGAATGAAGTGCCGGAGGAATATA
GGATGGGTTCTTTAGAAGACGCTAGAAGATGGTCCCTTGATTCCTGGGGTGTTTTCTCTATCAAGTCTTTATCATGTTACTTGGCTTCCGCTTCTCCTATGGATAATGAA
GTTTATTTTGCTTTATGGAAATCTAATAGTGCTAAACATTTCTCTATCACTCGCCTGCTACCACTCGTCCGTCGCCGCTCAGCCACTGTCGATCCTCCGCTAGGGTTATT
GTGGGTCTTAAGAATCATAGTTGATTTTGGGTATTCTGATGGTCTCGGTAGTCTGAATTACTCTATGACTCTAGATGTTGGAGAGAATTTTCTATTGGTCAAAATGGCAA
AAGAGATACGGGATGCTGCCTGTGACACCTACTCTAGTTTTGAGAATTCATCTGCTATATTGGCATTGAAACTCGATTGTATGATCTATGTCAAGGCGACCTTAATGTCA
CTCAGTATTTCAATCTTCTTATCCATTGACGATTTGGAAGAGTCTGGGAGCTCATTTGAGGGCCTTAAGGAGGAGCGCATGGCCTTAAGACTTCAGTTGGTAGAGATAGT
GAGGAGAGAATCTTGTAGTTGGAGGCAAAAGGCGAAAGTTCAATGGGCTAAGGAGTGGGATTGTAACTCTTCTTTTTATCATCAGGAGAGGAGGAGTTTGGACACCCATT
TTTCATTGGAGGAAATTAGGAGGGTTATTTTCGAGAGTGATGGTAGCAAATCCCCCGGTCTTGATGAATTTTCGATGGGTTTCTTCCAGGAGAATTGGGAGATTATTAAA
CAGGACTTGGAGAGAGTGTTCAAGGAGTTTTTTGACAAGGGTATTATTGATAATGGGATGAATGAAACTTTTGTATGCCTTATTCCAAAGAAGGACAAGGTGGAAAGAGG
GAAGGATTGTAGACTTATTAGTCTAGTTACAAGCGTTTATAAGATCATTGCTAAAGTGCTAGCCAATAGATCGAAAAAGATCCTCCCTTCAATGATATTCGAGTCTCAAG
GAGCTTTTATTGAATGGAGACAAATCCTTGATCAAACGCTGATAGCTAATGAGGCTATAGAGGATTATAGGAGGAGAAAGAAGGAGGAAGTGTTATTTAAGATTGATTTT
GAGAAAGCGTATGACCACGTAGATTGGGATGTCCTCGATAAGGTTCTAGCTATGAAAGGTTTTGGGCACCATTGGAGATCGTGGGTCTGGAGATGTGTGAGATTGGTTAG
GTATTTTATTCTCATTAATGGTTGCCCTAAAGGGGAAATTACTGCTTCCAGAGGCCTTAGACAAGGTGATCCGTTGTCCCCTTTCCTTTTCTTAGTAGTTCTGGATGTCT
TGAGCAGGTTGGTCTCTAGAAGTGTGGAGAGAGGTATTTTGAAGGGGTTTGAGGTGGAGAGGGATGGAGTGGCCTTATCTCACCTTCAGTTTGCAGACGACACTTTATTC
TTTTGTTCGAGGGAGGAGGATTCCTTTCTCTTATTGAGTCATATCTTAGAGTTCTTCGAGTCGATGTCGGGGCTCAAGATAAATAGAAGGAGTGTCAGATGTTAA
Protein sequence
MIVRTPPPKKQRSDVRALPDSSSAGASDLPLVIYEDPLTLVPATTQPESSHEPSEHMLCTYQCRQMVKSDFLDALSNAEKQVDDYELKLGVLNENLSKVVALWSLRFFFW
DSFSHGKKEVAANVKDFRPISLTSSIYKILAKVLAERVKKVMPSTISCSQSAFLDGRQILDPILIANEVPEEYRMGSLEDARRWSLDSWGVFSIKSLSCYLASASPMDNE
VYFALWKSNSAKHFSITRLLPLVRRRSATVDPPLGLLWVLRIIVDFGYSDGLGSLNYSMTLDVGENFLLVKMAKEIRDAACDTYSSFENSSAILALKLDCMIYVKATLMS
LSISIFLSIDDLEESGSSFEGLKEERMALRLQLVEIVRRESCSWRQKAKVQWAKEWDCNSSFYHQERRSLDTHFSLEEIRRVIFESDGSKSPGLDEFSMGFFQENWEIIK
QDLERVFKEFFDKGIIDNGMNETFVCLIPKKDKVERGKDCRLISLVTSVYKIIAKVLANRSKKILPSMIFESQGAFIEWRQILDQTLIANEAIEDYRRRKKEEVLFKIDF
EKAYDHVDWDVLDKVLAMKGFGHHWRSWVWRCVRLVRYFILINGCPKGEITASRGLRQGDPLSPFLFLVVLDVLSRLVSRSVERGILKGFEVERDGVALSHLQFADDTLF
FCSREEDSFLLLSHILEFFESMSGLKINRRSVRC