; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017915 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017915
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:11604087..11605220
RNA-Seq ExpressionLag0017915
SyntenyLag0017915
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4308601.1 unnamed protein product [Prunus armeniaca]6.6e-6339.42Show/hide
Query:  HIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSR--RRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA
        H+  R+DR   N +    +    + +L    SDH PI    D     ++R  R  + F FEE+WT    C E++  +  W+   S LS+++N     S+ 
Subjt:  HIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSR--RRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA

Query:  LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAV-NFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV
          E G  +     K++ E +R L     + P+   F     IE ELD   E EEIYW +RSR  W++ GD+N+ + H++AT RRK N + GIL+E   W 
Subjt:  LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAV-NFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV

Query:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG
         +  +I   F  +F NLF S    +     S   +Q+RVSS     L +P+++ EIE A+N + PSKAPGPD   A+F Q YW+IVG    + CL++LNG
Subjt:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG

Query:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
           + ++N+T + L+PKV  P  VS++RPISLCNV YKI++KT+ANRLK +L EVIS +QSAFI +R+I +N++ A E
Subjt:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.1e-7242.19Show/hide
Query:  HIWERIDRFLCNQSFDSIFKFSRTRNL-DWIFSDHKPI--EAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWE--EGNSYLSSVSNDLFKC
        +I ER+DR LC++ + S F+     +L +W+ SDH PI  E K+  +     +       +E++W+ YE C+ ++ S   WE  +GNS+ S V       
Subjt:  HIWERIDRFLCNQSFDSIFKFSRTRNL-DWIFSDHKPI--EAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWE--EGNSYLSSVSNDLFKC

Query:  SKALGEV---GKDIYNNRKKRIIE-CKRALNDAFKNIPAVNFEIIHGIELELDS-LFEEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILN
         ++L  +    K+ +  RKK+  E   R      + + A++ E I  +E ++ + L +EE+YWK+RSR DW++ GD+N+K+ H KA+ RR+ N+I G+ +
Subjt:  SKALGEV---GKDIYNNRKKRIIE-CKRALNDAFKNIPAVNFEIIHGIELELDS-LFEEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILN

Query:  EEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGC
        ++G WV+D   IE  F  +FQ LF S+NP   QI+ +L+GL  +VS  MN  L+ PFT  +I RA+++M P+KAPGPD   A F Q +W IVG+     C
Subjt:  EEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGC

Query:  LKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
        L ILN    +   N+T I L+PKV KPR+V +FRPISLCNV Y+IV K IANRLK IL  +IS  QSAFI +RLIT+N+II +E
Subjt:  LKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]3.5e-6439.27Show/hide
Query:  THIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPI---EAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLS--SVSNDLFK
        + ++ R+DR L    +   +K  +  +L    SDH  +   +A +  + A R R     F+FE +WTR E C ++I+    W   +   S   ++  L  
Subjt:  THIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPI---EAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLS--SVSNDLFK

Query:  CSKALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNF-EIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEE
        C++ L E  K I+ N  ++I E K  LN    +    +    I+ +  E++ L + EEI W++RSR  W+  GD+N+K+ H KA+ RR+ N INGI++E 
Subjt:  CSKALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNF-EIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEE

Query:  GIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLK
        G W +    I     +YFQ ++ S+ P   +I+  L+ + + V+  MN  L   FT+ EIE A+N M P+KAPGPD  SAIF Q YWNIVG   +   L 
Subjt:  GIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLK

Query:  ILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
        +LN +  + E N TNITLVPK+  P ++SDFRPISLCNV YK+++K +ANRLK IL ++IS  QSAF+  RLIT+N+++A E
Subjt:  ILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]3.3e-7040.26Show/hide
Query:  ERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLA--TRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLF-KCSK---
        ER+DRF+CN ++  +F      N+D   SDH P+  ++ VR +    ++RR     +E++W+ Y+ C E+IE  K W     + +     +F K SK   
Subjt:  ERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLA--TRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLF-KCSK---

Query:  -ALGEVGKDIYNNRKKRIIECKRALND-AFKNIPAVNFEIIHGIELELD-SLFEEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGI
          L    K+ +  R+K++ +    L       +  V    I  +E ++   L ++EIYWK+RSR DW++ GD+N+K+ H KA+ R+K N I GI N  G 
Subjt:  -ALGEVGKDIYNNRKKRIIECKRALND-AFKNIPAVNFEIIHGIELELD-SLFEEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGI

Query:  WVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKIL
        W+E+   +E  F  YF NLF ++ P+  QIA +L G+  RVS+ MN+ L++PFT  E+  A+  M P+KAPGPD   A+F Q +W  V +  ++ CL IL
Subjt:  WVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKIL

Query:  NGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
        N   ++  +N+T I L+ K  KPR+V+DFRPISLCNV Y+IV K IANRLK +L  +IS  QSAFI + LIT+NII+ +E
Subjt:  NGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]3.7e-6638.89Show/hide
Query:  GTHIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA
        G  IWER+DR + N  + + F   R ++L+   SDH+P+   LD     R R R K F+FE +W     C   +          + + + +  + +C K 
Subjt:  GTHIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA

Query:  LGEVGKDIYNNRKKRIIECKRALNDA-FKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV
        L    K+ + N K++I   K  L  A  +++   +   +  ++ EL  L E EE  W +RSR  W++ GDQN+++ H  AT R++ N I G+ +E G+W 
Subjt:  LGEVGKDIYNNRKKRIIECKRALNDA-FKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV

Query:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG
         +        T++++ LFKS+NP  Q I   ++G+Q  V+++MN  L  P++  E+ERAI DM P KAPGPD    +F Q YW+ V        L  LN 
Subjt:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG

Query:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
           +K  N+T ITL+PKV  P +V++FRPISLCNV YKIV+K IANRLK +L  +IS+ QSAFI DRLIT+N++IA E
Subjt:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

TrEMBL top hitse value%identityAlignment
A0A2N9FMJ0 Reverse transcriptase domain-containing protein7.6e-6537.7Show/hide
Query:  WERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKALGEV
        W R+DR L N  +      +   ++D   SDHK +    +    +  +R  K F+FEE+WT  E C   I+ +       + +  V+N L +C + LG  
Subjt:  WERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKALGEV

Query:  GKDIYNNRKKRIIECKRALNDA-FKNIPAVNFEIIHGIELELDSLF-EEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS
         +  + N  K++ E KR L  A  + + + N   +  +++E++SL  +EE  W++RSR +W+R GDQN+++ H +A+ RR+ N I G+ +E G W +   
Subjt:  GKDIYNNRKKRIIECKRALNDA-FKNIPAVNFEIIHGIELELDSLF-EEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS

Query:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI
        EI +  T Y++++F+++ PD  QI  ++  + + +S +MND L   F+  E+++A+  M P  APGPD    +F Q YW++VG +  +G L  LN  K +
Subjt:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI

Query:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
           N+T ITL+PKV  P +V++FRPISLCNV YK+V+K IANRLK IL ++IS  QSAF+  RLIT+NI++A E
Subjt:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

A0A2N9GPZ7 Reverse transcriptase domain-containing protein1.7e-6438.24Show/hide
Query:  RIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDLFKCSKALGE
        R+DR + + S+ + +  S   +L    SDH PI   LD+      +R+ K F+FE +W + E+C E+I+    W +G    S +  V   +  C  +L  
Subjt:  RIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDLFKCSKALGE

Query:  VGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS
          ++ + +    I   +  L       P+     I  ++ +L+ L E EEI+W++RSR  W+  GD+N+K+ H +   RR+TN I+G+ + +G+W  + +
Subjt:  VGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS

Query:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI
        +I     +YFQ +F S+NP ++ I   L+G++S V++ MND+L+  FTK E+  A+  M+P+KAPGPD  SAIF Q YW+IVG +     L IL+    +
Subjt:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI

Query:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
        ++ N T+I L+PKV  P  ++DFRPISLCNV YKIV+K +ANRLK +L  VIS  QSAF+  RLIT+N+++A E
Subjt:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.7e-6438.24Show/hide
Query:  RIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDLFKCSKALGE
        R+DR + + S+ + +  S   +L    SDH PI   LD+      +R+ K F+FE +W + E+C E+I+    W +G    S +  V   +  C  +L  
Subjt:  RIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDLFKCSKALGE

Query:  VGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS
          ++ + +    I   +  L       P+     I  ++ +L+ L E EEI+W++RSR  W+  GD+N+K+ H +   RR+TN I+G+ + +G+W  + +
Subjt:  VGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDIS

Query:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI
        +I     +YFQ +F S+NP ++ I   L+G++S V++ MND+L+  FTK E+  A+  M+P+KAPGPD  SAIF Q YW+IVG +     L IL+    +
Subjt:  EIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEI

Query:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
        ++ N T+I L+PKV  P  ++DFRPISLCNV YKIV+K +ANRLK +L  VIS  QSAF+  RLIT+N+++A E
Subjt:  KEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

A0A2N9J109 Uncharacterized protein3.7e-6738.1Show/hide
Query:  GTHIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA
        G  +WER+D+ +    + +IF  +R  +LD+  SDHKP+        +  + R  K F FEE+W     CAE I ++       +++  V + L  C K 
Subjt:  GTHIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA

Query:  LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEI-IHGIELELDSLF-EEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV
        L    K  + + +K++ + +R L  A +N       + IH +  E+  L  +EE  W++RSR  W+++GD+N+ + H +AT R++ N I G+ + +G+W 
Subjt:  LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEI-IHGIELELDSLF-EEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWV

Query:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG
         +  +I++  T+YFQ++F ++NP S  I   L+ + + V+ +MN+ L  P+T  E+E A+  M P  APGPD    +F QN+W+++G+  I G L  LN 
Subjt:  EDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNG

Query:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
         K ++  N+TNITL+PKV  P +VS+FRPISLCNV YKI++K +ANRLK IL  +IS  QSAF+  RLIT+NI++A E
Subjt:  DKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

A0A803PWX1 Uncharacterized protein5.8e-6536.94Show/hide
Query:  THIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRL---ATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCS
        + I ER+DR LCN+ +   F+ +  + LDW  SDH+ +   + VR+        +R   F FEE W + E+C E+I++     +G     S    + KC 
Subjt:  THIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRL---ATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCS

Query:  KALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIW
        KAL +  K         I + K+ L++         +E I  +E +L+ L E +E YW++RSR  W++WGD+N+K+ H KA+ RRK NEI G+ ++ G+W
Subjt:  KALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFE-EEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIW

Query:  VEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILN
         +D   +     +Y++ LF  ++ D   +   LE +Q +VS +MN++L V F+  E+ +A+  M P+KAPG D   A+F Q +W+ +  + I  CL +LN
Subjt:  VEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILN

Query:  GDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE
           ++   N+T + L+PKV KP+++ +FRPISLCNV YKIV+K +ANRL+  L +V+S+ QSAF++ RLI +N I+ +E
Subjt:  GDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.0e-1926.05Show/hide
Query:  HIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRR---RIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSK
        H + +ID  + +++   + K  RT  +    SDH  I+ +L ++  T+SR    ++ N    + W   E  AE+    K + E N    +   +L+   K
Subjt:  HIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRR---RIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSK

Query:  ALGEVGKDIYNNRKKRIIECKR--ALNDAFKNI--------PAVNFEIIHGIELELDSLFEEEIYWKERSREDW-IRWGDQNSKWLHRKATIRRKTNEIN
        A+   GK I  N  KR  E  +   L    K +         A   + I  I  EL  +  ++   K      W     ++  + L R    +R+ N+I+
Subjt:  ALGEVGKDIYNNRKKRIIECKR--ALNDAFKNI--------PAVNFEIIHGIELELDSLFEEEIYWKERSREDW-IRWGDQNSKWLHRKATIRRKTNEIN

Query:  GILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKK
         I N++G    D +EI+++   Y+++L+ +   + +++   L+     R++    + L  P T  EI   IN +   K+PGPD F+A F Q Y     ++
Subjt:  GILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKK

Query:  PINGCLKILNG-DKE---IKEWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI
         +   LK+    +KE      +   +I L+PK  +   +  +FRPISL N++ KI+ K +ANR++  +K++I + Q  FI
Subjt:  PINGCLKILNG-DKE---IKEWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI

P08548 LINE-1 reverse transcriptase homolog1.1e-1525.33Show/hide
Query:  WERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKL--DVRLATRSRR-RIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA-
        + +ID  L ++S  ++ KF +   +  IFSDH  I+ +L  +  L T ++  ++ N   ++ W   E   E+    K+ E+ N+  ++  N L+  +KA 
Subjt:  WERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKL--DVRLATRSRR-RIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKA-

Query:  -----------LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFEEEIYWKERSREDW----IRWGDQNSKWLHRKATIRRKTN
                   L +  ++  NN    +   K+   +   N      + I  I  EL+ +  + I  +    + W    I   D+    L RK   +R  +
Subjt:  -----------LGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFEEEIYWKERSREDW----IRWGDQNSKWLHRKATIRRKTN

Query:  EINGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIV
         I+ I N       D SEI+     Y++ L+     + ++I   LE     R+S    + L  P +  EI   I ++   K+PGPD F++ F Q +   +
Subjt:  EINGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIV

Query:  GKKPINGCLKILNGDKEIKEWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI
            +N    I         +   NITL+PK  K P    ++RPISL N++ KI+ K + NR++  +K++I + Q  FI
Subjt:  GKKPINGCLKILNGDKEIKEWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-1532.24Show/hide
Query:  INGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVG
        IN I NE+G    D  EI+++  ++++ L+ +   +  ++   L+  Q  +++    D L  P +  EIE  IN +   K+PGPD FSA F Q +     
Subjt:  INGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQ-SRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVG

Query:  KKPINGCLKILNGDKEIK-----EWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI
        K+ +   L  L    E++      +    ITL+PK  K P ++ +FRPISL N++ KI+ K +ANR++  +K +I   Q  FI
Subjt:  KKPINGCLKILNGDKEIK-----EWNNTNITLVPKVPK-PREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFI

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-1527.85Show/hide
Query:  RSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERA
        RSR   +   D+ S++ +     +    +I  +  E+G  +ED   I     +++QNLF S +P S      L      VS    ++L+ P T  E+ +A
Subjt:  RSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDISEIESSFTNYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERA

Query:  INDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNY
        +  M  +K+PG D  +  F Q +W+ +G        +     +         ++L+PK    R + ++RP+SL + +YKIV K I+ RLK +L EVI   
Subjt:  INDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNY

Query:  QSAFIQDRLITNNIIIAHE
        QS  +  R I +N+ +  +
Subjt:  QSAFIQDRLITNNIIIAHE

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.9e-2325.79Show/hide
Query:  IWERIDRFLCNQSFDSIFKFS-RTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDL---FK
        I  ++DR + N  + S F  +     L  + SDH P    L+  L  RS++  + F F      +     L+     WEE     S++ S+   L    K
Subjt:  IWERIDRFLCNQSFDSIFKFS-RTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEG---NSYLSSVSNDL---FK

Query:  CSKALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFEE--EIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEE
        C K L   G     ++ K  ++   ++       P+ +   +  +  +  + F    E +++++SR  W++ GD N+++ H+     +  N I  +  ++
Subjt:  CSKALGEVGKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFEE--EIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEE

Query:  GIWVEDISEIESSFTNYFQNLFKSTN----PDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPIN
         + VE++++++     Y+ +L  S +    PDS Q    +     R + T+  +L    +  EI  A+  M  +KAPGPD F+A F    W +V    I 
Subjt:  GIWVEDISEIESSFTNYFQNLFKSTN----PDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPIN

Query:  GCLKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVT
           +       +K +N T ITL+PKV    ++S FRP+S C V YKI+T
Subjt:  GCLKILNGDKEIKEWNNTNITLVPKVPKPREVSDFRPISLCNVNYKIVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACTCATATCTGGGAGAGAATTGACCGTTTCTTGTGCAACCAGAGTTTTGACTCCATTTTCAAATTTTCAAGAACAAGAAACTTGGATTGGATATTTTCA
GATCATAAGCCCATAGAAGCAAAACTTGATGTTCGACTGGCTACCCGAAGTAGGAGAAGAATAAAAAATTTTAAATTCGAGGAACTTTGGACAAGGTATGAGAAG
TGTGCTGAATTGATTGAAAGCAACAAATATTGGGAAGAAGGCAACTCTTATCTCTCTTCTGTTTCAAATGATTTATTCAAATGTTCTAAGGCCCTCGGGGAAGTG
GGGAAGGACATTTATAATAACAGAAAGAAAAGAATCATAGAATGCAAAAGAGCCCTTAATGATGCTTTTAAAAATATTCCAGCAGTAAATTTCGAAATCATTCAT
GGTATAGAATTAGAACTAGATTCTTTATTTGAAGAAGAGATATATTGGAAAGAAAGATCCAGAGAGGATTGGATAAGGTGGGGTGACCAAAATTCTAAATGGTTA
CATAGAAAAGCCACTATTAGGAGAAAAACAAATGAAATAAATGGAATCCTAAATGAGGAAGGTATTTGGGTAGAAGATATTTCTGAAATAGAGAGTTCTTTCACT
AACTATTTCCAAAACCTTTTTAAATCAACCAATCCAGATTCTCAACAAATTGCATTATCTTTGGAAGGCCTTCAATCTAGGGTTAGTTCTACCATGAATGATAAA
CTAAAGGTCCCCTTCACCAAATACGAAATAGAAAGAGCTATTAACGATATGTTTCCATCAAAAGCCCCAGGACCGGACGAGTTTTCTGCTATTTTCAACCAGAAT
TACTGGAACATAGTTGGGAAGAAACCAATTAATGGATGTCTCAAAATTTTAAATGGGGATAAAGAAATTAAAGAATGGAATAACACTAATATAACTCTAGTCCCA
AAGGTTCCTAAACCTAGAGAGGTGAGCGATTTTAGGCCTATTAGCCTATGCAATGTGAATTACAAAATTGTTACGAAAACTATTGCAAATAGGTTAAAAGGAATC
TTAAAAGAAGTTATTTCAAACTACCAATCTGCCTTTATCCAGGACAGACTAATAACCAACAACATCATAATAGCCCACGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACTCATATCTGGGAGAGAATTGACCGTTTCTTGTGCAACCAGAGTTTTGACTCCATTTTCAAATTTTCAAGAACAAGAAACTTGGATTGGATATTTTCA
GATCATAAGCCCATAGAAGCAAAACTTGATGTTCGACTGGCTACCCGAAGTAGGAGAAGAATAAAAAATTTTAAATTCGAGGAACTTTGGACAAGGTATGAGAAG
TGTGCTGAATTGATTGAAAGCAACAAATATTGGGAAGAAGGCAACTCTTATCTCTCTTCTGTTTCAAATGATTTATTCAAATGTTCTAAGGCCCTCGGGGAAGTG
GGGAAGGACATTTATAATAACAGAAAGAAAAGAATCATAGAATGCAAAAGAGCCCTTAATGATGCTTTTAAAAATATTCCAGCAGTAAATTTCGAAATCATTCAT
GGTATAGAATTAGAACTAGATTCTTTATTTGAAGAAGAGATATATTGGAAAGAAAGATCCAGAGAGGATTGGATAAGGTGGGGTGACCAAAATTCTAAATGGTTA
CATAGAAAAGCCACTATTAGGAGAAAAACAAATGAAATAAATGGAATCCTAAATGAGGAAGGTATTTGGGTAGAAGATATTTCTGAAATAGAGAGTTCTTTCACT
AACTATTTCCAAAACCTTTTTAAATCAACCAATCCAGATTCTCAACAAATTGCATTATCTTTGGAAGGCCTTCAATCTAGGGTTAGTTCTACCATGAATGATAAA
CTAAAGGTCCCCTTCACCAAATACGAAATAGAAAGAGCTATTAACGATATGTTTCCATCAAAAGCCCCAGGACCGGACGAGTTTTCTGCTATTTTCAACCAGAAT
TACTGGAACATAGTTGGGAAGAAACCAATTAATGGATGTCTCAAAATTTTAAATGGGGATAAAGAAATTAAAGAATGGAATAACACTAATATAACTCTAGTCCCA
AAGGTTCCTAAACCTAGAGAGGTGAGCGATTTTAGGCCTATTAGCCTATGCAATGTGAATTACAAAATTGTTACGAAAACTATTGCAAATAGGTTAAAAGGAATC
TTAAAAGAAGTTATTTCAAACTACCAATCTGCCTTTATCCAGGACAGACTAATAACCAACAACATCATAATAGCCCACGAGTGA
Protein sequenceShow/hide protein sequence
MGTHIWERIDRFLCNQSFDSIFKFSRTRNLDWIFSDHKPIEAKLDVRLATRSRRRIKNFKFEELWTRYEKCAELIESNKYWEEGNSYLSSVSNDLFKCSKALGEV
GKDIYNNRKKRIIECKRALNDAFKNIPAVNFEIIHGIELELDSLFEEEIYWKERSREDWIRWGDQNSKWLHRKATIRRKTNEINGILNEEGIWVEDISEIESSFT
NYFQNLFKSTNPDSQQIALSLEGLQSRVSSTMNDKLKVPFTKYEIERAINDMFPSKAPGPDEFSAIFNQNYWNIVGKKPINGCLKILNGDKEIKEWNNTNITLVP
KVPKPREVSDFRPISLCNVNYKIVTKTIANRLKGILKEVISNYQSAFIQDRLITNNIIIAHE