CuGenDBv2

Lag0026247 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026247
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:33193876..33195622
RNA-Seq Expression: Lag0026247
Synteny: Lag0026247
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
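
The genome location above uses a chromosome:start..end notation. A minimal sketch of how such a string could be parsed follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:33193876..33195622")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span covers 1747 bases on chr10.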


Homology
GenBank top hits (e-value, % identity, alignment)
KAA3462561.1 reverse transcriptase [Gossypium australe]; e-value 1.6e-73; identity 38.57%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW
        MK L WN RGLGNPRA+R LRH +  +NPQ++F METK +  K E+++    Y+    V SLG+ GGL + W  E  +T+ S+S  HIDV+I +D     
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW

Query:  RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFFSAKILRSRDDWLKS
        R TGFYG+                         P +   EK   L R ++                      + W+               R+R +WLK 
Subjt:  RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFFSAKILRSRDDWLKS

Query:  GDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFS-SSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSK
        GDRNT +FH +A+Q+K+RN I  L  E G    E +++ E+A +YF +LFS  S PN     +I+  I   ++E+    L   +TK EI  A+  + P+K
Subjt:  GDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFS-SSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSK

Query:  APRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGR
        AP  DG  A FYQ+ W ++G++  N CL  LNN  D++ INKT I L+PK  +P  + +FRPISLCNV YK+IAK++ANRL+  +   I   Q+AFVPGR
Subjt:  APRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGR

Query:  LISDNVTIGFECIHAITSKR
        LISDNV + +E +H + +K+
Subjt:  LISDNVTIGFECIHAITSKR

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]; e-value 7.5e-71; identity 34.47%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW
        MK + WN +GLG  R  R  + L+ +  PQ++F+ ETK    + E  +  L++E  F V   G  GGL +LW  +  + + SYS+ HID +I  +   SW
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW

Query:  RFTGFYG--TLKQTKDTFLGTCWKGLRLV--LKVLGWPIVAQWEKQSALSRTI-----KATKLLRFEEG-----------------WLKLKDTKKIIAEE
        R T  YG    +Q K T     W  LR +  +  L W     + +   L+  +       +++  F +                  W   ++  KI+  +
Subjt:  RFTGFYG--TLKQTKDTFLGTCWKGLRLV--LKVLGWPIVAQWEKQSALSRTI-----KATKLLRFEEG-----------------WLKLKDTKKIIAEE

Query:  WKTTSGSGAQFFSAK-----------------------------------------IL---------RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGL
            S    Q +S K                                         IL         RSR DWLK GD+NTK+FHAKAS ++K+N+I G+
Subjt:  WKTTSGSGAQFFSAK-----------------------------------------IL---------RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGL

Query:  LSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVN
        L E G+W E+ D++  +   +FT LFS++ P  + +    +  S K++E+   +LD P+ + EI  A+  + P+KAP  DG  A+F+Q++W  V +  + 
Subjt:  LSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVN

Query:  TCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAI-TSKRKEN
        TCL ILN+  ++AP+N T IALIPKT  PK + EFRPISLCNV Y+IIAKS+AN LK  LD I+SP+Q+AF+  RLI+DN+ IG+E ++ I   K K+N
Subjt:  TCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAI-TSKRKEN

XP_012477795.1 PREDICTED: uncharacterized protein LOC105793429 [Gossypium raimondii]; e-value 3.3e-71; identity 36.16%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIK--DKPVS
        MK +CWN RGLG+PRA+R LR L+ + NPQ++F+METK    + E I+    +     V  +G   G+ + W  E ++ + S S  HIDVL+K  D    
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIK--DKPVS

Query:  WRFTGFYGT-LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGA---------------
        WRFTGFYG+   Q K+      W     +LK LG         Q     + K     +FE  W   +  ++ I E WK+ +GS                 
Subjt:  WRFTGFYGT-LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGA---------------

Query:  -------------------------------QFFSAKI--------------LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKI
                                       Q    +I               R+R +WLK  D+NT +FH  AS +++ N I+ L S+ G  + EE +I
Subjt:  -------------------------------QFFSAKI--------------LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKI

Query:  SEVATNYFTRLFSSSNPNPQA--IQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADM
        +E+AT YF +LFS++     +  +  I  CIS  I+E     L + +T  EI  A+K +  +KAP  DG    F+Q+YW++VG+D  N CL++LNN  D+
Subjt:  SEVATNYFTRLFSSSNPNPQA--IQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADM

Query:  APINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK
          +N T I LIPK  NP  + +FRPISLC V YKIIAK+IANRL++ +   I   Q+AFVPGRLISDNV I +E +H +  KRK
Subjt:  APINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]; e-value 1.0e-72; identity 33.21%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIK-DKPVSW
        MK L WN RGLGNPR   AL+ ++    P L+F+ ETK    +   +   L YE  F+V S+G  GGL +LWNSETKV I S+++ HID  I+ +     
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIK-DKPVSW

Query:  RFTGFYG--TLKQTKDTF------------------------------------LGTCWKGL---RLVLKVLGW-----PIVAQWE-KQSALSRTIKATK
        R TG YG    +Q K T+                                        W+ +        +  W     P+V + + + S ++   +   
Subjt:  RFTGFYG--TLKQTKDTF------------------------------------LGTCWKGL---RLVLKVLGW-----PIVAQWE-KQSALSRTIKATK

Query:  LLRFEEGWLKLKDTKKIIAEEW----------------KTTSGSGA----------------------QFFSAKI-------------------------
        L+ +E+ W      K+II +EW                K +  S A                      Q  S K+                         
Subjt:  LLRFEEGWLKLKDTKKIIAEEW----------------KTTSGSGA----------------------QFFSAKI-------------------------

Query:  -----LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTK
              RSR DWLK GD+NTK+FH KAS +KK+N+I G+ + +G W+E  + +      YFT LF++S PN   I   +  IS ++S +  + L+ P+T 
Subjt:  -----LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTK

Query:  TEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALD
         E+  A+  + P+KAP  DG  A F+Q++W+ V    ++TCL ILN   D+AP N T I LI K   P+++ +FRPISLCNV Y+I+AK+IANRLK  L 
Subjt:  TEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALD

Query:  SIISPHQAAFVPGRLISDNVTIGFECIHAI
        ++ISP Q+AF+P  LI+DN+ +G+EC+H I
Subjt:  SIISPHQAAFVPGRLISDNVTIGFECIHAI

XP_030926688.1 uncharacterized protein LOC115953246 [Quercus lobata]; e-value 1.2e-73; identity 37.78%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPV-SW
        M +L WN RGLGN RA++ L  +V  + P+++F+ ET S+  + ++IK  L+++ +F VPS    GGL +LW SE  + ++S+S  HID ++   P  +W
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPV-SW

Query:  -RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALS------RTIKATKLLRFEEGWL--------KLKDTKKII--AEEWKTTSGSGA
         R  G     ++         W       +V          +   LS      R   + K  RFE  WL        ++K+TK+++  AEE     G+  
Subjt:  -RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALS------RTIKATKLLRFEEGWL--------KLKDTKKII--AEEWKTTSGSGA

Query:  QFFSAK----ILRSRDD----------WLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISG
        +    K    +L  R++          W++SGDRNTK+FH  A+Q+K+RN I GL  ESG W  +E+ +S V T Y+T+LF+SS  NPQ + +++E +  
Subjt:  QFFSAK----ILRSRDD----------WLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISG

Query:  KISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSY
         +SE+  ++L + YT  E+E A+K + P KAP  DG    F+Q YW  +G D     L  LN+ + +  IN T I LIPK KNP ++ E+RPISLCNV Y
Subjt:  KISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSY

Query:  KIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKR
        KII+K IANRLK  L++IIS  Q+AF+  R+I+DNV + FE +H +T+ R
Subjt:  KIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKR

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9FBC1 Reverse transcriptase domain-containing protein; e-value 1.2e-71; identity 32.21%
Query:  RGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVS-WRFTGFYGT
        +GLGNP A+RAL H+V K+ P+++F+METK D G+ E I+V L ++  F+VPSLG +GGL +LW ++ +V I +YS+ HID  +  K    WR TGFYG 
Subjt:  RGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVS-WRFTGFYGT

Query:  LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEK----------------------------------------------------QSALSRTIKATKLL-
         +Q +        K L   L VL W  +  + +                                                    Q  L R +     L 
Subjt:  LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEK----------------------------------------------------QSALSRTIKATKLL-

Query:  --------------------------------------RFEEGWLKLKDTKKIIAEEWK--------------TTSGSGAQFFSAKIL------------
                                              RFEE W    D +K+I E W+              T +  G      KIL            
Subjt:  --------------------------------------RFEEGWLKLKDTKKIIAEEWK--------------TTSGSGAQFFSAKIL------------

Query:  -----RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKT
             RSR+ WL +GD+NT++FH KA Q++ +N + GLL  +G W EEE ++  +   YF  +FS+S  +   ++  V CI   ++    ++L   +T  
Subjt:  -----RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKT

Query:  EIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDS
        EI+ A   + PSKAP  DG  + F+Q+YW +VG D V   L ++N+   +  +N + + LIPK KNP+ + ++RPISL NV YKI++K +ANRLK  L  
Subjt:  EIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDS

Query:  IISPHQAAFVPGRLISDNVTIGFECIHAITSKRK
        IIS  Q+AFVPGR I+DN+ + FE +H + ++RK
Subjt:  IISPHQAAFVPGRLISDNVTIGFECIHAITSKRK

A0A2N9G497 Reverse transcriptase domain-containing protein; e-value 5.4e-75; identity 34.13%
Query:  RGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVS-WRFTGFYGT
        +GLGNP A+RAL H+V K+ P+++F+METK D G+ E I+V L ++  F+VPSLG +GGL +LW ++ +V I +YS+ HID  +  K    WR TGFYG 
Subjt:  RGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVS-WRFTGFYGT

Query:  LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEK-------------------QSALSRTIKATKLL----------------------------------
         +Q +        K L   L VL W  +  + +                   +  L R +     L                                  
Subjt:  LKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEK-------------------QSALSRTIKATKLL----------------------------------

Query:  -----RFEEGWLKLKDTKKIIAEEWK--------------TTSGSGAQFFSAKIL-----------------RSRDDWLKSGDRNTKWFHAKASQQKKRN
             RFEE W    D +K+I E W+              T +  G      KIL                 RSR+ WL +GD+NT++FH KA Q++ +N
Subjt:  -----RFEEGWLKLKDTKKIIAEEWK--------------TTSGSGAQFFSAKIL-----------------RSRDDWLKSGDRNTKWFHAKASQQKKRN

Query:  KIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVG
         + GLL  +G W EEE ++  +   YF  +FS+S  +   ++  V CI   ++    ++L   +T  EI+ A   + PSKAP  DG  + F+Q+YW +VG
Subjt:  KIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVG

Query:  DDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKR
         D V   L ++N+   +  +N + + LIPK KNP+ + ++RPISL NV YKI++K +ANRLK  L  IIS  Q+AFVPGR I+DN+ + FE +H + ++R
Subjt:  DDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKR

Query:  K
        K
Subjt:  K

A0A2N9HDH5 Uncharacterized protein; e-value 4.3e-72; identity 31.77%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW
        M  + WN RGLGN RA+ AL +LV  + P+++F+METK D  K E I+V L+++F F+VPSLG +GGL +LWN + ++TI ++S  HID  +     + W
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW

Query:  RFTGFYG-----------TLKQTKDTFLGTCW-------------------------------------------------------------KGLRLVL
        RFTGFYG            L    D+ +   W                                                             K L   L
Subjt:  RFTGFYG-----------TLKQTKDTFLGTCW-------------------------------------------------------------KGLRLVL

Query:  KVLGW----PIVAQWEKQSALS--------------RTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFF------------------------
            W    P+   +  QS+ S              +  +  +  +FEE W    + +KII + W   +  G+  F                        
Subjt:  KVLGW----PIVAQWEKQSALS--------------RTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFF------------------------

Query:  ----SAKIL--------------------------------------RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATN
              K L                                      RSR  WL++GD+NTK+FH  A+Q++++N I GL +++  W   E++I E+A  
Subjt:  ----SAKIL--------------------------------------RSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATN

Query:  YFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVI
        YF  +F++S P    I + +  +   +SE+  Q L QPYT  E+ AA+  + PSKAP  DG  + F+Q+YW +VG    N  L ILN+   +  IN T +
Subjt:  YFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVI

Query:  ALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK
        +LIPK KNP+++ ++RPISLCNV YKII+K +ANRLK  L  IIS  Q+AFVPGRLI+DNV + FE +H + +KR+
Subjt:  ALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK

A0A2N9I611 Uncharacterized protein; e-value 3.9e-73; identity 33.71%
Query:  ARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVSWRFTGFYG-
        +RGLGNPRA+R LR L   + P+++F+ ETK +  + E I+VGL+Y+  F VPS G +GGL +LW+ +  ++I SY+  HID  IK +   WRFTGFYG 
Subjt:  ARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVSWRFTGFYG-

Query:  -----------TLKQTK-------------------DTFLGTCWKGLRLVLKVLG-----------WP----------IVAQWEKQSALSRTI-------
                    LK  K                   D   G  W   R V    G           W           I   +    A+S  +       
Subjt:  -----------TLKQTK-------------------DTFLGTCWKGLRLVLKVLG-----------WP----------IVAQWEKQSALSRTI-------

Query:  -KATKLLRFEEGWLKLKDTKKIIAEEWKTTSG----------------------SGAQFFSAKI------------------------------------
         +A +L RFE+ W K ++ +K+I + W+ T                        S A F + KI                                    
Subjt:  -KATKLLRFEEGWLKLKDTKKIIAEEWKTTSG----------------------SGAQFFSAKI------------------------------------

Query:  -------LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPY
                R+R  WLK GDRNTK+FH+KA+Q++K+N + GL+ + G W ++ +K+ E+A  YF  +F+S+  N   +    E I   +++   Q L   +
Subjt:  -------LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPY

Query:  TKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKA
         + E++ A+  +  SKAP  DG  A+FYQ+YW  VG    +  L +LN+   +  IN T I LIPK KNPK M EFRPISLCNV YKI+AK +ANRLK  
Subjt:  TKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKA

Query:  LDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK
        L  +IS +Q+AFVP RLI++N+ I +E +H + S+R+
Subjt:  LDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRK

A0A5B6V0I7 Reverse transcriptase; e-value 7.8e-74; identity 38.57%
Query:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW
        MK L WN RGLGNPRA+R LRH +  +NPQ++F METK +  K E+++    Y+    V SLG+ GGL + W  E  +T+ S+S  HIDV+I +D     
Subjt:  MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLI-KDKPVSW

Query:  RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFFSAKILRSRDDWLKS
        R TGFYG+                         P +   EK   L R ++                      + W+               R+R +WLK 
Subjt:  RFTGFYGTLKQTKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFFSAKILRSRDDWLKS

Query:  GDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFS-SSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSK
        GDRNT +FH +A+Q+K+RN I  L  E G    E +++ E+A +YF +LFS  S PN     +I+  I   ++E+    L   +TK EI  A+  + P+K
Subjt:  GDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFS-SSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSK

Query:  APRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGR
        AP  DG  A FYQ+ W ++G++  N CL  LNN  D++ INKT I L+PK  +P  + +FRPISLCNV YK+IAK++ANRL+  +   I   Q+AFVPGR
Subjt:  APRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGR

Query:  LISDNVTIGFECIHAITSKR
        LISDNV + +E +H + +K+
Subjt:  LISDNVTIGFECIHAITSKR

SwissProt top hits (e-value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e-value 7.9e-15; identity 25.35%
Query:  QQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECIS-GKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQ
        +++++N+I  + ++ G+   +  +I      Y+  L+++   N + +   ++  +  ++++++ + L++P T +EI A + SL   K+P  DG  A FYQ
Subjt:  QQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECIS-GKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQ

Query:  EYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKT-KNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFEC
         Y E +    +     I           +  I LIPK  ++  + + FRPISL N+  KI+ K +ANR+++ +  +I   Q  F+PG     N+      
Subjt:  EYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKT-KNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFEC

Query:  IHAITSKRKENRV
        I  I   + +N V
Subjt:  IHAITSKRKENRV

P08548 LINE-1 reverse transcriptase homolog; e-value 1.2e-15; identity 27.62%
Query:  DRNTKWFHAKASQ---------QKKRNK--IAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVE-CISGKISEQQKQELDQPYTKTEIE
        +++  WF  K ++         +KKR K  I+ + + + E   +  +I ++   Y+ +L+S    N + I + +E C   ++S+++ + L++P + +EI 
Subjt:  DRNTKWFHAKASQ---------QKKRNK--IAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVE-CISGKISEQQKQELDQPYTKTEIE

Query:  AAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKT-KNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSII
        + +++L   K+P  DG  + FYQ + E +    +N    I           +  I LIPK  K+P R + +RPISL N+  KI+ K + NR+++ +  II
Subjt:  AAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKT-KNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSII

Query:  SPHQAAFVPG
           Q  F+PG
Subjt:  SPHQAAFVPG

P11369 LINE-1 retrotransposable element ORF2 protein; e-value 7.6e-18; identity 28.39%
Query:  DRNTKWFHAKASQQKK---------RNKIA--GLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECIS-GKISEQQKQELDQPYTKTEIE
        ++   WF  K ++  K         R+KI    + +E G+   + ++I     +++ RL+S+   N   + K ++     K+++ Q   L+ P +  EIE
Subjt:  DRNTKWFHAKASQQKK---------RNKIA--GLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECIS-GKISEQQKQELDQPYTKTEIE

Query:  AAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPK-TKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSII
        A + SL   K+P  DG  A FYQ + E +         +I           +  I LIPK  K+P +++ FRPISL N+  KI+ K +ANR+++ + +II
Subjt:  AAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPK-TKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSII

Query:  SPHQAAFVPGRLISDNVTIGFECIHAITSKRKENRV
         P Q  F+PG     N+      IH I   + +N +
Subjt:  SPHQAAFVPGRLISDNVTIGFECIHAITSKRKENRV

P14381 Transposon TX1 uncharacterized 149 kDa protein; e-value 4.5e-26; identity 30.49%
Query:  LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEA
        +RSR   L   DR +++F+A   ++  R +I  L +E G  +E+ + I + A +++  LFS    +P A +++ + +   +SE++K+ L+ P T  E+  
Subjt:  LRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEA

Query:  AMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISP
        A++ +  +K+P  DG    F+Q +W+ +G D      +            + V++L+PK  + + ++ +RP+SL +  YKI+AK+I+ RLK  L  +I P
Subjt:  AMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISP

Query:  HQAAFVPGRLISDNVTIGFECIH
         Q+  VPGR I DNV +  + +H
Subjt:  HQAAFVPGRLISDNVTIGFECIH

Arabidopsis top hits (e-value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e-value 9.2e-19; identity 29.27%
Query:  IIAEEWKTTSGSGAQFFSAKILRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNP--NPQAIQKIVECI
        +  ++W   + +   F+  K   SR  WL+ GD NT++FH      + +N I  L  +    VE   ++ E+   Y+T L  S +    P ++Q+I +  
Subjt:  IIAEEWKTTSGSGAQFFSAKILRSRDDWLKSGDRNTKWFHAKASQQKKRNKIAGLLSESGEWVEEEDKISEVATNYFTRLFSSSNP--NPQAIQKIVECI

Query:  SGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNV
          + ++     L    +  EI AA+ ++  +KAP  D   A F+ E W VV D T+    +       +   N T I LIPK     ++  FRP+S C V
Subjt:  SGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILNNDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNV

Query:  SYKII
         YKII
Subjt:  SYKII


Sequences
CDS sequence
ATGAAAACTTTATGTTGGAACGCTCGAGGTTTGGGGAATCCTCGAGCGATCCGAGCGCTTCGCCACTTGGTTGGTAAAGAAAACCCCCAATTGATCTTTGTTATGGAAAC
CAAAAGTGATTATGGGAAGTGTGAGAGAATCAAGGTTGGTCTACAATATGAATTCATGTTTAGTGTTCCTAGTCTTGGCAATAATGGAGGGCTCATGATTCTTTGGAATT
CTGAAACGAAAGTTACAATCAACTCCTACTCTGAAGGCCATATCGATGTGTTGATAAAAGACAAGCCAGTTTCGTGGAGATTCACGGGTTTCTATGGAACCCTGAAACAG
ACAAAAGATACTTTTCTTGGGACTTGTTGGAAAGGCTTAAGACTTGTTTTGAAGGTCCTTGGCTGGCCCATTGTTGCTCAATGGGAGAAGCAATCTGCTCTTAGCAGAAC
TATTAAGGCCACAAAGCTGTTGAGATTTGAAGAAGGTTGGCTTAAGCTAAAAGATACTAAAAAGATCATAGCAGAGGAATGGAAGACCACGTCTGGTAGTGGTGCTCAGT
TCTTTAGCGCAAAAATCTTAAGATCAAGAGATGATTGGTTGAAAAGTGGGGACCGGAATACAAAATGGTTCCACGCAAAAGCTTCTCAACAAAAGAAAAGAAATAAAATT
GCGGGCTTGCTCTCGGAATCAGGGGAGTGGGTGGAGGAAGAAGATAAAATCAGCGAGGTGGCCACAAACTACTTTACAAGGCTCTTTAGTTCATCAAATCCCAACCCACA
AGCAATCCAGAAAATAGTAGAATGCATCTCAGGCAAAATCTCAGAACAACAAAAGCAAGAGCTGGATCAGCCCTACACAAAAACCGAAATTGAAGCTGCTATGAAAAGCC
TAAGCCCAAGCAAAGCGCCAAGAAGGGATGGTACGCATGCATCTTTCTACCAAGAATATTGGGAGGTAGTAGGTGATGATACAGTTAATACTTGCCTTCAAATCTTAAAT
AATGATGCGGATATGGCCCCTATAAACAAGACTGTGATTGCGCTCATTCCTAAGACAAAGAATCCAAAACGTATGCAAGAGTTTAGACCGATTAGCTTGTGCAATGTAAG
CTACAAAATCATTGCCAAGTCAATAGCAAATAGGCTCAAGAAGGCTTTGGATTCGATAATATCACCCCATCAAGCGGCTTTTGTTCCTGGAAGACTAATATCGGACAATG
TTACTATCGGTTTCGAGTGCATTCATGCGATTACATCAAAGAGAAAGGAAAACAGGGTCAAGTGGCTATAA
mRNA sequence
ATGAAAACTTTATGTTGGAACGCTCGAGGTTTGGGGAATCCTCGAGCGATCCGAGCGCTTCGCCACTTGGTTGGTAAAGAAAACCCCCAATTGATCTTTGTTATGGAAAC
CAAAAGTGATTATGGGAAGTGTGAGAGAATCAAGGTTGGTCTACAATATGAATTCATGTTTAGTGTTCCTAGTCTTGGCAATAATGGAGGGCTCATGATTCTTTGGAATT
CTGAAACGAAAGTTACAATCAACTCCTACTCTGAAGGCCATATCGATGTGTTGATAAAAGACAAGCCAGTTTCGTGGAGATTCACGGGTTTCTATGGAACCCTGAAACAG
ACAAAAGATACTTTTCTTGGGACTTGTTGGAAAGGCTTAAGACTTGTTTTGAAGGTCCTTGGCTGGCCCATTGTTGCTCAATGGGAGAAGCAATCTGCTCTTAGCAGAAC
TATTAAGGCCACAAAGCTGTTGAGATTTGAAGAAGGTTGGCTTAAGCTAAAAGATACTAAAAAGATCATAGCAGAGGAATGGAAGACCACGTCTGGTAGTGGTGCTCAGT
TCTTTAGCGCAAAAATCTTAAGATCAAGAGATGATTGGTTGAAAAGTGGGGACCGGAATACAAAATGGTTCCACGCAAAAGCTTCTCAACAAAAGAAAAGAAATAAAATT
GCGGGCTTGCTCTCGGAATCAGGGGAGTGGGTGGAGGAAGAAGATAAAATCAGCGAGGTGGCCACAAACTACTTTACAAGGCTCTTTAGTTCATCAAATCCCAACCCACA
AGCAATCCAGAAAATAGTAGAATGCATCTCAGGCAAAATCTCAGAACAACAAAAGCAAGAGCTGGATCAGCCCTACACAAAAACCGAAATTGAAGCTGCTATGAAAAGCC
TAAGCCCAAGCAAAGCGCCAAGAAGGGATGGTACGCATGCATCTTTCTACCAAGAATATTGGGAGGTAGTAGGTGATGATACAGTTAATACTTGCCTTCAAATCTTAAAT
AATGATGCGGATATGGCCCCTATAAACAAGACTGTGATTGCGCTCATTCCTAAGACAAAGAATCCAAAACGTATGCAAGAGTTTAGACCGATTAGCTTGTGCAATGTAAG
CTACAAAATCATTGCCAAGTCAATAGCAAATAGGCTCAAGAAGGCTTTGGATTCGATAATATCACCCCATCAAGCGGCTTTTGTTCCTGGAAGACTAATATCGGACAATG
TTACTATCGGTTTCGAGTGCATTCATGCGATTACATCAAAGAGAAAGGAAAACAGGGTCAAGTGGCTATAA
Protein sequence
MKTLCWNARGLGNPRAIRALRHLVGKENPQLIFVMETKSDYGKCERIKVGLQYEFMFSVPSLGNNGGLMILWNSETKVTINSYSEGHIDVLIKDKPVSWRFTGFYGTLKQ
TKDTFLGTCWKGLRLVLKVLGWPIVAQWEKQSALSRTIKATKLLRFEEGWLKLKDTKKIIAEEWKTTSGSGAQFFSAKILRSRDDWLKSGDRNTKWFHAKASQQKKRNKI
AGLLSESGEWVEEEDKISEVATNYFTRLFSSSNPNPQAIQKIVECISGKISEQQKQELDQPYTKTEIEAAMKSLSPSKAPRRDGTHASFYQEYWEVVGDDTVNTCLQILN
NDADMAPINKTVIALIPKTKNPKRMQEFRPISLCNVSYKIIAKSIANRLKKALDSIISPHQAAFVPGRLISDNVTIGFECIHAITSKRKENRVKWL
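
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code and a CDS that starts in frame. `translate` is a hypothetical helper written for illustration, not a CuGenDB tool; line breaks in a pasted sequence are ignored.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above
print(translate("ATGAAAACTTTATGTTGGAACGCTCGAGGT"))  # MKTLCWNARG
```

The 1281-base CDS listed above would yield 426 residues plus a terminal stop codon under these assumptions, which matches the length of the protein sequence shown.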