; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026186 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026186
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:31769995..31771399
RNA-Seq ExpressionLag0026186
SyntenyLag0026186
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]3.8e-5737.22Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWS------------RP
        W +R     ++EERLDR + N  FF  +  +   HL    SDH P+L+EA V   ++ A+    F FEE+WT+  E   ++E+ W               
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWS------------RP

Query:  CRPQGR--------DQEKLLNHGVLELRIVPMSCLHGKNPKRETM---KASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL
        C  + +        +  K L H   EL     + L G+    + +   K  E + +LL ++EI W Q+SRV W++ GD+N+ +FH RA+ R +RN++ G+
Subjt:  CRPQGR--------DQEKLLNHGVLELRIVPMSCLHGKNPKRETM---KASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL

Query:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT
         D +  W T+E  +G L  +YF+ LF+S+G Q  E +   L  ++P I   +N  L + +  EEL   L QM P KAPG DG+ ALF+Q++W+++G +V 
Subjt:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT

Query:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
          CL  LN +G V   N T+I LIPK K    +S++RPISLC  VYK+I+K IANRLK V
Subjt:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

XP_023899730.1 uncharacterized protein LOC112011598 [Quercus suber]2.6e-5837.91Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL
        W+ R   ++I  ERLDR + N  +   FP   +RHL+   SDH P+L+  ++     R R KP FRFE +W    EC GI+ + W          +E+L 
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL

Query:  NHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQ
            + +R   +  +         +   ++L  L  +EE  W Q+SRV+W++ GD+N+K+FH  +T R+RRN ++GL D +G W  +E  +  L++EY+ 
Subjt:  NHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQ

Query:  ALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITL
         LF S+   N  +L   L  +QP +  ++  +L+RP++ EE+  A+K M+PLKA G DG+  LFYQ +W  IG +V+   L+ LN +  +  IN T ITL
Subjt:  ALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITL

Query:  IPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK
        IPK ++ + +SD+RPISLCN++YK++SK IA+RLK
Subjt:  IPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK

XP_024178556.2 uncharacterized protein LOC112184522 [Rosa chinensis]6.5e-5736.78Show/hide
Query:  EERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW-----SRPCRPQGRDQEKL------L
        +ERLDR   N  +  ++PF     L    SDHCPLLIE        R R    FRFEE+W Q  +C  +V++GW       P    GR  ++        
Subjt:  EERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW-----SRPCRPQGRDQEKL------L

Query:  NHGVLELRIVPMSCLHGK---------NPKRETMKASED--LENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEG
        + GV + R   M  + GK          P+   ++ S    L  LL   E YW Q+SRV+W++ GDRN+ +FH+RA++RR RN+++GLL+ +G+WT D+G
Subjt:  NHGVLELRIVPMSCLHGK---------NPKRETMKASED--LENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEG

Query:  EMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGY
        E+ +++++Y++ +F + G  +  + +  L  + P +  ++N  L+ PY + E+  AL QM P K+PG DG+   F+Q+FW+V+ K+V       L     
Subjt:  EMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGY

Query:  VDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
            N T +TLIPK K+ + +SD RPI+LCN+VYK+ SK +ANRLK +
Subjt:  VDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]9.0e-5937.5Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW------------SRP
        W  RR     +  RLDR +    + + FP   + HLD   SDH P+L+  D    +    G+P FRFE +W + D C  +++  W            +  
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW------------SRP

Query:  CRP-QGRDQ----------EKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL
          P QG  Q             L   + EL++V  S  +  +P R  +  +E +E L  +EE  W Q+SR  W++ GDRN+ +FH RAT R +RN + GL
Subjt:  CRP-QGRDQ----------EKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL

Query:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT
         D  G+W   E ++G++V +YFQ +FTS+   N    +E L  +QP I  +++ SLSR Y  EE+L ALKQM+PL APG DG+  +FY+ +W+++G++V 
Subjt:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT

Query:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
         + L+ LN     + +NST I LIPK K+ + ++++RPISLCN+VYKLI+K + NRLK +
Subjt:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.7e-5736.77Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQ----
        W  RR ++ +++ERLDR+L NS +  MFP + V H     SDH PL ++ +      R R +  FRFE +W    EC  I+E+ W R   P   DQ    
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQ----

Query:  ------------EKLLNHGVLELRIVP--MSCLH----GKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLL
                    +    H    L      + CL     G++   E  +A  +++  L  +E+ W Q+SRV+W+R GD NS++FH +A+ RRR+N +  L 
Subjt:  ------------EKLLNHGVLELRIVP--MSCLH----GKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLL

Query:  DRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTF
        D  G W   + +M  L+ EYFQ LFT+    +   +++ L+ ++  +  ++N+ L +PY+ EE+  ALKQM P KAPG DG+  LF+Q++W ++G  +T 
Subjt:  DRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTF

Query:  LCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
          L+ LN   +   +N T ITLIPK      ++D+RPISLCN++YK++SK IANRLK V
Subjt:  LCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

TrEMBL top hitse value%identityAlignment
A0A2N9EVW3 Uncharacterized protein2.0e-5939.88Show/hide
Query:  RLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCR--PQGRDQEKLLNHGVLELRIVP
        RLDR++  + + + F    V HL+   SDH P+ +    +    R R KP FRFE++W     C   + K W    R  P  +  EK+   G   L    
Subjt:  RLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCR--PQGRDQEKLLNHGVLELRIVP

Query:  MSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNS
        +   HG       +   +++  LL +EE  W Q+SR  W++ GD N+K+FH RA+HR+RRN +  L   DG+  TDE  +G   +EY+QALFT+   +  
Subjt:  MSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNS

Query:  ESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITLIPKCKDAQLIS
        E ++  L  IQPC+ +++N+SL+ P+ EEE+  A+KQM PLKAPG DG+  +FYQ +W+V+GK++T   L  L     +  +N T +TLIPK K+ + ++
Subjt:  ESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITLIPKCKDAQLIS

Query:  DYRPISLCNIVYKLISKAIANRLKGV
        +YRPISLCN++YKLISK +ANRLK +
Subjt:  DYRPISLCNIVYKLISKAIANRLKGV

A0A2N9G219 RNase H domain-containing protein2.0e-5937.5Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW------SRPCRPQGR
        W   R        RLDR++  + + + F    V HL+   SDH P+ +    +   +R R K  FRFE++W     C   + K W      S   +  G+
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW------SRPCRPQGR

Query:  DQE-----------------KLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL
         Q                  KLL     +LR   M  +HG       +   +++ +LL +EE  W Q+SR  W++ GD N+K+FH RA+HR+RRN +  L
Subjt:  DQE-----------------KLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL

Query:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT
           +G+  TDE  +G   +EY+QALFT+   Q  E  +  L  IQPC+ +++N+SL+ P+ EEE+L A+KQM PLKAPG DG+  +FYQ +W+V+GK++T
Subjt:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT

Query:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
           L  L     +  +N T +TLIPK K  + +++YRPISLCN++YKLISK +ANRLK +
Subjt:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

A0A2N9GC56 Reverse transcriptase domain-containing protein1.1e-5937.79Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW-------SRPCRPQG
        W  RR     ++ERLDR++ N  +  ++P +   ++    SDHCP+ +E + +    R +    FRFE +W    +C+  VE  W       +R  + + 
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGW-------SRPCRPQG

Query:  RDQEKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQ
        +  E L      EL    + CL             ++L  LLG+EEI W Q+SRV+W+R GDRN+K+FH RA  R+RRN MEGL D +G+W T + ++ Q
Subjt:  RDQEKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQ

Query:  LVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLI
        + V+YF ++F S  H     +   L  + P +  ++N++L + +   E+ +AL QM P KAPG DG+  +FYQ FW  IG  VT   L  LN    +  I
Subjt:  LVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLI

Query:  NSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV
        N T + LIPK     ++++YRPISLCN++YKL+SK +ANRLKGV
Subjt:  NSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLKGV

A0A2N9HFT1 Uncharacterized protein2.7e-6141.49Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL
        W   R     + E+LDR + ++A+  +FP   V HLD+  SDH PL +   V  N +R   KP FRFEE+W     C   +   W +P + + R+     
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL

Query:  NHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQ
             EL+I   + + G++    T   +E +  LL +EE  W Q+SR +W++ GDRN+ +FH RATHR+RRN + GL D DG+W  D  ++  L++ YFQ
Subjt:  NHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQ

Query:  ALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITL
         +F S+   N  S+   L  I   I   +NK+LSRPY   E+ +AL+QM+PL APG DGL  +FYQ  W++IG++V    L++L     V  IN T ITL
Subjt:  ALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITL

Query:  IPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK
        IPK K+ + +S+YRPISLCN++YK+ISK IAN LK
Subjt:  IPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK

A0A2N9I921 Reverse transcriptase domain-containing protein8.0e-6139.66Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRP------------
        W   R     + ERLDR + ++A+   FP   V HLD+  SDH PL +      N+  A  KP FRFEE+W     C   +   W  P            
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRP------------

Query:  ---CRPQGRD--------QEKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL
           CR Q ++          K L     EL++     + G++P       +E +  LL +EE  W Q+SR +W++ GDRN+ +FH RATHR+RRN + GL
Subjt:  ---CRPQGRD--------QEKLLNHGVLELRIVPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGL

Query:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT
         D DG+W TD  ++  +++ YFQ +F S+   N  S+   L  I   I   +N++LSRPY   E+ +AL+QM+PL APG DGL  +F+Q  W++IG +V 
Subjt:  LDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVT

Query:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK
           L++LN    V  IN T ITLIPK K+ + +S+YRPISLCN++YK+ISK IANRLK
Subjt:  FLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKAIANRLK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.2e-1426.98Show/hide
Query:  FEEVWTQFDE-CRG--IVEKGWSRPCRPQGRDQEKLLNHGVLELRIVPMSCLHGKNPKR-ETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFH
        ++ +W  F   CRG  I    +    R Q R +   L   + EL        H K  +R E  K   +L+ +  ++ +    +SR  +    ++  +   
Subjt:  FEEVWTQFDE-CRG--IVEKGWSRPCRPQGRDQEKLLNHGVLELRIVPMSCLHGKNPKR-ETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFH

Query:  QRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFL-THIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLL
        +    +R +NQ++ + +  G  TTD  E+   + EY++ L+ +N  +N E +  FL T+  P + ++  +SL+RP    E+++ +  +   K+PG DG  
Subjt:  QRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFL-THIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLL

Query:  ALFYQRFWNVIGKEVTFL--CLNNLNDKGYV-DLINSTVITLIPK-CKDAQLISDYRPISLCNIVYKLISKAIANRLK
        A FYQR+   +   V FL     ++  +G + +      I LIPK  +D     ++RPISL NI  K+++K +ANR++
Subjt:  ALFYQRFWNVIGKEVTFL--CLNNLNDKGYV-DLINSTVITLIPK-CKDAQLISDYRPISLCNIVYKLISKAIANRLK

P08548 LINE-1 reverse transcriptase homolog3.3e-1126.75Show/hide
Query:  NPK----RETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESL
        NPK    +E  K   +L  +  +  I    +S+  +    ++  K        +R ++ +  + + + + TTD  E+ +++ EY++ L+ S+ ++N + +
Subjt:  NPK----RETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESL

Query:  QEFL--THIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLN---NLNDKGYV-DLINSTVITLIPK-CKDA
         ++L   H+    +++V + L+RP    E+ S ++ +   K+PG DG  + FYQ F     +E+  + LN   N+  +G + +      ITLIPK  KD 
Subjt:  QEFL--THIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLN---NLNDKGYV-DLINSTVITLIPK-CKDA

Query:  QLISDYRPISLCNIVYKLISKAIANRLK
            +YRPISL NI  K+++K + NR++
Subjt:  QLISDYRPISLCNIVYKLISKAIANRLK

P11369 LINE-1 retrotransposable element ORF2 protein3.3e-1134.78Show/hide
Query:  GKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQ-PCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLC
        G  TTD  E+   +  +++ L+ S   +N + + +FL   Q P + +D    L+ P   +E+ + +  +   K+PG DG  A FYQ F     KE     
Subjt:  GKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQ-PCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLC

Query:  LNNLNDKGYVD--LINS---TVITLIPK-CKDAQLISDYRPISLCNIVYKLISKAIANRLK
        L+ L  K  V+  L NS     ITLIPK  KD   I ++RPISL NI  K+++K +ANR++
Subjt:  LNNLNDKGYVD--LINS---TVITLIPK-CKDAQLISDYRPISLCNIVYKLISKAIANRLK

P14381 Transposon TX1 uncharacterized 149 kDa protein4.1e-1430.09Show/hide
Query:  ETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQ
        E ++  E L N+   +      +SR++ +   DR S++F+     +  R Q+  L   DG    D   +      ++Q LF+ +    S    E L    
Subjt:  ETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQ

Query:  PCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDL-INSTVITLIPKCKDAQLISDYRPISLCNI
        P +     + L  P   +EL  AL+ M   K+PG DGL   F+Q FW+ +G +     L     KG + L     V++L+PK  D +LI ++RP+SL + 
Subjt:  PCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDL-INSTVITLIPKCKDAQLISDYRPISLCNI

Query:  VYKLISKAIANRLKGV
         YK+++KAI+ RLK V
Subjt:  VYKLISKAIANRLKGV

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.4e-1924.72Show/hide
Query:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL
        W   +D   I+  +LDR + N  +F  FP          +SDH P +I   +L N  + R K CFR+    +        +   W     P G     L 
Subjt:  WYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLL

Query:  NHGVLELRIVPMSCLHG-KNPKRETMKASEDLENLLGE--------------------------EEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQ
         H     +   +    G  N + +T +A + LE++  +                           E ++ Q+SR++W++ GD N+++FH+     + +N 
Subjt:  NHGVLELRIVPMSCLHG-KNPKRETMKASEDLENLLGE--------------------------EEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQ

Query:  MEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKS-LSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVI
        ++ L   D     +  ++ +++V Y+  L  S+    +    + +  I P    D   S LS    ++E+ +A+  M   KAPG D   A F+   W V+
Subjt:  MEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLTHIQPCIERDVNKS-LSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVI

Query:  GKEVTFLCLNNLNDKGY-VDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLIS
         K+ T   +      G+ +   N+T ITLIPK      +S +RP+S C +VYK+I+
Subjt:  GKEVTFLCLNNLNDKGY-VDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTACAAGAGGAGAGATAAGCAGGTGATTCTTGAAGAACGGTTAGATCGTTATTTGTGTAACTCTGCCTTTTTTGTCATGTTTCCTTTTGTTGAGGTAAGGCATCT
AGACTTTTGCTTATCAGATCATTGTCCCTTGTTGATAGAGGCTGATGTTTTAGCCAACAAATCGAGAGCTCGGGGAAAGCCTTGTTTTCGTTTTGAGGAGGTGTGGACTC
AGTTTGATGAGTGTAGAGGTATTGTTGAAAAGGGATGGTCTCGTCCCTGTCGGCCCCAAGGTAGAGACCAGGAAAAATTGCTGAATCATGGTGTTCTAGAGTTAAGAATT
GTGCCTATGAGTTGTCTGCATGGGAAAAATCCAAAAAGGGAGACTATGAAAGCCAGTGAGGATTTGGAGAACTTGTTGGGAGAGGAAGAGATCTATTGGCACCAACAATC
TAGGGTGGAATGGATGCGGTGGGGAGATAGGAACTCAAAGTGGTTTCACCAGAGAGCAACCCATCGGAGGAGGAGGAATCAGATGGAAGGGTTGTTGGACCGAGATGGAA
AATGGACGACAGATGAAGGGGAAATGGGTCAGTTGGTGGTCGAATATTTTCAGGCTCTGTTTACTTCTAATGGCCATCAGAACAGTGAAAGTTTGCAGGAGTTTTTGACC
CATATTCAGCCGTGTATCGAGAGGGATGTGAATAAATCTCTGAGTCGTCCATATTTAGAGGAAGAGTTGTTGAGTGCTCTGAAGCAAATGAGCCCTTTAAAAGCCCCTGG
GGAGGATGGTCTTCTAGCTTTGTTCTATCAAAGATTCTGGAATGTGATAGGTAAGGAGGTTACTTTTCTTTGTCTAAATAACCTTAATGATAAGGGCTATGTTGATCTGA
TTAATAGCACTGTAATTACTCTAATACCAAAATGTAAGGATGCACAACTGATTTCTGATTATAGGCCTATTAGCTTGTGTAATATTGTGTATAAGTTGATTTCGAAAGCA
ATTGCTAATAGGCTAAAAGGGGTTTGTGTCCTATTTGTTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGTACAAGAGGAGAGATAAGCAGGTGATTCTTGAAGAACGGTTAGATCGTTATTTGTGTAACTCTGCCTTTTTTGTCATGTTTCCTTTTGTTGAGGTAAGGCATCT
AGACTTTTGCTTATCAGATCATTGTCCCTTGTTGATAGAGGCTGATGTTTTAGCCAACAAATCGAGAGCTCGGGGAAAGCCTTGTTTTCGTTTTGAGGAGGTGTGGACTC
AGTTTGATGAGTGTAGAGGTATTGTTGAAAAGGGATGGTCTCGTCCCTGTCGGCCCCAAGGTAGAGACCAGGAAAAATTGCTGAATCATGGTGTTCTAGAGTTAAGAATT
GTGCCTATGAGTTGTCTGCATGGGAAAAATCCAAAAAGGGAGACTATGAAAGCCAGTGAGGATTTGGAGAACTTGTTGGGAGAGGAAGAGATCTATTGGCACCAACAATC
TAGGGTGGAATGGATGCGGTGGGGAGATAGGAACTCAAAGTGGTTTCACCAGAGAGCAACCCATCGGAGGAGGAGGAATCAGATGGAAGGGTTGTTGGACCGAGATGGAA
AATGGACGACAGATGAAGGGGAAATGGGTCAGTTGGTGGTCGAATATTTTCAGGCTCTGTTTACTTCTAATGGCCATCAGAACAGTGAAAGTTTGCAGGAGTTTTTGACC
CATATTCAGCCGTGTATCGAGAGGGATGTGAATAAATCTCTGAGTCGTCCATATTTAGAGGAAGAGTTGTTGAGTGCTCTGAAGCAAATGAGCCCTTTAAAAGCCCCTGG
GGAGGATGGTCTTCTAGCTTTGTTCTATCAAAGATTCTGGAATGTGATAGGTAAGGAGGTTACTTTTCTTTGTCTAAATAACCTTAATGATAAGGGCTATGTTGATCTGA
TTAATAGCACTGTAATTACTCTAATACCAAAATGTAAGGATGCACAACTGATTTCTGATTATAGGCCTATTAGCTTGTGTAATATTGTGTATAAGTTGATTTCGAAAGCA
ATTGCTAATAGGCTAAAAGGGGTTTGTGTCCTATTTGTTCTTTAG
Protein sequenceShow/hide protein sequence
MWYKRRDKQVILEERLDRYLCNSAFFVMFPFVEVRHLDFCLSDHCPLLIEADVLANKSRARGKPCFRFEEVWTQFDECRGIVEKGWSRPCRPQGRDQEKLLNHGVLELRI
VPMSCLHGKNPKRETMKASEDLENLLGEEEIYWHQQSRVEWMRWGDRNSKWFHQRATHRRRRNQMEGLLDRDGKWTTDEGEMGQLVVEYFQALFTSNGHQNSESLQEFLT
HIQPCIERDVNKSLSRPYLEEELLSALKQMSPLKAPGEDGLLALFYQRFWNVIGKEVTFLCLNNLNDKGYVDLINSTVITLIPKCKDAQLISDYRPISLCNIVYKLISKA
IANRLKGVCVLFVL