CuGenDBv2

Lag0036538 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036538
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:48150674..48153389
RNA-Seq Expression: Lag0036538
Synteny: Lag0036538
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
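The genome location listed above can be parsed to get the span of the gene model. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` is a hypothetical helper and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr3:48150674..48153389")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 2716 bp, which is longer than the CDS listed under Sequences and is consistent with a gene model that contains non-coding sequence such as introns.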


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 9.1e-31; identity: 24.01%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M   +++I+KFDG GDF LW  +I AILG QK L AL DP +LPATL+K ++E +   AY T+ +N++++VLRQVI+  T    W+KL  LYE KD+ NK
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK---------
        M+++EK F+                                                EVK  LKYGR +IT +++I+ +++KELEL    K         
Subjt:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK---------

Query:  -------------------------------------------ETFEGLFVKGKPRSK-----------------------------DNKHQTDDK---G
                                                   E FE   V      K                             D K Q  D    G
Subjt:  -------------------------------------------ETFEGLFVKGKPRSK-----------------------------DNKHQTDDK---G

Query:  KSKDVEML---------------------------------------------QSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGIL---------
         ++D E++                                             + AL+       E +LWH RLSHIS KGL  L KQG++         
Subjt:  KSKDVEML---------------------------------------------QSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGIL---------

Query:  ----------------------------------------------------------------------------------------SQEEKWSGKPPN
                                                                                                + EE+W G  P 
Subjt:  ----------------------------------------------------------------------------------------SQEEKWSGKPPN

Query:  LHHLKDRATS----VSQEKGE-----------------------------------------------------SLEEEET-------------------
        L +LK    +    + Q K E                                                     +L E  T                   
Subjt:  LHHLKDRATS----VSQEKGE-----------------------------------------------------SLEEEET-------------------

Query:  --------------DDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCI--SLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYK
                      ++  E +++L NYSL RDR RRTI PPSRF+ ADCI  S ++ + D     +  S+E+A             P G   I  KW++K
Subjt:  --------------DDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCI--SLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYK

Query:  VKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
         K       K +FKARLVAKG+ QKEG+DY E+F PVVK TSI +LL +V  E+L L+
Subjt:  VKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD

TXG54059.1 hypothetical protein EZV62_019315 [Acer yangbiense] (e-value: 3.1e-31; identity: 32.95%)
Query:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR
        ++DI KFDG GDF +W+ K+KA+L QQK L A+  P +LP +L+ + K+ M   A  TI LNLS++VLR++ D  T   +W+KL  LY TK + NK+Y++
Subjt:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR

Query:  EKF--------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGK
        E+                                                   F +VK A+KYGR S++ +  ISA+++KELEL   KK+  E LFV+G+
Subjt:  EKF--------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGK

Query:  PRSKDNKHQTDDKGKSKDVEMLQSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGILSQEE
           K++ + +++K K +       + + S+    +  LWH RL H+S +G+  L+K+ +L++++
Subjt:  PRSKDNKHQTDDKGKSKDVEMLQSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGILSQEE

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida] (e-value: 2.6e-30; identity: 40.19%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M   RY+I+KF+ K DF+LWKAKIK +L +QK LLA++DP + P  L K +KE +   AY TI LN+ +SVLRQ++D  T   +W KLND+Y  KD+ NK
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKKE--TFEGLF
         ++RE+FFT                                                +VK ALKYGR  ITT AIISA+  KELEL   KK+    EG F
Subjt:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKKE--TFEGLF

Query:  VKGKPRSKDNKHQT
         KG  ++    ++T
Subjt:  VKGKPRSKDNKHQT

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida] (e-value: 1.3e-29; identity: 40.57%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M   +Y+I+KFD K DF+L KAKIKA+LGQQK LLA++DP++ P TLS+ +KE +   AY TI LN+++SVLRQ++D  T   +W KLN++Y  KD  NK
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVK
         ++RE+FFT                                                +VK ALKY R  IT DAIISA+R KELEL    +E  +   VK
Subjt:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVK

Query:  GKPRSKDNKHQT
        G+ R+    ++T
Subjt:  GKPRSKDNKHQT

XP_038890043.1 uncharacterized protein LOC120079747 [Benincasa hispida] (e-value: 4.8e-32; identity: 41.06%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M   RY+I+KFD K DF+LWK KIK +LGQQK LLA++DP + P TL++ +KE + + A  TI LN++++VLRQVI+  T   +W KLN++Y  KD+ NK
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK--ETFEGLF
         ++RE+FFT                                                + K A+KYGR  ITT+AIISA+R +ELEL  +KK  +  EGLF
Subjt:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK--ETFEGLF

Query:  VKGKPRS
         KGK ++
Subjt:  VKGKPRS

TrEMBL top hits (e-value, % identity, alignment)
A0A438CPX9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.1e-29; identity: 22.73%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M  A++D++KF GK DF LW+ K++A+L QQ    AL     LP+T+ +  K  +   A+  I L+L +++LR+V  A +  ++W KL  LY TK + N+
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKF------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELL-AAKKETFEGLFV
        ++ + K                                                 +T +K A+ YGR S+T D + S +  +EL+    +K+E+ EGL +
Subjt:  MYMREKF------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELL-AAKKETFEGLFV

Query:  KGKPRSKDNKHQTD---DKGKSKDVE------------------------------MLQSAL---VVSENGMT---------------------------
        +G+   ++ K +      K K+K  +                              ++Q  L   V+S+ G+                            
Subjt:  KGKPRSKDNKHQTD---DKGKSKDVE------------------------------MLQSAL---VVSENGMT---------------------------

Query:  --------------------------ESDLWHTRLSHISIKGLQILTKQGIL------------------------------------------------
                                   + LWH RL HIS +GLQ L KQG+L                                                
Subjt:  --------------------------ESDLWHTRLSHISIKGLQILTKQGIL------------------------------------------------

Query:  ----------------SQEEKWSGKPPNLHHLK-----------------------------------------------------------DRATSVSQ
                        + +EKW+GK  +  HLK                                                            + T    
Subjt:  ----------------SQEEKWSGKPPNLHHLK-----------------------------------------------------------DRATSVSQ

Query:  EKG----------ESLEEEETDDTL--------------EESQNLRNYSLARDRQRRTIVPPSRFSEADCISLVQHVSDSLLFEESSSFEEATNGPNSRA
         +G          E+L+ E++ +T               E +Q L +Y+L RDRQ+R + PP R+ +A   +    V++ ++  E  +++EA N      
Subjt:  EKG----------ESLEEEETDDTL--------------EESQNLRNYSLARDRQRRTIVPPSRFSEADCISLVQHVSDSLLFEESSSFEEATNGPNSRA

Query:  GSRPL------------------PKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
          + +                  PK  K + SKW++K K+G  G   PR+KA LVAKG++QKEG+DY E+FSPVVK +SI LLL  V  EDL LD
Subjt:  GSRPL------------------PKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 4.4e-31; identity: 24.01%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK
        M   +++I+KFDG GDF LW  +I AILG QK L AL DP +LPATL+K ++E +   AY T+ +N++++VLRQVI+  T    W+KL  LYE KD+ NK
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNK

Query:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK---------
        M+++EK F+                                                EVK  LKYGR +IT +++I+ +++KELEL    K         
Subjt:  MYMREKFFT------------------------------------------------EVKNALKYGRVSITTDAIISAIRTKELELLAAKK---------

Query:  -------------------------------------------ETFEGLFVKGKPRSK-----------------------------DNKHQTDDK---G
                                                   E FE   V      K                             D K Q  D    G
Subjt:  -------------------------------------------ETFEGLFVKGKPRSK-----------------------------DNKHQTDDK---G

Query:  KSKDVEML---------------------------------------------QSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGIL---------
         ++D E++                                             + AL+       E +LWH RLSHIS KGL  L KQG++         
Subjt:  KSKDVEML---------------------------------------------QSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGIL---------

Query:  ----------------------------------------------------------------------------------------SQEEKWSGKPPN
                                                                                                + EE+W G  P 
Subjt:  ----------------------------------------------------------------------------------------SQEEKWSGKPPN

Query:  LHHLKDRATS----VSQEKGE-----------------------------------------------------SLEEEET-------------------
        L +LK    +    + Q K E                                                     +L E  T                   
Subjt:  LHHLKDRATS----VSQEKGE-----------------------------------------------------SLEEEET-------------------

Query:  --------------DDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCI--SLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYK
                      ++  E +++L NYSL RDR RRTI PPSRF+ ADCI  S ++ + D     +  S+E+A             P G   I  KW++K
Subjt:  --------------DDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCI--SLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYK

Query:  VKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
         K       K +FKARLVAKG+ QKEG+DY E+F PVVK TSI +LL +V  E+L L+
Subjt:  VKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD

A0A5C7HB65 gag_pre-integrs domain-containing protein (e-value: 1.5e-31; identity: 32.95%)
Query:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR
        ++DI KFDG GDF +W+ K+KA+L QQK L A+  P +LP +L+ + K+ M   A  TI LNLS++VLR++ D  T   +W+KL  LY TK + NK+Y++
Subjt:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR

Query:  EKF--------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGK
        E+                                                   F +VK A+KYGR S++ +  ISA+++KELEL   KK+  E LFV+G+
Subjt:  EKF--------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGK

Query:  PRSKDNKHQTDDKGKSKDVEMLQSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGILSQEE
           K++ + +++K K +       + + S+    +  LWH RL H+S +G+  L+K+ +L++++
Subjt:  PRSKDNKHQTDDKGKSKDVEMLQSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGILSQEE

A0A6A3BFR2 Integrase catalytic domain-containing protein (e-value: 2.7e-28; identity: 25.83%)
Query:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSN--SVLRQVIDADTPLKIWQKLNDLYETKDIH
        M  A++DI+KF GK DF LW+ K++A+L QQ  + AL  PT LP  + + ++ ++   A+  I LNL +  ++  ++ D D  L +   +   YE     
Subjt:  MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSN--SVLRQVIDADTPLKIWQKLNDLYETKDIH

Query:  NKMYMREKFFTEVKNALKYGRV-SITTDAIISAIRTKELELLAAKKETFEG--LFVKGKPRSKDNKHQTD----DKGKS----------------KDVEM
                     K+A+ YGRV +I+ D + S IR KEL+      E   G  L V+G+   +D ++  +     KGK                 K+   
Subjt:  NKMYMREKFFTEVKNALKYGRV-SITTDAIISAIRTKELELLAAKKETFEG--LFVKGKPRSKDNKHQTD----DKGKS----------------KDVEM

Query:  LQSALVV-----SENGMTESD-----------------------LWHTRLSHIS-IKGLQILTKQGILSQE-----------------------------
         +++L +     S+  + +SD                       L H R + I+  K ++   +Q  L++                              
Subjt:  LQSALVV-----SENGMTESD-----------------------LWHTRLSHIS-IKGLQILTKQGILSQE-----------------------------

Query:  -------------------EKWSGKPPNLHHLK-------------------------------------------------------------DRATSV
                           E W GKP N ++LK                                                             D+ T V
Subjt:  -------------------EKWSGKPPNLHHLK-------------------------------------------------------------DRATSV

Query:  SQEKGESLEEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCISLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPL-----------------
          + GE+ +E E +  L+      NY+L RDRQRRTI PP R+  AD IS   ++++ +   E  +F+EA    +    ++ +                 
Subjt:  SQEKGESLEEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCISLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPL-----------------

Query:  -PKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
         PK  + +  KWI+K KEG+  V K R+KA+LVAKG+TQ EGID+ E+FSPVVK  SI L+L LV Q DL L+
Subjt:  -PKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD

A0A803Q997 Uncharacterized protein (e-value: 1.6e-33; identity: 29.2%)
Query:  VARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKD-------DKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETK
        +AR+D+++FDG  DF LWK K+ AIL  QK   AL +         KD       + + +   A  +I ++L+++VLRQVI   T L IW KLN LY  +
Subjt:  VARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKD-------DKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETK

Query:  DIHNKMYMREKF------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELEL---LAAKKE
            K++++ KF                                                F E++  + Y R ++T D ++S +R+++ EL    AAK+E
Subjt:  DIHNKMYMREKF------------------------------------------------FTEVKNALKYGRVSITTDAIISAIRTKELEL---LAAKKE

Query:  --TFEGLFVKGK-PR---------SKDNKHQTDD----KGKSKDVEMLQSALVVSENGM----------------------------TESDLWHTRLSHI
          T + L V+G+ PR         S+ N    D     +   +DV  + ++    EN +                            +ES LWH R  HI
Subjt:  --TFEGLFVKGK-PR---------SKDNKHQTDD----KGKSKDVEMLQSALVVSENGM----------------------------TESDLWHTRLSHI

Query:  SIKGLQILTKQGILSQ---------EEKWSGKPPNLHHLKD--RATSVSQEKGESL-------EEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSE
          +GL  L+KQG+L           E    GK   +  L    RA SV +     L        + + DD +E    L  Y LA+DR RR I PP+++++
Subjt:  SIKGLQILTKQGILSQ---------EEKWSGKPPNLHHLKD--RATSVSQEKGESL-------EEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSE

Query:  ADCISLVQHVSDSLLFEESSSFEEATNGP---------NSRAGSRPLPK---------GYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDY
        AD I+    ++     +   ++ EA N           NS+  S    K         G K I SKWI++ KEG+A      +KARL+A+G+TQ EG+DY
Subjt:  ADCISLVQHVSDSLLFEESSSFEEATNGP---------NSRAGSRPLPK---------GYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDY

Query:  PEVFSPVVKLTSIILLLFLVVQED
         E+FSPVVK+ +I ++L L VQ D
Subjt:  PEVFSPVVKLTSIILLLFLVVQED

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 5.6e-07; identity: 23.51%)
Query:  ETKDIHNKMYMREKFFTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGKPRSKDNKHQTDDKG-----KSKDVEMLQSALVVSENGM
        E+K+  N  ++++   +E KN     R  I T+    +     ++ L   KE+ +    + K R +D+ H  + KG     +S++ E  +    +  +  
Subjt:  ETKDIHNKMYMREKFFTEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGKPRSKDNKHQTDDKG-----KSKDVEMLQSALVVSENGM

Query:  TESDLWHTRLSHISIKGLQILTKQGILSQEEKWSGKPPNLHHLKDRATSVSQEKGESLEEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCIS
        T++D            G++I+ +     + E+   KP                    +   E D++L +   L  +++  D        P+ F E     
Subjt:  TESDLWHTRLSHISIKGLQILTKQGILSQEEKWSGKPPNLHHLKDRATSVSQEKGESLEEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCIS

Query:  LVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQE
         +Q+  D   +EE+ + E   +  N+       P+    + S+W++ VK    G    R+KARLVA+G+TQK  IDY E F+PV +++S   +L LV+Q 
Subjt:  LVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQE

Query:  DL
        +L
Subjt:  DL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 5.4e-10; identity: 30.69%)
Query:  SGKPPNLHHLKDRATSVSQEKGESLEE-EETDDTLEE----------SQNLRNYSLARDRQRR------TIVPPSRFSEADCISLVQHVSDSLLFEESSS
        S  P +     D  +   ++ GE +E+ E+ D+ +EE           Q LR     R   RR       ++   R  E+    ++ H   + L +    
Subjt:  SGKPPNLHHLKDRATSVSQEKGESLEE-EETDDTLEE----------SQNLRNYSLARDRQRR------TIVPPSRFSEADCISLVQHVSDSLLFEESSS

Query:  FEEATNGPNSRAGSRPLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
          E+    N       LPKG +P+  KW++K+K+        R+KARLV KG+ QK+GID+ E+FSPVVK+TSI  +L L    DL ++
Subjt:  FEEATNGPNSRAGSRPLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 7.0e-10; identity: 27.03%)
Query:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR
        +Y++ KF+G   F  W+ +++ +L QQ     L   ++ P T+  +D   ++  A   I L+LS+ V+  +ID DT   IW +L  LY +K + NK+Y++
Subjt:  RYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMR

Query:  EKFF----TEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGKPRSKDNKHQTDDKGKSK-DVEMLQSALVVSE
        ++ +    +E  N L +  V    + +I+ +    +++   ++E    L +   P S DN   T   GK+  +++ + SAL+++E
Subjt:  EKFF----TEVKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGKPRSKDNKHQTDDKGKSK-DVEMLQSALVVSE

P92520 Uncharacterized mitochondrial protein AtMg00820 (e-value: 1.2e-04; identity: 41.18%)
Query:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQ
        P P     +  KW++K K    G +  R KARLVAKG+ Q+EGI + E +SPVV+  +I  +L +  Q
Subjt:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 3.3e-07; identity: 46.27%)
Query:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVV
        P P     +  +WI+  K    G +  R+KARLVAKGY Q+ G+DY E FSPV+K TSI ++L + V
Subjt:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 3.3e-07; identity: 46.27%)
Query:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVV
        P P     +  +WI+  K    G +  R+KARLVAKGY Q+ G+DY E FSPV+K TSI ++L + V
Subjt:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVV

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 6.3e-14; identity: 61.29%)
Query:  LPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLL
        LP   KPI  KW+YK+K    G ++ R+KARLVAKGYTQ+EGID+ E FSPV KLTS+ L+L
Subjt:  LPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value: 8.2e-06; identity: 41.18%)
Query:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQ
        P P     +  KW++K K    G +  R KARLVAKG+ Q+EGI + E +SPVV+  +I  +L +  Q
Subjt:  PLPKGYKPIASKWIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQ


Sequences
CDS sequence
ATGGTTGTAGCAAGATATGATATAAAGAAGTTTGACGGGAAGGGAGACTTTGACTTGTGGAAAGCAAAGATCAAGGCTATTCTTGGTCAACAAAAGACTTTGCTTGCATT
ATCAGACCCCACGCAACTGCCAGCAACCTTGTCTAAAGATGATAAAGAAGCCATGAATATGACTGCCTATGAAACAATTACTTTGAACCTAAGCAATAGCGTGCTAAGAC
AAGTCATTGACGCGGATACGCCTTTAAAGATCTGGCAGAAATTAAACGATCTTTATGAGACCAAAGACATACATAATAAAATGTACATGAGGGAGAAATTCTTCACAGAA
GTGAAAAACGCCCTAAAGTATGGCAGAGTATCCATCACTACTGATGCAATAATATCAGCTATAAGAACCAAGGAATTGGAGTTGTTGGCTGCAAAGAAAGAAACCTTTGA
AGGACTGTTTGTCAAAGGGAAACCCAGAAGTAAAGACAATAAACATCAGACAGATGACAAGGGGAAGAGCAAGGATGTAGAGATGCTTCAATCTGCTCTTGTGGTTTCAG
AAAATGGCATGACTGAAAGTGACCTTTGGCATACACGCCTATCCCATATTAGTATTAAAGGCCTCCAAATCTTGACTAAGCAAGGTATTCTTTCTCAAGAAGAGAAGTGG
TCTGGTAAGCCACCAAATTTGCATCACCTCAAGGACAGAGCAACATCAGTTTCACAAGAAAAAGGTGAATCACTGGAAGAAGAAGAAACTGATGATACATTAGAAGAATC
TCAAAATCTAAGGAATTATTCCTTAGCTAGGGACAGACAGAGAAGAACAATAGTCCCACCATCGAGATTCAGTGAAGCTGATTGCATTTCCCTCGTTCAACATGTGTCTG
ACTCTCTGTTATTTGAAGAATCATCTAGCTTTGAGGAAGCAACGAATGGACCTAACAGTAGAGCTGGATCGAGGCCTCTCCCTAAAGGGTACAAACCTATTGCCTCGAAG
TGGATTTACAAGGTCAAAGAAGGAGTTGCAGGAGTTATGAAGCCCAGGTTTAAAGCTCGGTTAGTAGCTAAAGGCTACACTCAGAAGGAGGGTATAGATTACCCTGAGGT
GTTCTCCCCAGTGGTGAAGTTGACCTCAATCATATTGCTACTCTTCCTTGTTGTCCAAGAGGATTTGATGCTTGACTAG
mRNA sequence
ATGGTTGTAGCAAGATATGATATAAAGAAGTTTGACGGGAAGGGAGACTTTGACTTGTGGAAAGCAAAGATCAAGGCTATTCTTGGTCAACAAAAGACTTTGCTTGCATT
ATCAGACCCCACGCAACTGCCAGCAACCTTGTCTAAAGATGATAAAGAAGCCATGAATATGACTGCCTATGAAACAATTACTTTGAACCTAAGCAATAGCGTGCTAAGAC
AAGTCATTGACGCGGATACGCCTTTAAAGATCTGGCAGAAATTAAACGATCTTTATGAGACCAAAGACATACATAATAAAATGTACATGAGGGAGAAATTCTTCACAGAA
GTGAAAAACGCCCTAAAGTATGGCAGAGTATCCATCACTACTGATGCAATAATATCAGCTATAAGAACCAAGGAATTGGAGTTGTTGGCTGCAAAGAAAGAAACCTTTGA
AGGACTGTTTGTCAAAGGGAAACCCAGAAGTAAAGACAATAAACATCAGACAGATGACAAGGGGAAGAGCAAGGATGTAGAGATGCTTCAATCTGCTCTTGTGGTTTCAG
AAAATGGCATGACTGAAAGTGACCTTTGGCATACACGCCTATCCCATATTAGTATTAAAGGCCTCCAAATCTTGACTAAGCAAGGTATTCTTTCTCAAGAAGAGAAGTGG
TCTGGTAAGCCACCAAATTTGCATCACCTCAAGGACAGAGCAACATCAGTTTCACAAGAAAAAGGTGAATCACTGGAAGAAGAAGAAACTGATGATACATTAGAAGAATC
TCAAAATCTAAGGAATTATTCCTTAGCTAGGGACAGACAGAGAAGAACAATAGTCCCACCATCGAGATTCAGTGAAGCTGATTGCATTTCCCTCGTTCAACATGTGTCTG
ACTCTCTGTTATTTGAAGAATCATCTAGCTTTGAGGAAGCAACGAATGGACCTAACAGTAGAGCTGGATCGAGGCCTCTCCCTAAAGGGTACAAACCTATTGCCTCGAAG
TGGATTTACAAGGTCAAAGAAGGAGTTGCAGGAGTTATGAAGCCCAGGTTTAAAGCTCGGTTAGTAGCTAAAGGCTACACTCAGAAGGAGGGTATAGATTACCCTGAGGT
GTTCTCCCCAGTGGTGAAGTTGACCTCAATCATATTGCTACTCTTCCTTGTTGTCCAAGAGGATTTGATGCTTGACTAG
Protein sequence
MVVARYDIKKFDGKGDFDLWKAKIKAILGQQKTLLALSDPTQLPATLSKDDKEAMNMTAYETITLNLSNSVLRQVIDADTPLKIWQKLNDLYETKDIHNKMYMREKFFTE
VKNALKYGRVSITTDAIISAIRTKELELLAAKKETFEGLFVKGKPRSKDNKHQTDDKGKSKDVEMLQSALVVSENGMTESDLWHTRLSHISIKGLQILTKQGILSQEEKW
SGKPPNLHHLKDRATSVSQEKGESLEEEETDDTLEESQNLRNYSLARDRQRRTIVPPSRFSEADCISLVQHVSDSLLFEESSSFEEATNGPNSRAGSRPLPKGYKPIASK
WIYKVKEGVAGVMKPRFKARLVAKGYTQKEGIDYPEVFSPVVKLTSIILLLFLVVQEDLMLD
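
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies (the page does not name a translation table); `translate` is a hypothetical helper, and only the first 30 nt of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTTGTAGCAAGATATGATATAAAGAAG"  # first 30 nt of the CDS above
print(translate(cds_start))  # MVVARYDIKK
```

The result matches the first ten residues of the protein sequence. Passing the full CDS to the same function should give the complete protein, provided the standard-code assumption holds.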