CuGenDBv2

Lag0012059 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0012059
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr1:36947854..36953723
RNA-Seq Expression: Lag0012059
Synteny: Lag0012059
Gene Ontology terms:
GO:0044237 - cellular metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
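The genome location above is written as a sequence name followed by `start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), the span of the locus can be computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for locations of the form "chr1:36947854..36953723".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("chr1:36947854..36953723")
print(seqid, span_length(start, end))
```

Under that assumption the locus spans 5870 bp, which is longer than the coding sequence listed further down; this would be consistent with introns or other non-coding sequence inside the gene model.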


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043826.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 4.6e-70; % identity: 50.15)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKC+FIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKE--
        QWK AMEEEL SLHKNQ WSLVP   +Q     K    +      VVRH SIRL+LSI VHFDMF+E MD    FLHGELE+VIYMAQPKGYEVKGKE  
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKE--

Query:  ---DMVCLLYKSIYGLKQSPRQWYIRSHT
           DMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  ---DMVCLLYKSIYGLKQSPRQWYIRSHT

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] (e value: 2.5e-68; % identity: 49.09)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPN------VGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEV
        QWK AMEEEL SLHKNQ WSLVP       + S+  Y  K G   N            RL+LSI VHFDMF+E MD    FLHGELE+VIYMAQPKGYEV
Subjt:  QWKAAMEEELSSLHKNQMWSLVPN------VGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEV

Query:  KGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
        KGKEDMVC L+KS+YGLKQSPRQWYI   T
Subjt:  KGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.4e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa] (e value: 5.4e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.4e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TP18 Putative gag-pol polyprotein (e value: 2.2e-70; % identity: 50.15)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKC+FIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKE--
        QWK AMEEEL SLHKNQ WSLVP   +Q     K    +      VVRH SIRL+LSI VHFDMF+E MD    FLHGELE+VIYMAQPKGYEVKGKE  
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKE--

Query:  ---DMVCLLYKSIYGLKQSPRQWYIRSHT
           DMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  ---DMVCLLYKSIYGLKQSPRQWYIRSHT

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class (e value: 1.2e-68; % identity: 49.09)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPN------VGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEV
        QWK AMEEEL SLHKNQ WSLVP       + S+  Y  K G   N            RL+LSI VHFDMF+E MD    FLHGELE+VIYMAQPKGYEV
Subjt:  QWKAAMEEELSSLHKNQMWSLVPN------VGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEV

Query:  KGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
        KGKEDMVC L+KS+YGLKQSPRQWYI   T
Subjt:  KGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

A0A5A7UB25 Putative gag-pol polyprotein (e value: 2.6e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

A0A5D3CTV2 Putative polyprotein (e value: 2.6e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 2.6e-79; % identity: 50.42)
Query:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--
        G+APSL+HLRVF C AYAHVK+ KL KRALKCMFIGYPQG+KGYK+WC+E+G +KCIIS+DVTFNE EM +C K Q + ++ ++V  EVRI SE R S  
Subjt:  GEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSS--

Query:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE
                                          A   ESS  + L +   +  R + +                          E +TFEEAI    K+
Subjt:  ----------------------------------ATRGESSDGSQLGSQAESSQRFESD--------------------------ESITFEEAIEFGLKE

Query:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA
        QWK AMEEEL SLHKNQ WSLVP   +Q                              GYTQKEGVD++E+FS VVRH SIRL+LSI VHFDMF+E MD 
Subjt:  QWKAAMEEELSSLHKNQMWSLVPNVGSQ-----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDA

Query:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT
          AFLHGELE+VIYMAQPKGYEVKGKEDMVC L+KS+YGLKQSPRQWYIR  T
Subjt:  IIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHT

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.0e-11; % identity: 27.09)
Query:  ENVEIEVRIESETRSSATRGESSDGSQLGSQAESSQRFESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMWSLVPNVGSQN---------------
        + +EI  R     ++      + + + L     ++    +D   +F+E      K  W+ A+  EL++   N  W++     ++N               
Subjt:  ENVEIEVRIESETRSSATRGESSDGSQLGSQAESSQRFESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMWSLVPNVGSQN---------------

Query:  -------------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPR
                     G+TQK  +DY E F+ V R  S R +LS+V+ +++ +  MD   AFL+G L++ IYM  P+G  +    D VC L K+IYGLKQ+ R
Subjt:  -------------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPR

Query:  QWY
         W+
Subjt:  QWY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.6e-33; % identity: 32.53)
Query:  SLDHLRVFCCLAYAHVKEE---KLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRS--VEN-VEIEVRIESETRSS
        S  HL+VF C A+AHV +E   KL  +++ C+FIGY     GY++W  +  + K I S+DV F E E+   +    ++++  + N V I     + T + 
Subjt:  SLDHLRVFCCLAYAHVKEE---KLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRS--VEN-VEIEVRIESETRSS

Query:  ATRGESSD-----------GSQLGSQAE-----------------------SSQRFESDESI---------TFEEAIEFGLKEQWKAAMEEELSSLHKNQ
        +T  E S+           G QL    E                        S+R+ S E +         + +E +    K Q   AM+EE+ SL KN 
Subjt:  ATRGESSD-----------GSQLGSQAE-----------------------SSQRFESDESI---------TFEEAIEFGLKEQWKAAMEEELSSLHKNQ

Query:  MWSLVPNVGSQ----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQP
         + LV     +                             G+ QK+G+D++E+FS VV+  SIR +LS+    D+ +E +D   AFLHG+LE+ IYM QP
Subjt:  MWSLVPNVGSQ----------------------------NGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQP

Query:  KGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIR
        +G+EV GK+ MVC L KS+YGLKQ+PRQWY++
Subjt:  KGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.6e-17; % identity: 35.67)
Query:  AIEFGLKEQWKAAMEEELSSLHKNQMWSLVP--------------------NVGSQN---------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFD
        AI+    E+W+ AM  E+++   N  W LVP                    + GS N         GY Q+ G+DY E FS V++  SIR+VL + V   
Subjt:  AIEFGLKEQWKAAMEEELSSLHKNQMWSLVP--------------------NVGSQN---------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFD

Query:  MFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYI
          +  +D   AFL G L   +YM+QP G+  K + + VC L K++YGLKQ+PR WY+
Subjt:  MFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 8.0e-17; % identity: 32.85)
Query:  VEIEVRIESETRSSATRGESSDGSQLGSQAESSQRFESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMWSLVP--------------------NVG
        +++  +    T S ATR  + DG +  +Q  S     +  S     AI+    ++W+ AM  E+++   N  W LVP                    + G
Subjt:  VEIEVRIESETRSSATRGESSDGSQLGSQAESSQRFESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMWSLVP--------------------NVG

Query:  SQN---------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQ
        S N         GY Q+ G+DY E FS V++  SIR+VL + V     +  +D   AFL G L   +YM+QP G+  K + D VC L K+IYGLKQ+PR 
Subjt:  SQN---------GYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQ

Query:  WYIRSHT
        WY+   T
Subjt:  WYIRSHT

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.8e-17; % identity: 33.33)
Query:  ESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMW---SLVPN---VGSQ----------------------NGYTQKEGVDYNEVFSLVVRHLSIRL
        ++ E  T+ EA EF +   W  AM++E+ ++     W   +L PN   +G +                       GYTQ+EG+D+ E FS V +  S++L
Subjt:  ESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMW---SLVPN---VGSQ----------------------NGYTQKEGVDYNEVFSLVVRHLSIRL

Query:  VLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDM----VCLLYKSIYGLKQSPRQWYIR
        +L+I   ++  L  +D   AFL+G+L++ IYM  P GY  +  + +    VC L KSIYGLKQ+ RQW+++
Subjt:  VLSIVVHFDMFLEHMDAIIAFLHGELEKVIYMAQPKGYEVKGKEDM----VCLLYKSIYGLKQSPRQWYIR


Sequences
CDS sequence
ATGGAAGGTGAAGCTCCAAGTCTTGATCATCTCAGAGTATTTTGTTGTTTAGCTTATGCTCATGTTAAGGAAGAAAAGCTGAAGAAGAGAGCATTGAAATGTATGTTTAT
TGGCTACCCTCAAGGTATCAAAGGATATAAGATTTGGTGCTTAGAAGAAGGTAGAAGTAAATGTATAATTAGCAAAGATGTCACTTTCAATGAAGATGAGATGTCATTTT
GTAGTAAAAGCCAATCAGAACTGCGATCAGTTGAAAATGTTGAAATAGAAGTCAGAATTGAGTCTGAAACCCGATCATCAGCTACTAGAGGTGAATCTAGTGATGGTTCA
CAGTTGGGTTCACAAGCAGAGTCTTCACAACGGTTTGAATCTGATGAGTCTATTACATTCGAAGAGGCTATTGAGTTTGGGTTGAAGGAACAGTGGAAAGCTGCAATGGA
AGAAGAATTGTCCTCTTTGCATAAGAATCAGATGTGGTCATTGGTTCCAAATGTTGGATCGCAAAATGGCTACACTCAAAAGGAGGGAGTTGATTATAATGAGGTTTTCT
CTCTGGTGGTAAGACATTTGTCTATTCGATTAGTTTTGTCTATTGTTGTTCACTTTGATATGTTTCTTGAACATATGGATGCCATTATAGCATTCCTTCATGGAGAATTG
GAGAAGGTGATTTACATGGCTCAACCAAAGGGCTATGAGGTGAAGGGAAAGGAAGATATGGTTTGTCTCCTTTACAAGTCTATTTATGGACTAAAGCAATCACCAAGACA
GTGGTATATTCGCTCTCATACCAATTGTTATGAGTGCAACCAAAGTCCCACATTGGCTAGACAAGAGGATGATCATAAGTATATAAGAGGGGACAAGTATCTCCATTGGA
TAGTAACTAAGACTATTGCTAATCGACTTAAGATTGTATTGAAAGACATCATTTCTACTCCTCAATTGACTTTTTATACAGGGAAGGAAGCCCTCTTATCTATGGGAAGA
GCTTATTATGGAGTAGAGATTTGCTTAAACAAGTTATACGCCGACGTCAGGTCCTGCGTCGACGTAGATAACGTCGGCGTATCTCAAAATTCTTGTAGTGAAAAGCAAAG
GCAACAAATCCATGTTCTTTTTGGCTGTAAAAGAGCTAAAGAAGTTTGGGATCTTACTTTTCAGCATGATATTCTAAAGGTCGATTTCAATCAAAGCTTTGTGGATAAAT
AG
mRNA sequence
ATGGAAGGTGAAGCTCCAAGTCTTGATCATCTCAGAGTATTTTGTTGTTTAGCTTATGCTCATGTTAAGGAAGAAAAGCTGAAGAAGAGAGCATTGAAATGTATGTTTAT
TGGCTACCCTCAAGGTATCAAAGGATATAAGATTTGGTGCTTAGAAGAAGGTAGAAGTAAATGTATAATTAGCAAAGATGTCACTTTCAATGAAGATGAGATGTCATTTT
GTAGTAAAAGCCAATCAGAACTGCGATCAGTTGAAAATGTTGAAATAGAAGTCAGAATTGAGTCTGAAACCCGATCATCAGCTACTAGAGGTGAATCTAGTGATGGTTCA
CAGTTGGGTTCACAAGCAGAGTCTTCACAACGGTTTGAATCTGATGAGTCTATTACATTCGAAGAGGCTATTGAGTTTGGGTTGAAGGAACAGTGGAAAGCTGCAATGGA
AGAAGAATTGTCCTCTTTGCATAAGAATCAGATGTGGTCATTGGTTCCAAATGTTGGATCGCAAAATGGCTACACTCAAAAGGAGGGAGTTGATTATAATGAGGTTTTCT
CTCTGGTGGTAAGACATTTGTCTATTCGATTAGTTTTGTCTATTGTTGTTCACTTTGATATGTTTCTTGAACATATGGATGCCATTATAGCATTCCTTCATGGAGAATTG
GAGAAGGTGATTTACATGGCTCAACCAAAGGGCTATGAGGTGAAGGGAAAGGAAGATATGGTTTGTCTCCTTTACAAGTCTATTTATGGACTAAAGCAATCACCAAGACA
GTGGTATATTCGCTCTCATACCAATTGTTATGAGTGCAACCAAAGTCCCACATTGGCTAGACAAGAGGATGATCATAAGTATATAAGAGGGGACAAGTATCTCCATTGGA
TAGTAACTAAGACTATTGCTAATCGACTTAAGATTGTATTGAAAGACATCATTTCTACTCCTCAATTGACTTTTTATACAGGGAAGGAAGCCCTCTTATCTATGGGAAGA
GCTTATTATGGAGTAGAGATTTGCTTAAACAAGTTATACGCCGACGTCAGGTCCTGCGTCGACGTAGATAACGTCGGCGTATCTCAAAATTCTTGTAGTGAAAAGCAAAG
GCAACAAATCCATGTTCTTTTTGGCTGTAAAAGAGCTAAAGAAGTTTGGGATCTTACTTTTCAGCATGATATTCTAAAGGTCGATTTCAATCAAAGCTTTGTGGATAAAT
AG
Protein sequence
MEGEAPSLDHLRVFCCLAYAHVKEEKLKKRALKCMFIGYPQGIKGYKIWCLEEGRSKCIISKDVTFNEDEMSFCSKSQSELRSVENVEIEVRIESETRSSATRGESSDGS
QLGSQAESSQRFESDESITFEEAIEFGLKEQWKAAMEEELSSLHKNQMWSLVPNVGSQNGYTQKEGVDYNEVFSLVVRHLSIRLVLSIVVHFDMFLEHMDAIIAFLHGEL
EKVIYMAQPKGYEVKGKEDMVCLLYKSIYGLKQSPRQWYIRSHTNCYECNQSPTLARQEDDHKYIRGDKYLHWIVTKTIANRLKIVLKDIISTPQLTFYTGKEALLSMGR
AYYGVEICLNKLYADVRSCVDVDNVGVSQNSCSEKQRQQIHVLFGCKRAKEVWDLTFQHDILKVDFNQSFVDK
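
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies (the page does not state which translation table was used). The `translate` function is a hypothetical helper written for illustration; only short fragments copied from the start and end of the CDS are translated here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAAGGTGAAGCTCCAAGTCTTGATCAT"))
print(translate("GTGGATAAATAG"))
```

The two fragments translate to `MEGEAPSLDH` and `VDK`, matching the first and last residues of the listed protein. Pasting the full CDS into `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.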