; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018889 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018889
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationtig00153226:1846265..1848770
RNA-Seq ExpressionSgr018889
SyntenySgr018889
Gene Ontology termsNA
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFZ21628.1 hypothetical protein Acr_29g0007900 [Actinidia rufa]1.3e-7137.47Show/hide
Query:  LLKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQR------------------------------QVDE-------------
        L +EFEDVFPE++P  LPP+RGIEH+IDF+ GA IPN  AYR+N +E KE+QR                              +VDE             
Subjt:  LLKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQR------------------------------QVDE-------------

Query:  -------------------LLANG-----------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFL
                           L+  G                                TW HYLWPKEFVI TDHESLKHL+SQ KLNKRHA+W+E+IE+F 
Subjt:  -------------------LLANG-----------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFL

Query:  YVIRHKKGKENVVADALSR---------------------------------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFG
        YVIR+K+GKENVVADALSR                                                   R+ KLC+P+CS+R+LLV+E+HG GLMG FG
Subjt:  YVIRHKKGKENVVADALSR---------------------------------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFG

Query:  VAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHA------------------------QSK---------VDIIQKL------HK-------
        VAKT  +L EHF+WP M+ DV+++C  C+TC++ KS++QP+                         +S+         VD   K+      HK       
Subjt:  VAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHA------------------------QSK---------VDIIQKL------HK-------

Query:  ------EIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK
              EI    E+   +   + NKGR  +VF  GDWV +H+RKERF ++R  KL  R DGPFQVLERINDNAYK
Subjt:  ------EIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK

RVW46460.1 Transposon Ty3-I Gag-Pol polyprotein [Vitis vinifera]1.9e-7540.14Show/hide
Query:  LKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG---------------------------------------
        +KE+EDVFP  +P  LPP+RGIEH+IDF+  A IPNR AYR+N +E KE+QRQV+ELL  G                                       
Subjt:  LKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG---------------------------------------

Query:  ---------------------------------------------------------------------TSQT---WHHYLWPKEFVIHTDHESLKHLRS
                                                                             T ++   W HYLWPKEFVIHTDHESLKHLR 
Subjt:  ---------------------------------------------------------------------TSQT---WHHYLWPKEFVIHTDHESLKHLRS

Query:  QDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSCS------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEH
        Q KLN+RHAKW+E+IE+FLYVI++K+GKEN+VA+ALSRR   +   +        ++EL   +               AHG GLMG FGV KT ++L EH
Subjt:  QDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSCS------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEH

Query:  FFWPKMRHDVQKVCNSCLTC----KRTKSKIQPHAQS-----KVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFK
        FFWPKM+ DV++ C  C+TC    +  +  I     S     KV++++KLH+ +++ IE+ N + ASK NKGR+ ++F  GDWVWVH+RKERF   R  K
Subjt:  FFWPKMRHDVQKVCNSCLTC----KRTKSKIQPHAQS-----KVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFK

Query:  LNSREDGPFQVLERINDNAYK
        L+ R DGPFQ+LERINDNAYK
Subjt:  LNSREDGPFQVLERINDNAYK

RVW72262.1 Transposon Ty3-I Gag-Pol polyprotein [Vitis vinifera]1.2e-6933.73Show/hide
Query:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG----------------------------------------
        KE+EDVFP  +P  LPP+RGIEH+IDF+LGA IPN+ AYR+N +E KE+QRQV+ELL  G                                        
Subjt:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSC
                                  +TW HYLWPKEFVIHTDHESLKHL+ Q KLN+RHAKW+E+IE+F YVI++K+GKEN+ ADALSRR   +   + 
Subjt:  ------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSC

Query:  S------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------------------
               ++EL   E               AHG GLMG FGV KT ++L EHFFWPKM+ DV++ C  C+TC++TKS++  H                  
Subjt:  S------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------------------

Query:  -------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAY
                            + K ++++KLH+ +++ IE+ N + A+K NKG + ++F  GDWVWVH+RKERF   R  KL+ R DGPFQVLERINDNAY
Subjt:  -------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAY

Query:  K
        K
Subjt:  K

XP_041001591.1 LOW QUALITY PROTEIN: uncharacterized protein LOC121247287, partial [Juglans microcarpa x Juglans regia]6.0e-6941.79Show/hide
Query:  QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRR-----------------------------------
        +TW HYLWP+EFVIHTDHESLKHL+ Q KLNKRHA+W+EYIE+F YVIR+K+GKEN+VADALSRR                                   
Subjt:  QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRR-----------------------------------

Query:  ----------------KGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH--------------
                        + KLC+P+CS+RELLV+EAHG GLMG FGV KT ++L EHFFWPKM+ DV ++C  C+TC++ KSK+ PH              
Subjt:  ----------------KGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH--------------

Query:  -----------------------------------------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWV
                                                              Q K ++++ LH+ ++ +I + N +VAS+ NKGR+ ++F  GDWVWV
Subjt:  -----------------------------------------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWV

Query:  HLRKERFSLERSFKLNSREDGPFQVLERINDNAYK
        H+RKERF   R  KL+ R DGPFQ+LE+INDNAYK
Subjt:  HLRKERFSLERSFKLNSREDGPFQVLERINDNAYK

XP_041001591.1 LOW QUALITY PROTEIN: uncharacterized protein LOC121247287, partial [Juglans microcarpa x Juglans regia]1.6e-1371.43Show/hide
Query:  LLKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVD
        LL+EFEDVFP +MP ELPP+RGIEH+IDF+ GA IPNR AYR+N +E KE+QRQ D
Subjt:  LLKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVD

XP_041001591.1 LOW QUALITY PROTEIN: uncharacterized protein LOC121247287, partial [Juglans microcarpa x Juglans regia]7.9e-6934.52Show/hide
Query:  EFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG-----------------------------------------
        EF+DVFP+++P  LPPLRGIEH+IDF+ GA+IPNR  YRTN +E KE+QRQV+EL+  G                                         
Subjt:  EFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR--------------------------
                  +TW HYLWPKEFVIH+DHE+LKH++ Q KLNKRHAKW+EY+ESF YVI++KKGKEN+VADALSR                          
Subjt:  --------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR--------------------------

Query:  -------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHAQSK
                                 R+GKLC+P  SIR++LV EAH  GLMG FGV KT   L EH +WPKM+ DV+++    L        I   A+ K
Subjt:  -------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHAQSK

Query:  VDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK
         + +++LH+ ++  IE        K NKGRK ++F  GDWVWVH+RKERF  +R  KL  R DGPFQVLERIN+N+Y+
Subjt:  VDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK

TrEMBL top hitse value%identityAlignment
A0A2N9F5Z2 Patatin4.2e-6849.25Show/hide
Query:  QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRR-----------------------------------
        +TW HYLWP+EFVIHTDHESLKHL+ Q KLN+RHA+WLEYIE+F YVIR+K+GKEN+V DALSRR                                   
Subjt:  QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRR-----------------------------------

Query:  ----------------KGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHAQSKVDIIQKLHKE
                        + KLC+PSCS+RELLV+EAHG GL+G FGV KT ++L EHFFWPKM+ DV ++C         +S +    Q K ++++ LH+ 
Subjt:  ----------------KGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHAQSKVDIIQKLHKE

Query:  IKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK
        ++ +I + N +VAS+ NKGR+ ++F  GDWVWVH+RKERF   R  KL+ R DGPFQ+LE+INDNAYK
Subjt:  IKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK

A0A2N9F5Z2 Patatin3.2e-1568.33Show/hide
Query:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG
        +EFEDVFPE+MP ELPP+RGIEH+IDF+ GA IPNR AYR+N +E KE+QRQV++L+  G
Subjt:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG

A0A2N9HBQ8 Integrase catalytic domain-containing protein4.5e-7837.23Show/hide
Query:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNL---KEAKEIQRQVDELLA--------------------------NGTS---------
        KE+EDVFP  +P  LPP+RGIEH+IDF+ GA IPNR AYR+N    K   + +  VDE                             NG +         
Subjt:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNL---KEAKEIQRQVDELLA--------------------------NGTS---------

Query:  --------QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR----------------------------
                +TW HYLWPKEFVIHTDHESLKHL+ Q KLN+RHA+W+E+IE+F YVI++K+GKEN+VADALSR                            
Subjt:  --------QTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR----------------------------

Query:  -----------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------
                               R+ +LC+P+ S+RELLV+EAHG GLMG FGV KT ++L EHFFWPKM+ DV++VC+ C+TC++ KS++ PH      
Subjt:  -----------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------

Query:  ----------------------------------------------------------------------------------------AQSKVDIIQKLH
                                                                                                 Q K ++++KLH
Subjt:  ----------------------------------------------------------------------------------------AQSKVDIIQKLH

Query:  KEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK
        + +++ IE+ N + A+K NKGR+ ++F  GDWVWVH+RKERFS  R  KL+ R DGPFQVLERINDNAYK
Subjt:  KEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK

A0A438EFI9 Transposon Ty3-I Gag-Pol polyprotein9.3e-7640.14Show/hide
Query:  LKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG---------------------------------------
        +KE+EDVFP  +P  LPP+RGIEH+IDF+  A IPNR AYR+N +E KE+QRQV+ELL  G                                       
Subjt:  LKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG---------------------------------------

Query:  ---------------------------------------------------------------------TSQT---WHHYLWPKEFVIHTDHESLKHLRS
                                                                             T ++   W HYLWPKEFVIHTDHESLKHLR 
Subjt:  ---------------------------------------------------------------------TSQT---WHHYLWPKEFVIHTDHESLKHLRS

Query:  QDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSCS------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEH
        Q KLN+RHAKW+E+IE+FLYVI++K+GKEN+VA+ALSRR   +   +        ++EL   +               AHG GLMG FGV KT ++L EH
Subjt:  QDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSCS------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEH

Query:  FFWPKMRHDVQKVCNSCLTC----KRTKSKIQPHAQS-----KVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFK
        FFWPKM+ DV++ C  C+TC    +  +  I     S     KV++++KLH+ +++ IE+ N + ASK NKGR+ ++F  GDWVWVH+RKERF   R  K
Subjt:  FFWPKMRHDVQKVCNSCLTC----KRTKSKIQPHAQS-----KVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFK

Query:  LNSREDGPFQVLERINDNAYK
        L+ R DGPFQ+LERINDNAYK
Subjt:  LNSREDGPFQVLERINDNAYK

A0A438FGZ8 RNA-directed DNA polymerase-like2.5e-6838.61Show/hide
Query:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDEL----------------------------LANGTSQ---------
        KE+EDV    +P  LPP+RGIEH+IDF+LGA IPNR AYR+N +E KE+QR ++ +                            L +G  Q         
Subjt:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDEL----------------------------LANGTSQ---------

Query:  -------TWHHYLW----------PKEFV----IHTDHESLK-------------HLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR---
                +  Y W          P  F+    I  D E +K             HL+ Q KLN+ HAKW+E+I +F YVI++K+GKEN+VADALSR   
Subjt:  -------TWHHYLW----------PKEFV----IHTDHESLK-------------HLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR---

Query:  ------------------------------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQ
                                                        R+ ++C+P+ S+ ELLV+EAHG GLMG FGV KT ++L EHFFWPKM+ DV+
Subjt:  ------------------------------------------------RKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQ

Query:  KVCNSCLTCKRTKSKIQPH---AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERIND
        + C  C+TC++ KS++ PH        ++++KLH+ +++ IE+ N + A+K NKGR+ ++F  GDWVWVH RKERF   R  KL+ R D PFQVLERIND
Subjt:  KVCNSCLTCKRTKSKIQPH---AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERIND

Query:  NAYK
        NAYK
Subjt:  NAYK

A0A438GJB5 Transposon Ty3-I Gag-Pol polyprotein5.9e-7033.73Show/hide
Query:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG----------------------------------------
        KE+EDVFP  +P  LPP+RGIEH+IDF+LGA IPN+ AYR+N +E KE+QRQV+ELL  G                                        
Subjt:  KEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANG----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSC
                                  +TW HYLWPKEFVIHTDHESLKHL+ Q KLN+RHAKW+E+IE+F YVI++K+GKEN+ ADALSRR   +   + 
Subjt:  ------------------------TSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSRRKGKLCIPSC

Query:  S------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------------------
               ++EL   E               AHG GLMG FGV KT ++L EHFFWPKM+ DV++ C  C+TC++TKS++  H                  
Subjt:  S------IRELLVKE---------------AHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPH------------------

Query:  -------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAY
                            + K ++++KLH+ +++ IE+ N + A+K NKG + ++F  GDWVWVH+RKERF   R  KL+ R DGPFQVLERINDNAY
Subjt:  -------------------AQSKVDIIQKLHKEIKERIERNNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAY

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.4e-0635.78Show/hide
Query:  DFLLGALIP---NRLAYRTNLKEAKEIQRQV--DELLA-NGTSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENV
        D  LGA++    + L+Y +      EI       ELLA    ++T+ HYL  + F I +DH+ L  L      N +  +W   +  F + I++ KGKEN 
Subjt:  DFLLGALIP---NRLAYRTNLKEAKEIQRQV--DELLA-NGTSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENV

Query:  VADALSRRK
        VADALSR K
Subjt:  VADALSRRK

P20825 Retrovirus-related Pol polyprotein from transposon 2975.8e-0632.08Show/hide
Query:  HKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANGTSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVAD
        H I F+   L  + L Y    KE          L     ++T+ HYL  ++F+I +DH+ L+ L +  +   +  +W   +  + + I + KGKEN VAD
Subjt:  HKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANGTSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVAD

Query:  ALSRRK
        ALSR K
Subjt:  ALSRRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.3e-0530.11Show/hide
Query:  LAYRTNLKEAKEIQRQVDELLANGTSQTWHHY---LWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR
        + Y +   E+ +      EL   G  +  HH+   L  K F + TDH SL  L+++++  +R  +WL+ + ++ + + +  G +NVVADA+SR
Subjt:  LAYRTNLKEAKEIQRQVDELLANGTSQTWHHY---LWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.3e-0530.11Show/hide
Query:  LAYRTNLKEAKEIQRQVDELLANGTSQTWHHY---LWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR
        + Y +   E+ +      EL   G  +  HH+   L  K F + TDH SL  L+++++  +R  +WL+ + ++ + + +  G +NVVADA+SR
Subjt:  LAYRTNLKEAKEIQRQVDELLANGTSQTWHHY---LWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLYVIRHKKGKENVVADALSR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGCTCTTAAAGGAATTTGAGGATGTGTTTCCCGAAAAGATGCCTAGGGAACTTCCACCTCTTAGAGGCATTGAGCACAAGATTGACTTCCTGCTGGGAGCCCTTATTCCT
AATAGGCTGGCATATAGGACTAACCTTAAGGAGGCCAAAGAGATACAAAGACAAGTTGATGAACTCCTTGCCAACGGGACTTCGCAAACTTGGCACCATTATCTTTGGCC
TAAGGAGTTTGTAATCCACACCGATCATGAGAGCCTTAAGCACTTGAGGAGCCAAGACAAGCTCAACAAGCGACATGCCAAGTGGTTAGAGTACATTGAAAGCTTCCTAT
ACGTGATAAGGCACAAGAAAGGTAAGGAAAATGTTGTGGCGGATGCACTCTCAAGGAGGAAAGGAAAGTTGTGCATTCCAAGTTGCTCCATTAGAGAGCTATTGGTGAAG
GAGGCACATGGAGAAGGCTTGATGGGACAGTTTGGAGTTGCCAAAACCTATAATATGTTGGCCGAACATTTCTTTTGGCCAAAGATGAGGCATGATGTCCAAAAGGTATG
TAATAGTTGCTTGACATGCAAGAGGACTAAATCAAAGATACAACCACATGCCCAAAGCAAGGTAGACATTATACAAAAGCTACACAAGGAGATCAAAGAAAGGATTGAAA
GGAACAACTTCAAGGTGGCCTCAAAAGTAAACAAAGGTCGCAAACCCATGGTGTTTAGCAAAGGTGATTGGGTTTGGGTGCATCTAAGAAAAGAAAGATTTTCTCTTGAA
AGAAGCTTCAAACTCAACTCTAGAGAAGATGGCCCATTTCAAGTGCTTGAGCGAATCAACGATAACGCTTACAAG
mRNA sequenceShow/hide mRNA sequence
AGCTCTTAAAGGAATTTGAGGATGTGTTTCCCGAAAAGATGCCTAGGGAACTTCCACCTCTTAGAGGCATTGAGCACAAGATTGACTTCCTGCTGGGAGCCCTTATTCCT
AATAGGCTGGCATATAGGACTAACCTTAAGGAGGCCAAAGAGATACAAAGACAAGTTGATGAACTCCTTGCCAACGGGACTTCGCAAACTTGGCACCATTATCTTTGGCC
TAAGGAGTTTGTAATCCACACCGATCATGAGAGCCTTAAGCACTTGAGGAGCCAAGACAAGCTCAACAAGCGACATGCCAAGTGGTTAGAGTACATTGAAAGCTTCCTAT
ACGTGATAAGGCACAAGAAAGGTAAGGAAAATGTTGTGGCGGATGCACTCTCAAGGAGGAAAGGAAAGTTGTGCATTCCAAGTTGCTCCATTAGAGAGCTATTGGTGAAG
GAGGCACATGGAGAAGGCTTGATGGGACAGTTTGGAGTTGCCAAAACCTATAATATGTTGGCCGAACATTTCTTTTGGCCAAAGATGAGGCATGATGTCCAAAAGGTATG
TAATAGTTGCTTGACATGCAAGAGGACTAAATCAAAGATACAACCACATGCCCAAAGCAAGGTAGACATTATACAAAAGCTACACAAGGAGATCAAAGAAAGGATTGAAA
GGAACAACTTCAAGGTGGCCTCAAAAGTAAACAAAGGTCGCAAACCCATGGTGTTTAGCAAAGGTGATTGGGTTTGGGTGCATCTAAGAAAAGAAAGATTTTCTCTTGAA
AGAAGCTTCAAACTCAACTCTAGAGAAGATGGCCCATTTCAAGTGCTTGAGCGAATCAACGATAACGCTTACAAG
Protein sequenceShow/hide protein sequence
LLKEFEDVFPEKMPRELPPLRGIEHKIDFLLGALIPNRLAYRTNLKEAKEIQRQVDELLANGTSQTWHHYLWPKEFVIHTDHESLKHLRSQDKLNKRHAKWLEYIESFLY
VIRHKKGKENVVADALSRRKGKLCIPSCSIRELLVKEAHGEGLMGQFGVAKTYNMLAEHFFWPKMRHDVQKVCNSCLTCKRTKSKIQPHAQSKVDIIQKLHKEIKERIER
NNFKVASKVNKGRKPMVFSKGDWVWVHLRKERFSLERSFKLNSREDGPFQVLERINDNAYK