CuGenDBv2

CSPI03G19920 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G19920
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase
Genome location: Chr3:15645050..15647795
RNA-Seq Expression: CSPI03G19920
Synteny: CSPI03G19920
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
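
The genome location above follows a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and the locus span computed. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption; this page does not state the convention.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr3:15645050..15647795' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr3:15645050..15647795")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 2,746 bp.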


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035681.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 2.2e-84; % identity: 40.04)
Query:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMS-ENNKKAEEEAELIEADDGDRVS-----------YSDNSENI
        I+   K K+ L       NY +    +   C  PGHLSN C QRK IALAE+E   MS  + ++ EEE ELIEADDGDR+S              N ++ 
Subjt:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMS-ENNKKAEEEAELIEADDGDRVS-----------YSDNSENI

Query:  VAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLAR
           K    +N K   H +PYK+GW KKGGE ++NEICTI LSIG+ YKDQI+CD+ E+DV                                    PLA+
Subjt:  VAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLAR

Query:  KNSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-
        KN+E  R  +++QLFITV GK LL EREQDLLGL++ +KS+    E+VEP+L+ELF EFPHLKKEP+GLPPL+D+QH I+LVP ASL NL HYRMSP++ 
Subjt:  KNSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-

Query:  -------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTN
                                                                     D LDQLGKA+IFSKIDL++GYHQI+IR  DEWKT FKTN
Subjt:  -------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTN

Query:  EGLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--------------SFDDIK---------KKLVSNPIIQLS--
        EGLFE         ++  LF +      ++                 L+ E   + E              S  +++         ++ + N  + ++  
Subjt:  EGLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--------------SFDDIK---------KKLVSNPIIQLS--

Query:  --NFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
           F+SPFE+AV+A GTGIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  --NFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.3e-81; % identity: 51.32)
Query:  PGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTM
        PGH SNTCPQRK IALA++E  S SE++++ EEEA+LIEADDG RVS                                      + +SEN VAKKLVT 
Subjt:  PGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTM

Query:  LNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYE
        LNLKA  H NPYK+GW KKGGE  ++EICT+ LSIG+ YKDQI+CD+ E+D                            EREQDLLGLIIV+KS EEQ E
Subjt:  LNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYE

Query:  VVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-----DHLDQLGK----------------AIIFSKIDLKSGYHQIRI
         ++ +LQ+LF+EFPHLKKEP+GLPPL+D+Q  I+L+PGASL  L+HYRMSP +     +H+++L +                + +   IDLKSGYHQ+RI
Subjt:  VVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-----DHLDQLGK----------------AIIFSKIDLKSGYHQIRI

Query:  RTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL
        R  DEWKTTFKTNEGLFEWM+M FGLSN PSTF+RLMN+ L
Subjt:  RTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL

KAA0062943.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa] (e value: 1.0e-89; % identity: 39.64)
Query:  NYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------
        NY +    +   C  P HLSN CPQRK IALAE+E   MSE +K+ +EE ELIEAD+GDR+S                                      
Subjt:  NYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------

Query:  SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDL
        S +SEN VA+KLV  LNLK   H +PYK+GW KK GE ++NEICTI LSI + YKDQI+CD+ E+D                    VC   L    EQDL
Subjt:  SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDL

Query:  LGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--------------------------------
        LGL++ EKS+    E+VEP+L+ELF EFPHLKKEP+GLPPL D+QH I+LVPGASL +L HYRMSP++                                
Subjt:  LGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--------------------------------

Query:  ------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST------------
                                      D LDQLGKA +FSKIDL+S YHQIRIR  DEWKTTFK NEGLFEW+ M FGLSNAPST            
Subjt:  ------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST------------

Query:  -------------------------------------------------------------------------FVRLMNETLT----------------Q
                                                                                 F+R  +  +T                Q
Subjt:  -------------------------------------------------------------------------FVRLMNETLT----------------Q

Query:  QESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
        Q+SF++IK++L S+PI+QL +F+SPFE+ VDA G GIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  QESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] (e value: 8.8e-86; % identity: 40.11)
Query:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVS-----------YSDNSENIV
        I+   K K+ L       NY +    +   C  PGHLSN C QRK IALAE+E   MS  +++ EEE ELIEADDGDR+S              N ++  
Subjt:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVS-----------YSDNSENIV

Query:  AKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLARK
          K    +N K   H +PYK+GW KKGGE ++NEICTI LSIG+ YKDQI+CD+ E+DV                                    PLA+K
Subjt:  AKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLARK

Query:  NSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--
        N+E  R  +++QLFITV GK LL EREQDLLGL++ +KS+    E+VEP+L+ELF EFPHLKKEP+GLPPL+D+QH I+LVP ASL NL HYRMSP++  
Subjt:  NSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--

Query:  ------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNE
                                                                    D LDQLGKA+IFSKIDL++GYHQI+IR  DEWKT FKTNE
Subjt:  ------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNE

Query:  GLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--SFDDIKKKLVSNPIIQL-------------------------
        GLFE         ++  LF +      ++                 L+ E   + E    + I+ +     I ++                         
Subjt:  GLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--SFDDIKKKLVSNPIIQL-------------------------

Query:  SNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
          F+SPFE+AV+A GTGIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  SNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus] (e value: 1.9e-109; % identity: 42.57)
Query:  SVWWDQLEIYWQRCGKQPIRSWKKMKKLLKARFLPPNYEQTLYNQYQNC---------------------------------------------------
        S WWDQLEI  QRCGKQPIRSW+KMKKLLKARFLPPNYEQTLYNQYQNC                                                   
Subjt:  SVWWDQLEIYWQRCGKQPIRSWKKMKKLLKARFLPPNYEQTLYNQYQNC---------------------------------------------------

Query:  ----------------------------------------------------------------HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEA
                                                                          GHLSN CPQRK IA+A EEGG  SE++ +AEEE 
Subjt:  ----------------------------------------------------------------HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEA

Query:  ELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSI
        ELIEADDG+RVS                                      S +SEN VAKKLV +LNLKA  H  PYK+GW +KGGE  V+EICT+ LSI
Subjt:  ELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSI

Query:  GSEYKDQIICDITEIDV------------------------------------PLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQY
        G+ YKDQI+CD+ E+DV                                    P+ +K +EG R EKQLFITV GKK+L EREQ +LGL+++EK+KE+Q 
Subjt:  GSEYKDQIICDITEIDV------------------------------------PLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQY

Query:  EVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD----------------------------------------------
        E +EPKLQ+L  EFPH+K+EP+GLPPL+D+QHHI+L+PGASL NLAHYRMSPQ+                                              
Subjt:  EVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD----------------------------------------------

Query:  ----------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETLTQQESFDDI
                        D LDQLGKA IFSKIDLKSGYHQIR+R  DEWKT FKTNEGLFEWMVM FGLSNAPSTF+RLMN+TL       DI
Subjt:  ----------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETLTQQESFDDI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T256 Reverse transcriptase (e value: 1.1e-84; % identity: 40.04)
Query:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMS-ENNKKAEEEAELIEADDGDRVS-----------YSDNSENI
        I+   K K+ L       NY +    +   C  PGHLSN C QRK IALAE+E   MS  + ++ EEE ELIEADDGDR+S              N ++ 
Subjt:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMS-ENNKKAEEEAELIEADDGDRVS-----------YSDNSENI

Query:  VAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLAR
           K    +N K   H +PYK+GW KKGGE ++NEICTI LSIG+ YKDQI+CD+ E+DV                                    PLA+
Subjt:  VAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLAR

Query:  KNSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-
        KN+E  R  +++QLFITV GK LL EREQDLLGL++ +KS+    E+VEP+L+ELF EFPHLKKEP+GLPPL+D+QH I+LVP ASL NL HYRMSP++ 
Subjt:  KNSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-

Query:  -------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTN
                                                                     D LDQLGKA+IFSKIDL++GYHQI+IR  DEWKT FKTN
Subjt:  -------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTN

Query:  EGLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--------------SFDDIK---------KKLVSNPIIQLS--
        EGLFE         ++  LF +      ++                 L+ E   + E              S  +++         ++ + N  + ++  
Subjt:  EGLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--------------SFDDIK---------KKLVSNPIIQLS--

Query:  --NFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
           F+SPFE+AV+A GTGIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  --NFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

A0A5A7U9F4 Uncharacterized protein (e value: 1.4e-76; % identity: 56.4)
Query:  PLARKNSEGTRHEKQ---LFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRM
        P+ +KNSEG R  K    LFITV GKKLL EREQDLLGL+I +KSKEEQ E++EPKL +LF+EFPHLKKEP+GLPPL+D+QH I+L+PGASL NLAHYRM
Subjt:  PLARKNSEGTRHEKQ---LFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRM

Query:  SPQD----DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST----------------------------------
        S ++    D L+QLGK   FSK DLKSGY QIRI+  DEWKTTFKTNEGLFEW+VM FGLSNAPST                                  
Subjt:  SPQD----DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST----------------------------------

Query:  -----------FVRLMNETLT----------------QQESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYF
                   F+R  N   +                QQESF+DIK++L SNPI+QL +FSSPFE+AVDA  TGIGVVLS QGHPIEYF
Subjt:  -----------FVRLMNETLT----------------QQESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYF

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.9e-90; % identity: 39.64)
Query:  NYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------
        NY +    +   C  P HLSN CPQRK IALAE+E   MSE +K+ +EE ELIEAD+GDR+S                                      
Subjt:  NYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------

Query:  SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDL
        S +SEN VA+KLV  LNLK   H +PYK+GW KK GE ++NEICTI LSI + YKDQI+CD+ E+D                    VC   L    EQDL
Subjt:  SDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDL

Query:  LGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--------------------------------
        LGL++ EKS+    E+VEP+L+ELF EFPHLKKEP+GLPPL D+QH I+LVPGASL +L HYRMSP++                                
Subjt:  LGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--------------------------------

Query:  ------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST------------
                                      D LDQLGKA +FSKIDL+S YHQIRIR  DEWKTTFK NEGLFEW+ M FGLSNAPST            
Subjt:  ------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPST------------

Query:  -------------------------------------------------------------------------FVRLMNETLT----------------Q
                                                                                 F+R  +  +T                Q
Subjt:  -------------------------------------------------------------------------FVRLMNETLT----------------Q

Query:  QESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
        Q+SF++IK++L S+PI+QL +F+SPFE+ VDA G GIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  QESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

A0A5D3C3X9 Reverse transcriptase (e value: 6.4e-82; % identity: 51.32)
Query:  PGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTM
        PGH SNTCPQRK IALA++E  S SE++++ EEEA+LIEADDG RVS                                      + +SEN VAKKLVT 
Subjt:  PGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSY-------------------------------------SDNSENIVAKKLVTM

Query:  LNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYE
        LNLKA  H NPYK+GW KKGGE  ++EICT+ LSIG+ YKDQI+CD+ E+D                            EREQDLLGLIIV+KS EEQ E
Subjt:  LNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYE

Query:  VVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-----DHLDQLGK----------------AIIFSKIDLKSGYHQIRI
         ++ +LQ+LF+EFPHLKKEP+GLPPL+D+Q  I+L+PGASL  L+HYRMSP +     +H+++L +                + +   IDLKSGYHQ+RI
Subjt:  VVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD-----DHLDQLGK----------------AIIFSKIDLKSGYHQIRI

Query:  RTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL
        R  DEWKTTFKTNEGLFEWM+M FGLSN PSTF+RLMN+ L
Subjt:  RTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X1 (e value: 4.3e-86; % identity: 40.11)
Query:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVS-----------YSDNSENIV
        I+   K K+ L       NY +    +   C  PGHLSN C QRK IALAE+E   MS  +++ EEE ELIEADDGDR+S              N ++  
Subjt:  IRSWKKMKKLLKARFLPPNYEQTLYNQYQNC-HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVS-----------YSDNSENIV

Query:  AKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLARK
          K    +N K   H +PYK+GW KKGGE ++NEICTI LSIG+ YKDQI+CD+ E+DV                                    PLA+K
Subjt:  AKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICDITEIDV------------------------------------PLARK

Query:  NSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--
        N+E  R  +++QLFITV GK LL EREQDLLGL++ +KS+    E+VEP+L+ELF EFPHLKKEP+GLPPL+D+QH I+LVP ASL NL HYRMSP++  
Subjt:  NSEGTR--HEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQD--

Query:  ------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNE
                                                                    D LDQLGKA+IFSKIDL++GYHQI+IR  DEWKT FKTNE
Subjt:  ------------------------------------------------------------DHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNE

Query:  GLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--SFDDIKKKLVSNPIIQL-------------------------
        GLFE         ++  LF +      ++                 L+ E   + E    + I+ +     I ++                         
Subjt:  GLFE---------WMVMLFGLSNAPSTFVR----------------LMNETLTQQE--SFDDIKKKLVSNPIIQL-------------------------

Query:  SNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ
          F+SPFE+AV+A GTGIG VLS QGHPIEYFSEKLS+SRQ
Subjt:  SNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLSSSRQ

SwissProt top hits (e value, % identity, alignment)
P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.4e-09; % identity: 44.59)
Query:  YRMSPQDDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL
        Y +   D+ L +LGK   F+ IDL  G+HQI +      KT F T  G +E++ M FGL NAP+TF R MN  L
Subjt:  YRMSPQDDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 2.7e-05; % identity: 33.85)
Query:  RLMNETLTQQESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLS
        ++  + L   E+F+ +K  ++ +PI+QL +F   F +  DA    +G VLS  GHPI + S  L+
Subjt:  RLMNETLTQQESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQGHPIEYFSEKLS

P27502 Polyprotein P3 (e value: 1.4e-09; % identity: 43.04)
Query:  LDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETLTQQESFDDIKKKLV
        ++ + KA IFSK DLK+G+H ++++   +  TTF  +EGL+ W V  FG++NAP  F R M      QESF D+K  L+
Subjt:  LDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETLTQQESFDDIKKKLV

P31843 RNA-directed DNA polymerase homolog (e value: 1.5e-11; % identity: 51.35)
Query:  YRMSPQDDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL
        Y +   DD  D+L +A  F+K+DL+SGY Q+RI   DE KTT  T  G FE+ VM FGL+NA +TF  LMN  L
Subjt:  YRMSPQDDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 5.1e-12; % identity: 52.24)
Query:  DDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNET
        D+ L ++G A IF+ +DL SGYHQI +  +D +KT F T  G +E+ VM FGL NAPSTF R M +T
Subjt:  DDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNET

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 5.1e-12; % identity: 52.24)
Query:  DDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNET
        D+ L ++G A IF+ +DL SGYHQI +  +D +KT F T  G +E+ VM FGL NAPSTF R M +T
Subjt:  DDHLDQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNET

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGACGAGAAAAGCGTCTATTTCAATGTTGAAGCCATGGGAAAATTGTACGATTTTCCTTTGGACGCTGAGATACCAAAGGAAGTCGTATTTACTAGAACTTTGAAAAG
GCAGACCAGGGAAGCATTGAAAATTATCATGTGGCTTGAGCACCTAAAAAGGCACTGCCCTTCCCGTTTACCATCGGTATGGTGGGATCAGCTGGAGATTTATTGGCAAA
GGTGTGGGAAGCAACCCATTCGTTCTTGGAAGAAAATGAAGAAACTATTGAAGGCACGATTTCTACCCCCAAATTATGAACAAACCTTATACAATCAGTACCAAAACTGC
CACCCAGGTCACCTTTCCAATACTTGCCCCCAACGAAAAATCATAGCGTTAGCTGAGGAAGAAGGAGGTTCGATGAGTGAAAATAACAAAAAAGCAGAGGAAGAAGCTGA
GCTTATTGAGGCGGATGATGGAGATAGAGTCTCTTACAGCGACAATAGTGAAAACATTGTGGCAAAGAAGCTAGTGACTATGTTGAATCTAAAGGCTAGAACCCATCTAA
ATCCTTACAAGATGGGATGGGGGAAGAAAGGAGGAGAGGTCGTGGTTAACGAAATCTGTACAATTACTCTTTCTATTGGAAGCGAGTACAAGGACCAAATTATTTGCGAT
ATCACTGAAATTGATGTTCCATTAGCAAGGAAGAATAGTGAAGGAACAAGACATGAGAAGCAATTATTCATCACTGTTTGTGGGAAAAAACTACTTAACGAAAGGGAACA
AGACCTCCTAGGGTTGATTATTGTTGAAAAGTCCAAGGAAGAACAATATGAGGTCGTGGAACCTAAATTACAAGAGCTATTTGATGAGTTCCCTCATTTGAAAAAAGAAC
CTGAGGGACTTCCACCCCTTCAAGATGTACAACATCATATAAACCTTGTTCCAGGAGCATCATTGTCAAACCTAGCTCATTATAGAATGAGTCCCCAAGATGATCACTTA
GACCAACTTGGCAAAGCCATCATTTTTTCAAAGATTGATCTAAAGAGTGGCTACCACCAAATACGCATTAGGACCAGAGATGAATGGAAGACAACCTTTAAGACCAATGA
AGGCTTATTTGAATGGATGGTCATGCTCTTTGGGTTATCTAATGCTCCTAGTACTTTCGTGAGATTGATGAATGAGACCCTAACACAACAAGAGAGCTTTGATGACATCA
AGAAGAAGTTGGTTTCCAACCCAATTATTCAACTATCAAACTTCTCTTCACCATTTGAAATGGCAGTTGATGCCTATGGCACTGGAATTGGGGTTGTCTTGTCTCATCAA
GGACACCCAATTGAATACTTCAGTGAAAAACTAAGCTCCTCAAGACAGTAA
mRNA sequence
ATGGACGAGAAAAGCGTCTATTTCAATGTTGAAGCCATGGGAAAATTGTACGATTTTCCTTTGGACGCTGAGATACCAAAGGAAGTCGTATTTACTAGAACTTTGAAAAG
GCAGACCAGGGAAGCATTGAAAATTATCATGTGGCTTGAGCACCTAAAAAGGCACTGCCCTTCCCGTTTACCATCGGTATGGTGGGATCAGCTGGAGATTTATTGGCAAA
GGTGTGGGAAGCAACCCATTCGTTCTTGGAAGAAAATGAAGAAACTATTGAAGGCACGATTTCTACCCCCAAATTATGAACAAACCTTATACAATCAGTACCAAAACTGC
CACCCAGGTCACCTTTCCAATACTTGCCCCCAACGAAAAATCATAGCGTTAGCTGAGGAAGAAGGAGGTTCGATGAGTGAAAATAACAAAAAAGCAGAGGAAGAAGCTGA
GCTTATTGAGGCGGATGATGGAGATAGAGTCTCTTACAGCGACAATAGTGAAAACATTGTGGCAAAGAAGCTAGTGACTATGTTGAATCTAAAGGCTAGAACCCATCTAA
ATCCTTACAAGATGGGATGGGGGAAGAAAGGAGGAGAGGTCGTGGTTAACGAAATCTGTACAATTACTCTTTCTATTGGAAGCGAGTACAAGGACCAAATTATTTGCGAT
ATCACTGAAATTGATGTTCCATTAGCAAGGAAGAATAGTGAAGGAACAAGACATGAGAAGCAATTATTCATCACTGTTTGTGGGAAAAAACTACTTAACGAAAGGGAACA
AGACCTCCTAGGGTTGATTATTGTTGAAAAGTCCAAGGAAGAACAATATGAGGTCGTGGAACCTAAATTACAAGAGCTATTTGATGAGTTCCCTCATTTGAAAAAAGAAC
CTGAGGGACTTCCACCCCTTCAAGATGTACAACATCATATAAACCTTGTTCCAGGAGCATCATTGTCAAACCTAGCTCATTATAGAATGAGTCCCCAAGATGATCACTTA
GACCAACTTGGCAAAGCCATCATTTTTTCAAAGATTGATCTAAAGAGTGGCTACCACCAAATACGCATTAGGACCAGAGATGAATGGAAGACAACCTTTAAGACCAATGA
AGGCTTATTTGAATGGATGGTCATGCTCTTTGGGTTATCTAATGCTCCTAGTACTTTCGTGAGATTGATGAATGAGACCCTAACACAACAAGAGAGCTTTGATGACATCA
AGAAGAAGTTGGTTTCCAACCCAATTATTCAACTATCAAACTTCTCTTCACCATTTGAAATGGCAGTTGATGCCTATGGCACTGGAATTGGGGTTGTCTTGTCTCATCAA
GGACACCCAATTGAATACTTCAGTGAAAAACTAAGCTCCTCAAGACAGTAA
Protein sequence
MDEKSVYFNVEAMGKLYDFPLDAEIPKEVVFTRTLKRQTREALKIIMWLEHLKRHCPSRLPSVWWDQLEIYWQRCGKQPIRSWKKMKKLLKARFLPPNYEQTLYNQYQNC
HPGHLSNTCPQRKIIALAEEEGGSMSENNKKAEEEAELIEADDGDRVSYSDNSENIVAKKLVTMLNLKARTHLNPYKMGWGKKGGEVVVNEICTITLSIGSEYKDQIICD
ITEIDVPLARKNSEGTRHEKQLFITVCGKKLLNEREQDLLGLIIVEKSKEEQYEVVEPKLQELFDEFPHLKKEPEGLPPLQDVQHHINLVPGASLSNLAHYRMSPQDDHL
DQLGKAIIFSKIDLKSGYHQIRIRTRDEWKTTFKTNEGLFEWMVMLFGLSNAPSTFVRLMNETLTQQESFDDIKKKLVSNPIIQLSNFSSPFEMAVDAYGTGIGVVLSHQ
GHPIEYFSEKLSSSRQ
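
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), since this page does not state which table was used, and the helper name `translate` is hypothetical. For brevity it is applied only to the first 30 nucleotides and to the final line of the CDS as listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and final line of the CDS, copied from the record above.
cds_start = "ATGGACGAGAAAAGCGTCTATTTCAATGTT"
cds_end = "GGACACCCAATTGAATACTTCAGTGAAAAACTAAGCTCCTCAAGACAGTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can be compared with the beginning and the last line of the protein sequence; the same function can be run on the full CDS once its lines are joined into one string.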