; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005710 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005710
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr6:26904189..26913910
RNA-Seq ExpressionLag0005710
SyntenyLag0005710
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN12235.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.3e-10144.98Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++E V+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+ T++TLQLA+RS+TY +  
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +N              +
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V

Query:  ETAMEDLTNLD-------------------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E A+ DL + D                         ER AP   +K S+ +  T++LK LP HL                                    
Subjt:  ETAMEDLTNLD-------------------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++    S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I+PED+EKTTFTCPYGTFAFRRM FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.5e-10145.38Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+PT++TLQLA+RS+TY    
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V

Query:  ETAMEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E A+ DL +            LD             ER AP   +K S+ E  T++LK LP HL                                    
Subjt:  ETAMEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++G   S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I PED+EKTTFTCPYGTF FR+M FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.5e-10145.18Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+PT++TLQLA+RS+TY +  
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRI-----------------L
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + +                 L
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRI-----------------L

Query:  ENTIV-------ETAMEDLTNLD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E  ++       E  +E +  LD             ER  P   +K S+ +  T++LK LP HL                                    
Subjt:  ENTIV-------ETAMEDLTNLD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++    S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I+PED+EKTTFTCPYGTFAFRRM FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]5.2e-10748.93Show/hide
Query:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----
        DIL KK+RLGEFE V+LT+E S IL   LP K  DPGSFTIPV IGGK +G A+CDLGASINLMPLS+YQKLGIGEARP TVTLQLA+RSITYLE     
Subjt:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----

Query:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTIV-ETAMEDLTN-----
                             DK +PIILGR FL+TGRALIDV  GELT+RV +++V  ++F ++KYP +VE+CS++RI ++ +  E   E+L N     
Subjt:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTIV-ETAMEDLTN-----

Query:  ----LDERKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGISPSFCMHKITLDEGSFRSIE
            + +R   P++ S+++A  ++LK LP HL                                   KAIGWTLADI+GISPS+CMHKI L+EG   SIE
Subjt:  ----LDERKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGISPSFCMHKITLDEGSFRSIE

Query:  QQRKLNPAMKEVVKKEVIKLLDVGIIYPIVD--------------------------------SNW-------------------------MLDRLVGQA
         QR+LNPAMKEVVKKE+IK LD GIIYPI D                                + W                         MLD LVGQ 
Subjt:  QQRKLNPAMKEVVKKEVIKLLDVGIIYPIVD--------------------------------SNW-------------------------MLDRLVGQA

Query:  YYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
        YYY L GY+GYN+I I P+D++KTTFTCPYGTF+FRRM FGLCNA  TFQ CM+AI  D+IE+ VE
Subjt:  YYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]1.6e-10045.98Show/hide
Query:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----
        D+LT +++  EF+ V L EECS ILKN +P K KDPGSFTIP+SIGGK+LGRA+CDLG+SINLMPLSIY+KLGIGEARPTTVTLQLA+RS TY E     
Subjt:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----

Query:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRIL-----------------ENT
                             D  VPIILGR FL TGR L+DV KG +T+R+ +++V+FN+  +MKYP   E+CS +  L                 E++
Subjt:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRIL-----------------ENT

Query:  IVETAMEDLTNLDE------------RKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGIS
             +E L  L E            RK+ P++ S+ EA  +DLK LP +L                                    AIGWTLADI+GIS
Subjt:  IVETAMEDLTNLDE------------RKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGIS

Query:  PSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSN--------------------------------W---------------
        PS CMHKI L+EG  +SIEQQR+LNP MKEVV+KE++K LD GIIYPI +S+                                W               
Subjt:  PSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSN--------------------------------W---------------

Query:  ----------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                  MLDRL G+++Y FL GYSGYN+I ISPED+EKTTFTCPYG FAFRRM FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  ----------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

TrEMBL top hitse value%identityAlignment
A0A2G9H400 Reverse transcriptase2.1e-10144.98Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++E V+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+ T++TLQLA+RS+TY +  
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +N              +
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V

Query:  ETAMEDLTNLD-------------------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E A+ DL + D                         ER AP   +K S+ +  T++LK LP HL                                    
Subjt:  ETAMEDLTNLD-------------------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++    S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I+PED+EKTTFTCPYGTFAFRRM FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

A0A2G9HH15 Reverse transcriptase1.2e-10145.38Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+PT++TLQLA+RS+TY    
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------V

Query:  ETAMEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E A+ DL +            LD             ER AP   +K S+ E  T++LK LP HL                                    
Subjt:  ETAMEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++G   S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I PED+EKTTFTCPYGTF FR+M FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

A0A2G9HYA0 Reverse transcriptase1.2e-10145.18Show/hide
Query:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--
        +  DIL+KK+RLG++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRA+CDLGASINLMP SIY+ LG+GEA+PT++TLQLA+RS+TY +  
Subjt:  YQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE--

Query:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRI-----------------L
                                D  VPIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + +                 L
Subjt:  ------------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRI-----------------L

Query:  ENTIV-------ETAMEDLTNLD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK
        E  ++       E  +E +  LD             ER  P   +K S+ +  T++LK LP HL                                    
Subjt:  ENTIV-------ETAMEDLTNLD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NK

Query:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------
        AIGWT+ADI+GISPSFCMHKI L++    S+E QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                                  
Subjt:  AIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW----------------------------------

Query:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
                               MLDRL G+ +Y FL GYSGYN+I I+PED+EKTTFTCPYGTFAFRRM FGLCNA ATFQ CM+AI  DM+E+ +E
Subjt:  -----------------------MLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

A0A2G9I815 DNA-directed DNA polymerase1.0e-10048.14Show/hide
Query:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----
        DIL+KK+RLG++ETV+L EECS I++N LP K KDPGSFTIP  I     GRA+CDLGASINLMP SIY+ L +GEA+PT++TLQLA+RS TY +     
Subjt:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----

Query:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------VETA
                             D  +PIILGR FLATGR LIDVQKGELTMRV ++++ FNVFKA+K+P+E ++C  + + +N              +E A
Subjt:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTI------------VETA

Query:  MEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NKAIG
        + DL N            LD              R AP   +K S+ E  T++LK LP HL                                   +AIG
Subjt:  MEDLTN------------LD-------------ERKAPP--IKSSLIEALTVDLKSLPDHL----------------------------------NKAIG

Query:  WTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW-------------------MLDRLVGQAYYYFLGGYS
        WT+ADI+GIS SFCMHKI L+     S+  QR+LNP MKEVVKKE+IK LD GIIYPI DS+W                   MLDRL G+ +Y FL GYS
Subjt:  WTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNW-------------------MLDRLVGQAYYYFLGGYS

Query:  GYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
        GYN+I I+PED+EKTTFTCPYGTFAFRRM F LCNA ATFQ CM+AI  DM+E+  E
Subjt:  GYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

A0A6J1DV77 uncharacterized protein LOC1110238182.5e-10748.93Show/hide
Query:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----
        DIL KK+RLGEFE V+LT+E S IL   LP K  DPGSFTIPV IGGK +G A+CDLGASINLMPLS+YQKLGIGEARP TVTLQLA+RSITYLE     
Subjt:  DILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTLQLANRSITYLE-----

Query:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTIV-ETAMEDLTN-----
                             DK +PIILGR FL+TGRALIDV  GELT+RV +++V  ++F ++KYP +VE+CS++RI ++ +  E   E+L N     
Subjt:  ---------------------DKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTIV-ETAMEDLTN-----

Query:  ----LDERKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGISPSFCMHKITLDEGSFRSIE
            + +R   P++ S+++A  ++LK LP HL                                   KAIGWTLADI+GISPS+CMHKI L+EG   SIE
Subjt:  ----LDERKAPPIKSSLIEALTVDLKSLPDHL----------------------------------NKAIGWTLADIQGISPSFCMHKITLDEGSFRSIE

Query:  QQRKLNPAMKEVVKKEVIKLLDVGIIYPIVD--------------------------------SNW-------------------------MLDRLVGQA
         QR+LNPAMKEVVKKE+IK LD GIIYPI D                                + W                         MLD LVGQ 
Subjt:  QQRKLNPAMKEVVKKEVIKLLDVGIIYPIVD--------------------------------SNW-------------------------MLDRLVGQA

Query:  YYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE
        YYY L GY+GYN+I I P+D++KTTFTCPYGTF+FRRM FGLCNA  TFQ CM+AI  D+IE+ VE
Subjt:  YYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTVE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.6e-0534.29Show/hide
Query:  FRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWC
        FR +   RKLN              + VG  +PI + + +L +L    Y+  +    G+++I + PE   KT F+  +G + + RM FGL NA ATFQ C
Subjt:  FRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWC

Query:  MLAIL
        M  IL
Subjt:  MLAIL

P20825 Retrovirus-related Pol polyprotein from transposon 2975.1e-0437.84Show/hide
Query:  YPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAIL
        YPI + + +L +L    Y+  +    G+++I +  E   KT F+   G + + RM FGL NA ATFQ CM  IL
Subjt:  YPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAIL

P31843 RNA-directed DNA polymerase homolog1.1e-0640.24Show/hide
Query:  YPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTV
        YPI   + + DRL    ++  L   SGY ++ I+  D  KTT    YG+F FR M FGL NALATF   M  +L++ ++  V
Subjt:  YPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYGTFAFRRMSFGLCNALATFQWCMLAILFDMIESTV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAGGAGAGAGCAAAGGAAAGAAATAAAGGAAGGAAGGATGGGATGATTGATGCCAAAATGGGGTACCAAGGGGATATATTGACTAAAAAGAAGAGGTTAGGAGA
ATTTGAAACTGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATTCCCGTCTCAATAGGTGGAA
AAGAGTTGGGTAGAGCAATTTGTGATTTAGGTGCGAGCATTAACCTTATGCCTCTTTCTATCTATCAAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTC
CAACTAGCTAATAGGTCTATCACATATCTAGAGGATAAAGGTGTCCCAATTATTCTTGGTCGTCTATTTTTGGCTACTGGTAGAGCGTTGATAGATGTTCAGAAAGGGGA
ATTAACAATGAGGGTTTATAATGAGGAAGTGAAGTTTAATGTCTTTAAGGCCATGAAGTATCCAGATGAAGTGGAAGATTGTTCTTTCATTCGAATTCTAGAGAACACAA
TTGTTGAGACAGCAATGGAGGATTTGACAAACTTGGATGAAAGGAAAGCTCCTCCTATTAAGTCATCCCTGATTGAGGCACTTACTGTAGATTTGAAGTCCTTACCAGAT
CATCTAAACAAGGCAATAGGTTGGACATTGGCTGACATACAGGGAATTAGCCCTTCTTTCTGTATGCATAAAATCACTCTAGATGAAGGATCCTTTAGGAGTATTGAGCA
ACAGAGAAAGCTTAACCCTGCAATGAAAGAAGTTGTTAAGAAAGAGGTGATCAAATTGTTGGATGTTGGGATCATTTATCCAATTGTGGATAGCAATTGGATGTTGGATA
GGTTGGTTGGTCAGGCTTACTACTATTTCTTAGGTGGTTATTCTGGGTATAACAAGATTATCATTTCTCCTGAGGATCGGGAAAAAACCACTTTCACCTGCCCTTATGGG
ACGTTTGCTTTCAGACGAATGTCTTTTGGCCTTTGCAATGCTCTAGCAACATTTCAGTGGTGTATGTTAGCAATTTTGTTTGATATGATTGAGTCCACTGTTGAGTACTG
TTGTTGCAGACGATTTAATATTGACAGGCGAAAATGTCCTTGGTTGAGTGGAACACTTGAAGATTTACTACCAGTAGCTTCTAGAGAGATGGAGCACCACTGCCCCACCA
CAGGAAGGCACAGCCCGACCAGGCCGAGGATGACCAGGCCAAGACCGACCAGTTCTGTTATTGCAGATGATTTAGTGCTACAGACGAAAGTGTCCTCGATGGAGTGGCAA
CGCCTATACTCCCCGAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGGATACGGCCCGCTTTGTATATTGATACAAACGTAGTGATCCAACACGTTCATGTGGT
TGACATGCGAGTGGGGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTACAAGATGGAATTCACTCCTTCCTGATA
TGAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGATTTTTGGGCTCCCACAAGCACCTAACGCAGTATCCCAATTGGAGAATACCGGTG
TATCCTTGTGGTGGTGTTCGTGCAATTTTCCAGCAAAATCAAGGAGTGATCGCTGCTGTTTTTTGGGAGAAATTCAGAGAGAATTGGCCAAACAGTAAAGGGTTTCTTCA
AAGAACCCCCGGGAACGGAGCCAGGGCTCAAACCAGAATCATACTAATGGAGAAAGGTTCAGAGACGCTAAAGTTTCGAGGCACGAGCTCAAGGCCAACTAGGCTAAGTA
CATTGTTAGTGCTCTATGTGCTTTTTGCAGGAATGATGATGCTGAAATTGTTGTGGAACGGGACCGAGGTCGACCAGACCACGCCCATACCTTGTTTATGCAAGAGTGTC
ATTCTCCAGACCTTAAGGCATAGGCTCAAGGTCAACCGGGGGCTTGAGGGACTTGAGGTCGAGGCCGACCCTAGTCTCTTCCCTCATTTTTCTTTTGCAGATAACATTTC
GATGCAGGAACTCGAAACTTTCAATAATATGAAAGATGGTCAAGGCCGACCGCAAGGTCTTACCACGGAGCTGACGAGGACAGTGCGATACCAGTCCAAGGAGACAGCCA
GGAAACGGGATCCGGAGGAAGAGCAGGCCAAAGGGTCGGGCCAAAGCCGAAGGGATCGGGTCCGACCCCCTGCTCGGCTTCGGCCTTGGGTCGAGGCTGACCACTCGACC
CGCTTGCGCGGGCTGAGTTCTTCCGCCTTCGTTCGGTCCCTGGCGCCCCCGGTTGCCTCTGTTCAGCCCAAATCACCTCCGAATGCCTACAACGCTAGGAGCATGAATAG
ATATTTAAACCCTTCTTCGTCACTGAAGAAGGGATCCGAAAAATCTATCTTCTCTTCCGCCAGCTCTCAGGTTCTGTTCAGATTCCCGCCGATTACTGACTTAAGCATCG
GAGGTGGTGTGGCAAGCACCACACCGGTGTACAGATCTTTCTTTGATGACTTTGAAAAAGTGAAGTCCACCCCATACTTCCTCGTTAAGAATAAGGGAAATTGTGCTAGT
ATTAATATCATGACTTTATCTTTATGTAAGAGGTTGAACATAGGAGATATTAAATCTACTTCTGTTAAACTCCAATTAGCTAATCAGTTTGTGGAGAACCCTTTAGTACC
TGTTATTTTAGGGAGATCATTTCTCGCTACTAGGCGGGTTATTATTGATATTGACCGCAGGGAGCTTACTGTTAAAGTCCAACACGAGAAAGAAGTATTCAAAGCATTTG
AGGACTCTAAGGATCATTTTGAGGCATTGATAGCGAGCATAGCTAGCCAGTCTTTCCATCTCGTTGCTCTTGGTTTGCTAGGGATCATGAAGAAGTCATCACTTCAGGGC
AGCATTGTTCACCCCTTTATCTATTTTGTTGTTCTCATGCTTACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCAAGGAGAGAGCAAAGGAAAGAAATAAAGGAAGGAAGGATGGGATGATTGATGCCAAAATGGGGTACCAAGGGGATATATTGACTAAAAAGAAGAGGTTAGGAGA
ATTTGAAACTGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATTCCCGTCTCAATAGGTGGAA
AAGAGTTGGGTAGAGCAATTTGTGATTTAGGTGCGAGCATTAACCTTATGCCTCTTTCTATCTATCAAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTC
CAACTAGCTAATAGGTCTATCACATATCTAGAGGATAAAGGTGTCCCAATTATTCTTGGTCGTCTATTTTTGGCTACTGGTAGAGCGTTGATAGATGTTCAGAAAGGGGA
ATTAACAATGAGGGTTTATAATGAGGAAGTGAAGTTTAATGTCTTTAAGGCCATGAAGTATCCAGATGAAGTGGAAGATTGTTCTTTCATTCGAATTCTAGAGAACACAA
TTGTTGAGACAGCAATGGAGGATTTGACAAACTTGGATGAAAGGAAAGCTCCTCCTATTAAGTCATCCCTGATTGAGGCACTTACTGTAGATTTGAAGTCCTTACCAGAT
CATCTAAACAAGGCAATAGGTTGGACATTGGCTGACATACAGGGAATTAGCCCTTCTTTCTGTATGCATAAAATCACTCTAGATGAAGGATCCTTTAGGAGTATTGAGCA
ACAGAGAAAGCTTAACCCTGCAATGAAAGAAGTTGTTAAGAAAGAGGTGATCAAATTGTTGGATGTTGGGATCATTTATCCAATTGTGGATAGCAATTGGATGTTGGATA
GGTTGGTTGGTCAGGCTTACTACTATTTCTTAGGTGGTTATTCTGGGTATAACAAGATTATCATTTCTCCTGAGGATCGGGAAAAAACCACTTTCACCTGCCCTTATGGG
ACGTTTGCTTTCAGACGAATGTCTTTTGGCCTTTGCAATGCTCTAGCAACATTTCAGTGGTGTATGTTAGCAATTTTGTTTGATATGATTGAGTCCACTGTTGAGTACTG
TTGTTGCAGACGATTTAATATTGACAGGCGAAAATGTCCTTGGTTGAGTGGAACACTTGAAGATTTACTACCAGTAGCTTCTAGAGAGATGGAGCACCACTGCCCCACCA
CAGGAAGGCACAGCCCGACCAGGCCGAGGATGACCAGGCCAAGACCGACCAGTTCTGTTATTGCAGATGATTTAGTGCTACAGACGAAAGTGTCCTCGATGGAGTGGCAA
CGCCTATACTCCCCGAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGGATACGGCCCGCTTTGTATATTGATACAAACGTAGTGATCCAACACGTTCATGTGGT
TGACATGCGAGTGGGGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAGTCTTACAAGATGGAATTCACTCCTTCCTGATA
TGAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGATTTTTGGGCTCCCACAAGCACCTAACGCAGTATCCCAATTGGAGAATACCGGTG
TATCCTTGTGGTGGTGTTCGTGCAATTTTCCAGCAAAATCAAGGAGTGATCGCTGCTGTTTTTTGGGAGAAATTCAGAGAGAATTGGCCAAACAGTAAAGGGTTTCTTCA
AAGAACCCCCGGGAACGGAGCCAGGGCTCAAACCAGAATCATACTAATGGAGAAAGGTTCAGAGACGCTAAAGTTTCGAGGCACGAGCTCAAGGCCAACTAGGCTAAGTA
CATTGTTAGTGCTCTATGTGCTTTTTGCAGGAATGATGATGCTGAAATTGTTGTGGAACGGGACCGAGGTCGACCAGACCACGCCCATACCTTGTTTATGCAAGAGTGTC
ATTCTCCAGACCTTAAGGCATAGGCTCAAGGTCAACCGGGGGCTTGAGGGACTTGAGGTCGAGGCCGACCCTAGTCTCTTCCCTCATTTTTCTTTTGCAGATAACATTTC
GATGCAGGAACTCGAAACTTTCAATAATATGAAAGATGGTCAAGGCCGACCGCAAGGTCTTACCACGGAGCTGACGAGGACAGTGCGATACCAGTCCAAGGAGACAGCCA
GGAAACGGGATCCGGAGGAAGAGCAGGCCAAAGGGTCGGGCCAAAGCCGAAGGGATCGGGTCCGACCCCCTGCTCGGCTTCGGCCTTGGGTCGAGGCTGACCACTCGACC
CGCTTGCGCGGGCTGAGTTCTTCCGCCTTCGTTCGGTCCCTGGCGCCCCCGGTTGCCTCTGTTCAGCCCAAATCACCTCCGAATGCCTACAACGCTAGGAGCATGAATAG
ATATTTAAACCCTTCTTCGTCACTGAAGAAGGGATCCGAAAAATCTATCTTCTCTTCCGCCAGCTCTCAGGTTCTGTTCAGATTCCCGCCGATTACTGACTTAAGCATCG
GAGGTGGTGTGGCAAGCACCACACCGGTGTACAGATCTTTCTTTGATGACTTTGAAAAAGTGAAGTCCACCCCATACTTCCTCGTTAAGAATAAGGGAAATTGTGCTAGT
ATTAATATCATGACTTTATCTTTATGTAAGAGGTTGAACATAGGAGATATTAAATCTACTTCTGTTAAACTCCAATTAGCTAATCAGTTTGTGGAGAACCCTTTAGTACC
TGTTATTTTAGGGAGATCATTTCTCGCTACTAGGCGGGTTATTATTGATATTGACCGCAGGGAGCTTACTGTTAAAGTCCAACACGAGAAAGAAGTATTCAAAGCATTTG
AGGACTCTAAGGATCATTTTGAGGCATTGATAGCGAGCATAGCTAGCCAGTCTTTCCATCTCGTTGCTCTTGGTTTGCTAGGGATCATGAAGAAGTCATCACTTCAGGGC
AGCATTGTTCACCCCTTTATCTATTTTGTTGTTCTCATGCTTACTTAG
Protein sequenceShow/hide protein sequence
MLKERAKERNKGRKDGMIDAKMGYQGDILTKKKRLGEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRAICDLGASINLMPLSIYQKLGIGEARPTTVTL
QLANRSITYLEDKGVPIILGRLFLATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEVEDCSFIRILENTIVETAMEDLTNLDERKAPPIKSSLIEALTVDLKSLPD
HLNKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRKLNPAMKEVVKKEVIKLLDVGIIYPIVDSNWMLDRLVGQAYYYFLGGYSGYNKIIISPEDREKTTFTCPYG
TFAFRRMSFGLCNALATFQWCMLAILFDMIESTVEYCCCRRFNIDRRKCPWLSGTLEDLLPVASREMEHHCPTTGRHSPTRPRMTRPRPTSSVIADDLVLQTKVSSMEWQ
RLYSPRDKAGYLILVTLWIRPALYIDTNVVIQHVHVVDMRVGFADSISLPFWGQDRMGSWERSLTRWNSLLPDMRVSRGPLGPTGSSFRALRFLGSHKHLTQYPNWRIPV
YPCGGVRAIFQQNQGVIAAVFWEKFRENWPNSKGFLQRTPGNGARAQTRIILMEKGSETLKFRGTSSRPTRLSTLLVLYVLFAGMMMLKLLWNGTEVDQTTPIPCLCKSV
ILQTLRHRLKVNRGLEGLEVEADPSLFPHFSFADNISMQELETFNNMKDGQGRPQGLTTELTRTVRYQSKETARKRDPEEEQAKGSGQSRRDRVRPPARLRPWVEADHST
RLRGLSSSAFVRSLAPPVASVQPKSPPNAYNARSMNRYLNPSSSLKKGSEKSIFSSASSQVLFRFPPITDLSIGGGVASTTPVYRSFFDDFEKVKSTPYFLVKNKGNCAS
INIMTLSLCKRLNIGDIKSTSVKLQLANQFVENPLVPVILGRSFLATRRVIIDIDRRELTVKVQHEKEVFKAFEDSKDHFEALIASIASQSFHLVALGLLGIMKKSSLQG
SIVHPFIYFVVLMLT