; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015918 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015918
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr12:29155076..29158462
RNA-Seq ExpressionLag0015918
SyntenyLag0015918
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN00194.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.0e-14649Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ    S   N   +Q  N            P F + QLAN + +RPQG LPS+TE +PR+ 
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE

Query:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------
         K Q +AVTLR+G+ L+E  K    +  +    VI +E E    A   K +     ++P  EA     P YV  +        K R+ DY          
Subjt:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------

Query:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD
           Q    PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+G+A PT +TLQLADRS+TYP+G IED+LVKVDKFIFP  F++LD E D
Subjt:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------
         +VPIILGRPFLATGR LI+VQKGELTMRV ++++ FNVFKAMK+ +E ++C  + + ++              +E A+ D  D+ +E+  E        
Subjt:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------

Query:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE
                                                                                ++ AIGWT+ADI+GIS SFCMHKI LE+
Subjt:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML
            S++ QRRLNP MKEVVKKE+IKWLDVGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHF LPFID+ML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML

Query:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        DRLAG+ +YCFLDGYSGYNQI IAPE++EK TFTCPYGTFAFRRMPFGL
Subjt:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

PIN12235.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.9e-14949.61Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ    S   N   +Q  N            P F + QLAN + +RPQG L S+TE +PR++
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE

Query:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------
        GK Q +AVTLR+G+ L+E  K    +  +    VI +E E    A   K +     ++P  EA     P YV  +        K R+ DY          
Subjt:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------

Query:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD
           Q    PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA  T +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD + D
Subjt:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------
         +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E A+ D  D+ +E+  E        
Subjt:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------

Query:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE
                                                                                ++ AIGWT+ADI+GIS SFCMHKI LE+
Subjt:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML
            S+E QRRLNP MKEVVKKE+IKWLD GIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTG RVCMDYR+LNKATRKDHFPLPFID+ML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML

Query:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        DRLAG+ +YCFLDGYSGYNQI IAPED+EKTTFTCPYGTFAFRRMPFGL
Subjt:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.5e-14845.55Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ-----------------------LLMGS-----QGNNVQAQQKMNQ--PGFAK-------
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ                          GS     QG   Q QQ M +  P   +       
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ-----------------------LLMGS-----QGNNVQAQQKMNQ--PGFAK-------

Query:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV
                      QLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E  K    +  +    VI +E E                   +V
Subjt:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV

Query:  EAPYVPPPPYVPPLPFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFT
        EAP     P     PFPQR                               Q P            K R+ DY             Q    PK KDPGSFT
Subjt:  EAPYVPPPPYVPPLPFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFT

Query:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL
        IP +IG    GRALCDLGASINLMP S+YR LG+GEA PT +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR L
Subjt:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL

Query:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------------------------
        IDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +E A+ D  D+ +E+  E                          
Subjt:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------------------------

Query:  -----------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKE
                                                              ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLNP MKE
Subjt:  -----------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKE

Query:  VVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGY
        VVKKE+IKWLD GIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFID+MLDRLAG+ +YCFLDGYSGY
Subjt:  VVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGY

Query:  NQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        NQI IAPED+EKTTFTCPYGTFAFRRMPFGL
Subjt:  NQITIAPEDREKTTFTCPYGTFAFRRMPFGL

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.6e-14749.59Show/hide
Query:  QLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPL
        QLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E  K    +  +    VI KE E                   +VEAP     P     
Subjt:  QLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPL

Query:  PFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRAL
        PFPQ+                               Q P            K R+ DY             Q    PK KDPGSFTIP +IG    GRAL
Subjt:  PFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRAL

Query:  CDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCN
        CDLGASINLMP S+YR LG+GEA PT +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV +
Subjt:  CDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCN

Query:  EEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE----------------------------------------
        +++ FNVFKAMK+P+E ++C  + + ++              +E A+ D  ++ +E+  E                                        
Subjt:  EEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE----------------------------------------

Query:  ---------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGI
                                                ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLD GI
Subjt:  ---------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGI

Query:  IYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTT
        IYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFID+MLDRLAG+ +YCFLDGYSGYNQI IAPED+EKTT
Subjt:  IYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTT

Query:  FTCPYGTFAFRRMPFGL
        FTCPYGTFAFRRMPFGL
Subjt:  FTCPYGTFAFRRMPFGL

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]3.8e-14945.95Show/hide
Query:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVLAQPPQLLMGSQGNNVQAQQKMNQPGFAKAQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKP
        AAV+NQ   E+C         E+   N A++ +Q   L       N++ Q           QLANEL+ RP  KLP+DTE P+REG EQ +A+ LRSGK 
Subjt:  AAVVNQVTDEACVYCGEDHNYEFCPSNPASVLAQPPQLLMGSQGNNVQAQQKMNQPGFAKAQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKP

Query:  L------------EESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPP-------------YVPPLPFPQRQKPKNRMCDY
        +              S++T D    +   VV E+  ++   A   KE++     +       V PP              Y P  PFPQR K K     +
Subjt:  L------------EESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPP-------------YVPPLPFPQRQKPKNRMCDY

Query:  QEW---------STP-----------------------------------------------KAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVY
        +++         + P                                               K KDPGSFTIP+SIGGK+LGRALCDLG+SINLMPLS+Y
Subjt:  QEW---------STP-----------------------------------------------KAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVY

Query:  RKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPD
        +KLGIGEA PT VTLQLADRS TYPEGKIED+L++VDKFIFP DFIILDYEAD DVPIILGRPFL TGR L+DV KG +T+R+ +++V+FN+  +MKYP 
Subjt:  RKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPD

Query:  EMEDCS---------------------------------------FIRILES------------------------------------------TVIETA
          E+CS                                       F R  ES                                           +I   
Subjt:  EMEDCS---------------------------------------FIRILES------------------------------------------TVIETA

Query:  IQDSADKHSEKHGEQYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVV
        +  + +    +  ++++ AIGWTLADI+GIS S CMHKI LEEG  +SIEQQRRLNP MKEVV+KE++KWLD GIIYPIA+S+ VSP+QCVPKKGG+TV+
Subjt:  IQDSADKHSEKHGEQYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVV

Query:  SNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        +N++NELI TR V GWR+CMDYRRLNKATRKDHFPLPFID+MLDRLAG+++YCFLDGYSGYNQITI+PED+EKTTFTCPYG FAFRRMPFGL
Subjt:  SNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

TrEMBL top hitse value%identityAlignment
A0A2G9G4F1 DNA-directed DNA polymerase5.0e-14749Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ    S   N   +Q  N            P F + QLAN + +RPQG LPS+TE +PR+ 
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE

Query:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------
         K Q +AVTLR+G+ L+E  K    +  +    VI +E E    A   K +     ++P  EA     P YV  +        K R+ DY          
Subjt:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------

Query:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD
           Q    PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+G+A PT +TLQLADRS+TYP+G IED+LVKVDKFIFP  F++LD E D
Subjt:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------
         +VPIILGRPFLATGR LI+VQKGELTMRV ++++ FNVFKAMK+ +E ++C  + + ++              +E A+ D  D+ +E+  E        
Subjt:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------

Query:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE
                                                                                ++ AIGWT+ADI+GIS SFCMHKI LE+
Subjt:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML
            S++ QRRLNP MKEVVKKE+IKWLDVGIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHF LPFID+ML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML

Query:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        DRLAG+ +YCFLDGYSGYNQI IAPE++EK TFTCPYGTFAFRRMPFGL
Subjt:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

A0A2G9H400 Reverse transcriptase1.4e-14949.61Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ    S   N   +Q  N            P F + QLAN + +RPQG L S+TE +PR++
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQLLMGSQGNNVQAQQKMN-----------QPGFAKAQLANELKARPQGKLPSDTE-HPRRE

Query:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------
        GK Q +AVTLR+G+ L+E  K    +  +    VI +E E    A   K +     ++P  EA     P YV  +        K R+ DY          
Subjt:  GKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDY----------

Query:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD
           Q    PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA  T +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD + D
Subjt:  ---QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------
         +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E A+ D  D+ +E+  E        
Subjt:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------

Query:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE
                                                                                ++ AIGWT+ADI+GIS SFCMHKI LE+
Subjt:  -----------------------------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML
            S+E QRRLNP MKEVVKKE+IKWLD GIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTG RVCMDYR+LNKATRKDHFPLPFID+ML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKML

Query:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        DRLAG+ +YCFLDGYSGYNQI IAPED+EKTTFTCPYGTFAFRRMPFGL
Subjt:  DRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

A0A2G9HH15 Reverse transcriptase2.5e-14646.66Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ----------------------------LLMGSQGNNVQAQQKMNQ--PGFAK-------
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ                             L   QG   Q QQ M +  P   +       
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ----------------------------LLMGSQGNNVQAQQKMNQ--PGFAK-------

Query:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV
                      QLAN + +RP+  LPS+TE +PR++ K Q +AVTLR+G  L+E  K    +         EKE+ S        E  G     P  
Subjt:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV

Query:  EAPYVPPPPYVPPLPFPQRQKPKNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVT
               P YV  +        K R+ DY             Q    PK KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA PT +T
Subjt:  EAPYVPPPPYVPPLPFPQRQKPKNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVT

Query:  LQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILEST
        LQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +  
Subjt:  LQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILEST

Query:  V------------IETAIQDSADKHSEKHGE---------------------------------------------------------------------
                     +E A+ D  D+ +E+  E                                                                     
Subjt:  V------------IETAIQDSADKHSEKHGE---------------------------------------------------------------------

Query:  ----------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKD
                   ++ AIGWT+ADI+GIS SFCMHKI LE+G   S+E QRRLNP MKEVVKKE+IKWLD GIIYPI+DS+WVSPVQCVPKKGG+TVV N  
Subjt:  ----------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKD

Query:  NELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFID+MLDRLAG+ +YCFLDGYSGYNQI I PED+EKTTFTCPYGTF FR+MPFGL
Subjt:  NELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

A0A2G9HYA0 Reverse transcriptase1.2e-14845.55Show/hide
Query:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ-----------------------LLMGS-----QGNNVQAQQKMNQ--PGFAK-------
        VNQV  T   C  CGE H  + CP +  S+     A+ PQ                          GS     QG   Q QQ M +  P   +       
Subjt:  VNQV--TDEACVYCGEDHNYEFCPSNPASVL----AQPPQ-----------------------LLMGS-----QGNNVQAQQKMNQ--PGFAK-------

Query:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV
                      QLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E  K    +  +    VI +E E                   +V
Subjt:  -------------AQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDV

Query:  EAPYVPPPPYVPPLPFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFT
        EAP     P     PFPQR                               Q P            K R+ DY             Q    PK KDPGSFT
Subjt:  EAPYVPPPPYVPPLPFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFT

Query:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL
        IP +IG    GRALCDLGASINLMP S+YR LG+GEA PT +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR L
Subjt:  IPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRAL

Query:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------------------------
        IDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +E A+ D  D+ +E+  E                          
Subjt:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE--------------------------

Query:  -----------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKE
                                                              ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLNP MKE
Subjt:  -----------------------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKE

Query:  VVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGY
        VVKKE+IKWLD GIIYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFID+MLDRLAG+ +YCFLDGYSGY
Subjt:  VVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGY

Query:  NQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        NQI IAPED+EKTTFTCPYGTFAFRRMPFGL
Subjt:  NQITIAPEDREKTTFTCPYGTFAFRRMPFGL

A0A2G9HYD8 Reverse transcriptase7.7e-14849.59Show/hide
Query:  QLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPL
        QLAN + +RPQG LPS+TE +PR++GK Q +AVTLR+G+ L+E  K    +  +    VI KE E                   +VEAP     P     
Subjt:  QLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPL

Query:  PFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRAL
        PFPQ+                               Q P            K R+ DY             Q    PK KDPGSFTIP +IG    GRAL
Subjt:  PFPQR-------------------------------QKP------------KNRMCDY-------------QEWSTPKAKDPGSFTIPVSIGGKELGRAL

Query:  CDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCN
        CDLGASINLMP S+YR LG+GEA PT +TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV +
Subjt:  CDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCN

Query:  EEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE----------------------------------------
        +++ FNVFKAMK+P+E ++C  + + ++              +E A+ D  ++ +E+  E                                        
Subjt:  EEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEKHGE----------------------------------------

Query:  ---------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGI
                                                ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLD GI
Subjt:  ---------------------------------------QYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGI

Query:  IYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTT
        IYPI+DS+WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFID+MLDRLAG+ +YCFLDGYSGYNQI IAPED+EKTT
Subjt:  IYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTT

Query:  FTCPYGTFAFRRMPFGL
        FTCPYGTFAFRRMPFGL
Subjt:  FTCPYGTFAFRRMPFGL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.3e-1624.93Show/hide
Query:  RALCDLGASINLMPLSVYRKLGIGEASPTIVTLQ---LADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGEL
        + L D G+++N+   +++  L I   S  I T     + ++SI  P            K +FP     L +   ++  ++LGR  LA  +A I  +  E+
Subjt:  RALCDLGASINLMPLSVYRKLGIGEASPTIVTLQ---LADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGEL

Query:  TMRVCNEEVKFNVFKAMKYPDEMEDCSFI---RILESTVIETAIQDSADKHSEKHGEQYRK--AIGWTLADIQ---GISLSFC-MHKITLEEGSFRSIEQ
        T+   N + K     A       ++ + I    + +   I   ++    +    + E+ ++  A+     DIQ   G  L+F    K T+       +  
Subjt:  TMRVCNEEVKFNVFKAMKYPDEMEDCSFI---RILESTVIETAIQDSADKHSEKHGEQYRK--AIGWTLADIQ---GISLSFC-MHKITLEEGSFRSIEQ

Query:  QRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAY
        +     A ++ V+ ++   L+ GII   ++S + SP+  VPKK      S K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y
Subjt:  QRRLNPAMKEVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAY

Query:  YCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        +  +D   G++QI + PE   KT F+  +G + + RMPFGL
Subjt:  YCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

P10394 Retrovirus-related Pol polyprotein from transposon 4129.7e-1535.61Show/hide
Query:  EVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSG
        E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSG

Query:  YNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        ++QI +    R+ T+F+   G++ F R+PFGL
Subjt:  YNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

P31843 RNA-directed DNA polymerase homolog4.8e-1450Show/hide
Query:  RVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL
        R+C+DYR L K T K+ +P+P +D + DRLA   ++  LD  SGY Q+ IA  D  KTT    YG+F FR MPFGL
Subjt:  RVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCPYGTFAFRRMPFGL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-2034.42Show/hide
Query:  STVIET--AIQDSADKHSEKH---------GEQYRKAIGWTL----ADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYP
        S V+ T  +++ +A  HS K           ++YR+ I   L    ADI  I +    H I ++ G+     Q   +    ++ + K V K LD   I P
Subjt:  STVIET--AIQDSADKHSEKH---------GEQYRKAIGWTL----ADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTC
         + S   SPV  VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+DR KT F  
Subjt:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTC

Query:  PYGTFAFRRMPFGLL
        P G + +  MPFGL+
Subjt:  PYGTFAFRRMPFGLL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-2034.42Show/hide
Query:  STVIET--AIQDSADKHSEKH---------GEQYRKAIGWTL----ADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYP
        S V+ T  +++ +A  HS K           ++YR+ I   L    ADI  I +    H I ++ G+     Q   +    ++ + K V K LD   I P
Subjt:  STVIET--AIQDSADKHSEKH---------GEQYRKAIGWTL----ADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDVGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTC
         + S   SPV  VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+DR KT F  
Subjt:  IADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTC

Query:  PYGTFAFRRMPFGLL
        P G + +  MPFGL+
Subjt:  PYGTFAFRRMPFGLL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGTAGAAACCAGATGGAGAACGCCCCGCAACT
TCCGCAGGACCAGAGCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCATATCCAAGCGGCAAATTTTGAAATGAAACCGGTAA
TGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAAGA
GTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCGTATTCTCTTAGAGATGGAGCAAAGTCATGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTGGT
GAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTGGCGCAACCACCCCAACTTCTCATGG
GGAGTCAAGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGCTAGCTAATGAGCTCAAGGCGAGGCCTCAAGGGAAACTTCCTTCG
GATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAG
AAATAATAATGTTGTTATTGAGAAAGAGTTGGAGTCTGGACAGGGTGCTGGAGGCAGCAAAGAGAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAAGCACCATATG
TGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATAGGATGTGTGATTATCAAGAATGGTCTACCCCCAAGGCTAAGGATCCAGGG
TCATTTACCATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTAT
TGGTGAAGCTAGTCCTACCATAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTC
CGGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAA
TTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGT
TATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGCAATACCGCAAAGCTATAGGTTGGACATTAGCTGATATTCAGGGAATTAGCCTAT
CTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAA
TGGTTGGATGTTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGA
GTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACTCGAAAGGATCATTTCCCTCTACCATTTATCGACAAGATGT
TGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAAGATCGGGAAAAAACCACTTTCACCTGCCCT
TATGGGACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGTGGTGTATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGTAGAAACCAGATGGAGAACGCCCCGCAACT
TCCGCAGGACCAGAGCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCATATCCAAGCGGCAAATTTTGAAATGAAACCGGTAA
TGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAAGA
GTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCGTATTCTCTTAGAGATGGAGCAAAGTCATGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAGCAGTGGT
GAACCAAGTCACGGACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTGGCGCAACCACCCCAACTTCTCATGG
GGAGTCAAGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCGCAGCTAGCTAATGAGCTCAAGGCGAGGCCTCAAGGGAAACTTCCTTCG
GATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAG
AAATAATAATGTTGTTATTGAGAAAGAGTTGGAGTCTGGACAGGGTGCTGGAGGCAGCAAAGAGAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAAGCACCATATG
TGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATAGGATGTGTGATTATCAAGAATGGTCTACCCCCAAGGCTAAGGATCCAGGG
TCATTTACCATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGTTAGGTAT
TGGTGAAGCTAGTCCTACCATAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATATTTC
CGGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAA
TTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGT
TATTGAGACAGCAATACAGGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGCAATACCGCAAAGCTATAGGTTGGACATTAGCTGATATTCAGGGAATTAGCCTAT
CTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAA
TGGTTGGATGTTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGA
GTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACTCGAAAGGATCATTTCCCTCTACCATTTATCGACAAGATGT
TGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAAGATCGGGAAAAAACCACTTTCACCTGCCCT
TATGGGACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGTGGTGTATGTTAG
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPEIERTFRNRRREQRRNQMENAPQLPQDQSIRAYAVPMFNELNPGIARPHIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQR
VPRDALRLTLFRILLEMEQSHVISHQQPPAMEPAAVVNQVTDEACVYCGEDHNYEFCPSNPASVLAQPPQLLMGSQGNNVQAQQKMNQPGFAKAQLANELKARPQGKLPS
DTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNRNNNVVIEKELESGQGAGGSKENAGASGSVPDVEAPYVPPPPYVPPLPFPQRQKPKNRMCDYQEWSTPKAKDPG
SFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEASPTIVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGE
LTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETAIQDSADKHSEKHGEQYRKAIGWTLADIQGISLSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIK
WLDVGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDKMLDRLAGQAYYCFLDGYSGYNQITIAPEDREKTTFTCP
YGTFAFRRMPFGLLAFAMLQQHFSGVC