; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022521 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022521
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:31651491..31657219
RNA-Seq ExpressionLag0022521
SyntenyLag0022521
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.1e-19160.7Show/hide
Query:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS
        +VE P     P      FPQR + +  + +F K LE+  +LHINIP  EA+EQMP+ VKF+KDIL+KK+RLG++ETV+LT+ECSAI++N LP K KDPGS
Subjt:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS

Query:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR
        FTIP +IG    GRALC+LGAS NLMP S+Y  L +GE +PT+ITLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD EVD +V IILGRPFLATGR
Subjt:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR

Query:  ALIDVQKGELTMRVYNEEVKFNDSTNKHLEDH--------------GEVSVEDLEVCSLER--------KNEKEVFRCEDVYDSLDLDQRK---------
         LIDVQKGELTMRV ++++ FN        +               G  S+ +  +  LER        +NE+++   + +  S  L  R+         
Subjt:  ALIDVQKGELTMRVYNEEVKFNDSTNKHLEDH--------------GEVSVEDLEVCSLER--------KNEKEVFRCEDVYDSLDLDQRK---------

Query:  APPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM
        +  +KPS+ + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISP+FCMHKI L++    SVE QRRLNP M
Subjt:  APPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM

Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS
        KEVVKKE+IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV N +NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYS
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS

Query:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        GYN+I IAPEDQEKTTFTC YGTFAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +E+FM+DFSV+G SF
Subjt:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.3e-19060.7Show/hide
Query:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS
        +VE P     P      FPQ+ + +  + +F K LE+  +LHINIP  EA+EQMP+ VKF+KDIL+KK+RLG++ET +LT+EC+AI++N LP K KDPGS
Subjt:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS

Query:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR
        FTIP +IG    GRALC+LGAS NLMP S+Y  L +GE +PT+ITLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD EVD +V IILGRPFLATGR
Subjt:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR

Query:  ALIDVQKGELTMRVYNEEVKFN------------DSTNKHLEDH--GEVSVEDLEVCSLERK--------NEKEV-----FRCEDVYDSLDLD--QRKAP
         LIDVQKGELTMRV ++++ FN            +  +  L D+  G  S+ +  + SLER         NE+++           + S  ++  +R  P
Subjt:  ALIDVQKGELTMRVYNEEVKFN------------DSTNKHLEDH--GEVSVEDLEVCSLERK--------NEKEV-----FRCEDVYDSLDLD--QRKAP

Query:  P--IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM
           +KPS+ + PTL+LKPL +HL YVYLGE +TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISP+FCMHKI L++    SVE QRRLN  M
Subjt:  P--IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM

Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS
        KEVVKKE+IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV N +NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYS
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS

Query:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        GYN+I IAPEDQEKTTFTC YGTFAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +E+FM+DFSV+G SF
Subjt:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]4.6e-19259.69Show/hide
Query:  SNNDAGASGPVPDVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILK
        SN  A AS P   V     PPPP      FPQR + + QD +F+K +++  +L INIP  EA+EQM + VKF+KDIL++K+RL EFETV+LT+ECSAIL+
Subjt:  SNNDAGASGPVPDVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILK

Query:  NGLPTKTKDPGSFTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVS
          LP K KDPGSFTIP +IG +  G+ALC+LGAS NLMPLS+++KL +GE +PT++ LQLADRS+ YP G +EDVLVKVDKFIFP DFI+LD E D D+ 
Subjt:  NGLPTKTKDPGSFTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVS

Query:  IILGRPFLATGRALIDVQKGELTMRVYNEEVKFN-----------------------DSTNKHLEDHGEVSVEDLEVCSLERKNE--KEVFRCEDVYDSL
        ++LGRPFLATGR LIDVQKGELTMRV +E+V FN                       D     LE H   S + LE+   E  +E  +E+  C    ++L
Subjt:  IILGRPFLATGRALIDVQKGELTMRVYNEEVKFN-----------------------DSTNKHLEDHGEVSVEDLEVCSLERKNE--KEVFRCEDVYDSL

Query:  DLDQR------------KAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITL
           +R            K+   KPS+ E P L+LK L  HLKY +LGE  TLP+I++S L +EHEE L+++L++Y++AI W +ADI+GISP+FCMHKI++
Subjt:  DLDQR------------KAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITL

Query:  DEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQ
        ++    ++E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV+N+ NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQ
Subjt:  DEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQ

Query:  MLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        MLD+LAG+ +Y FLDGYSGY++I IAPEDQEKTTFTC +GTFAFR++SFGLCNAP+TFQRCM+AIFSDMIE  VE+FM+DFSV G SF
Subjt:  MLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]2.1e-19766.67Show/hide
Query:  VPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVS
        +P    VPP  +PQR + KNQD +F + LE+L QLHINIPL+EA+EQMPN VKFLKDIL KK+RLGEFE V+LTKE SAIL   LP K  DPGSFTIPV 
Subjt:  VPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVS

Query:  IGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQ
        IGGK +G ALC+LGAS NLMPLSVY KL IGE RP T+TLQLADRSITY EGKIEDVLV+VDKFIFP DFIILDYE DK++ IILGRPFL+TGRALIDV 
Subjt:  IGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQ

Query:  KGELTMRVYNEEVK---FNDSTNK-HLEDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGET
         GELT+RV +++V    FN       +E+   + + D ++ S E + E+ + + ED    +  D+ +A P++PS+++AP L+LK L  HLKY YLGE ET
Subjt:  KGELTMRVYNEEVK---FNDSTNK-HLEDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGET

Query:  LPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVP
        LP+ +A+DL  E E  LI++L+ ++KAI WTLADI+GISP++CMHKI L+EG   S+E QRRLNP MKEVVKKE+IKWLDAGIIYPIAD + +SP QCVP
Subjt:  LPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVP

Query:  KKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGL
        KKGG TVV N NNELIPTRT+TGW + MDYRKLNKAT+ DHFPLPFIDQMLD L GQ YYY LDGY+GYN+ITI P+DQ+KTTFTC YGTF+FRRM FGL
Subjt:  KKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGL

Query:  CNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        CNAPTTFQRCM+AIF D+IE+ VE+FM+DFSVF   F
Subjt:  CNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]2.4e-19662.52Show/hide
Query:  YVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPV
        Y P PP      FPQR K K ++  F+K ++I  ++HINIPLVEA++QMPN VKFLKD+LT +++  EF+ V L +ECSAILKN +P K KDPGSFTIP+
Subjt:  YVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPV

Query:  SIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDV
        SIGGK+LGRALC+LG+S NLMPLS+Y KL IGE RPTT+TLQLADRS TYPEGKIED+L++VDKFIFP DFIILDYE D DV IILGRPFL TGR L+DV
Subjt:  SIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDV

Query:  QKGELTMRVYNEEVKFN-DSTNKHLEDHGEVSV--EDLEVCSLERKNEKEVFRCED------------------VYDSLDLDQRKAPPIKPSLIEAPTLD
         KG +T+R+ +++V+FN + + K+     E S   E  E  + E  ++ E  + ED                   ++SL+ + RK+ P++PS+ EAP LD
Subjt:  QKGELTMRVYNEEVKFN-DSTNKHLEDHGEVSV--EDLEVCSLERKNEKEVFRCED------------------VYDSLDLDQRKAPPIKPSLIEAPTLD

Query:  LKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAG
        LKPL  +LKY YLG+ +TLPII+++ L S  E+ L++ L++++ AI WTLADI+GISP+ CMHKI L+EG  +S+EQQRRLNP MKEVV+KE++KWLDAG
Subjt:  LKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKT
        IIYPIA+S+ VSP QCVPKKGG TV++N+NNELI TR V GWR+ MDYR+LNKATR DHFPLPFIDQMLD+LAG+++Y FLDGYSGYN+ITI+PEDQEKT
Subjt:  IIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKT

Query:  TFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        TFTC YG FAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +EIFM+DFSV+G SF
Subjt:  TFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase3.8e-18461.92Show/hide
Query:  KKILEILNQLHINIPL-VEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVSIGGKELGRALCELGASNNLMPLSV
        K+++       I  PL V+A+EQMP+ VKF+KDIL+KK+RLG++ETV+LT+ECSAI++N LP K KDPGSFTIP +IG    GRALC+LGAS NLMP S+
Subjt:  KKILEILNQLHINIPL-VEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVSIGGKELGRALCELGASNNLMPLSV

Query:  YLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQKGELTMRVYNEEVKFN--------
        Y  L +GE +PT+ITLQLADRS+TYP G IED+LVKVDKFIFP DF++LD EVD +V IILGRPFLATGR LIDVQKGELTMRV ++++ FN        
Subjt:  YLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQKGELTMRVYNEEVKFN--------

Query:  DSTNKHL-----------EDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLSDHLKYVYLG
        + +++             E   E  ++ LE   L+  +E+    CE V  +LD             +R AP   +KPS+ E PTL+LKPL  HL Y YLG
Subjt:  DSTNKHL-----------EDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLD-----------QRKAPP--IKPSLIEAPTLDLKPLSDHLKYVYLG

Query:  EGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPA
        E +TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISP+FCMHKI L++G   SVE QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSP 
Subjt:  EGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPA

Query:  QCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRM
        QCVPKKGG TVV N +NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYSGYN+I I PEDQEKTTFTC YGTF FR+M
Subjt:  QCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRM

Query:  SFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
         FGLCNAP TFQRCM+AIF+DM+E+ +E+FM+DFSV+G SF
Subjt:  SFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

A0A2G9HWC5 DNA-directed DNA polymerase5.9e-18560.85Show/hide
Query:  PPLAFPQRQKPK-NQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVSIGGKEL
        P  + P   +P   QDG+  +   +  +LHINIP  EA+EQMP+ VKF+KDIL+KK+RLG++ETV+LT+E SAI++N LP K KDPGSFTIP +IG    
Subjt:  PPLAFPQRQKPK-NQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVSIGGKEL

Query:  GRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQKGELTM
        GRALC+LGAS NLMP S+Y  L +GE +PT+ITLQLADRS+TYP+G IED+LVKVDKFIFP D ++LD EVD ++ IILGRPFLATGR LIDVQKGELTM
Subjt:  GRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQKGELTM

Query:  RVYNEEVKFN----------------------------------DSTNKHLED-HGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSL
        RV ++++ FN                                  DS  + L D   E + EDLEV  ++  +  + F+   V       Q K   +KPS+
Subjt:  RVYNEEVKFN----------------------------------DSTNKHLED-HGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSL

Query:  IEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEV
         E PTL+LKPL  HL YVYLGE +TLP+I++S L     E L+++L+ +  AI WT+ADI+GISP+FCMHKI L++    SVE QRRLNP MKEVVKKE+
Subjt:  IEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEV

Query:  IKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIA
        IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV N +NELIPTRTVTGWR  MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYSGYN+I IA
Subjt:  IKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIA

Query:  PEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        PEDQEK TFTC YGTFAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +EIFM+DFSV+G SF
Subjt:  PEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

A0A2G9HYA0 Reverse transcriptase2.5e-19160.7Show/hide
Query:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS
        +VE P     P      FPQR + +  + +F K LE+  +LHINIP  EA+EQMP+ VKF+KDIL+KK+RLG++ETV+LT+ECSAI++N LP K KDPGS
Subjt:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS

Query:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR
        FTIP +IG    GRALC+LGAS NLMP S+Y  L +GE +PT+ITLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD EVD +V IILGRPFLATGR
Subjt:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR

Query:  ALIDVQKGELTMRVYNEEVKFNDSTNKHLEDH--------------GEVSVEDLEVCSLER--------KNEKEVFRCEDVYDSLDLDQRK---------
         LIDVQKGELTMRV ++++ FN        +               G  S+ +  +  LER        +NE+++   + +  S  L  R+         
Subjt:  ALIDVQKGELTMRVYNEEVKFNDSTNKHLEDH--------------GEVSVEDLEVCSLER--------KNEKEVFRCEDVYDSLDLDQRK---------

Query:  APPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM
        +  +KPS+ + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISP+FCMHKI L++    SVE QRRLNP M
Subjt:  APPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM

Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS
        KEVVKKE+IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV N +NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYS
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS

Query:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        GYN+I IAPEDQEKTTFTC YGTFAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +E+FM+DFSV+G SF
Subjt:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

A0A2G9HYD8 Reverse transcriptase1.6e-19060.7Show/hide
Query:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS
        +VE P     P      FPQ+ + +  + +F K LE+  +LHINIP  EA+EQMP+ VKF+KDIL+KK+RLG++ET +LT+EC+AI++N LP K KDPGS
Subjt:  DVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGS

Query:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR
        FTIP +IG    GRALC+LGAS NLMP S+Y  L +GE +PT+ITLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD EVD +V IILGRPFLATGR
Subjt:  FTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR

Query:  ALIDVQKGELTMRVYNEEVKFN------------DSTNKHLEDH--GEVSVEDLEVCSLERK--------NEKEV-----FRCEDVYDSLDLD--QRKAP
         LIDVQKGELTMRV ++++ FN            +  +  L D+  G  S+ +  + SLER         NE+++           + S  ++  +R  P
Subjt:  ALIDVQKGELTMRVYNEEVKFN------------DSTNKHLEDH--GEVSVEDLEVCSLERK--------NEKEV-----FRCEDVYDSLDLD--QRKAP

Query:  P--IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM
           +KPS+ + PTL+LKPL +HL YVYLGE +TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISP+FCMHKI L++    SVE QRRLN  M
Subjt:  P--IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTM

Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS
        KEVVKKE+IKWLDAGIIYPI+DS+WVSP QCVPKKGG TVV N +NELIPTRTVTGWRV MDYRKLNKATR DHFPLPFIDQMLD+LAG+ +Y FLDGYS
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYS

Query:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        GYN+I IAPEDQEKTTFTC YGTFAFRRM FGLCNAP TFQRCM+AIF+DM+E+ +E+FM+DFSV+G SF
Subjt:  GYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

A0A6J1DV77 uncharacterized protein LOC1110238181.0e-19766.67Show/hide
Query:  VPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVS
        +P    VPP  +PQR + KNQD +F + LE+L QLHINIPL+EA+EQMPN VKFLKDIL KK+RLGEFE V+LTKE SAIL   LP K  DPGSFTIPV 
Subjt:  VPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNGLPTKTKDPGSFTIPVS

Query:  IGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQ
        IGGK +G ALC+LGAS NLMPLSVY KL IGE RP T+TLQLADRSITY EGKIEDVLV+VDKFIFP DFIILDYE DK++ IILGRPFL+TGRALIDV 
Subjt:  IGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGRALIDVQ

Query:  KGELTMRVYNEEVK---FNDSTNK-HLEDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGET
         GELT+RV +++V    FN       +E+   + + D ++ S E + E+ + + ED    +  D+ +A P++PS+++AP L+LK L  HLKY YLGE ET
Subjt:  KGELTMRVYNEEVK---FNDSTNK-HLEDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGET

Query:  LPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVP
        LP+ +A+DL  E E  LI++L+ ++KAI WTLADI+GISP++CMHKI L+EG   S+E QRRLNP MKEVVKKE+IKWLDAGIIYPIAD + +SP QCVP
Subjt:  LPIIVASDLMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVP

Query:  KKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGL
        KKGG TVV N NNELIPTRT+TGW + MDYRKLNKAT+ DHFPLPFIDQMLD L GQ YYY LDGY+GYN+ITI P+DQ+KTTFTC YGTF+FRRM FGL
Subjt:  KKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGL

Query:  CNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF
        CNAPTTFQRCM+AIF D+IE+ VE+FM+DFSVF   F
Subjt:  CNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGSF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.7e-1931.47Show/hide
Query:  LMSEHEETLIKLLQQYRKAIDWTLADIQ---GISPTFC-MHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGG
        L +E ++ L  LLQ+Y         DIQ   G   TF    K T++      +  +       ++ V+ ++   L+ GII   ++S + SP   VPKK  
Subjt:  LMSEHEETLIKLLQQYRKAIDWTLADIQ---GISPTFC-MHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGG

Query:  DTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAP
            S K            +R+ +DYRKLN+ T  D  P+P +D++L KL    Y+  +D   G+++I + PE   KT F+  +G + + RM FGL NAP
Subjt:  DTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAP

Query:  TTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS
         TFQRCM  I   ++     ++++D  VF  S
Subjt:  TTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS

P10394 Retrovirus-related Pol polyprotein from transposon 4121.4e-1834.52Show/hide
Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSG
        E ++ +V K +   I+ P + S + SP   VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSG

Query:  YNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS
        +++I +    ++ T+F+ + G++ F R+ FGL  AP +FQR M   FS +  S   ++M+D  V G S
Subjt:  YNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS

P20825 Retrovirus-related Pol polyprotein from transposon 2971.1e-1834.27Show/hide
Query:  QQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQA
        +Q  L  T +  V+ +V + L+ G+I   ++S + SP   VPKK  D   +NK            +RV +DYRKLN+ T  D +P+P +D++L KL    
Subjt:  QQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQA

Query:  YYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS
        Y+  +D   G+++I +  E   KT F+   G + + RM FGL NAP TFQRCM  I   ++     ++++D  +F  S
Subjt:  YYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMNDFSVFGGS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.5e-2033.94Show/hide
Query:  LLQQYRKAIDWTL----ADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNEL
        L Q+YR+ I   L    ADI  I      H I +  G+     Q   +    ++ + K V K LD   I P + S   SP   VPKK G           
Subjt:  LLQQYRKAIDWTL----ADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNEL

Query:  IPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIF
                +R+ +DYR LNKAT  D FPLP ID +L ++     +  LD +SGY++I + P+D+ KT F    G + +  M FGL NAP+TF R M   F
Subjt:  IPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIF

Query:  SDMIESTVEIFMNDFSVFGGS
         D+    V ++++D  +F  S
Subjt:  SDMIESTVEIFMNDFSVFGGS

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.5e-2033.94Show/hide
Query:  LLQQYRKAIDWTL----ADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNEL
        L Q+YR+ I   L    ADI  I      H I +  G+     Q   +    ++ + K V K LD   I P + S   SP   VPKK G           
Subjt:  LLQQYRKAIDWTL----ADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNEL

Query:  IPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIF
                +R+ +DYR LNKAT  D FPLP ID +L ++     +  LD +SGY++I + P+D+ KT F    G + +  M FGL NAP+TF R M   F
Subjt:  IPTRTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIF

Query:  SDMIESTVEIFMNDFSVFGGS
         D+    V ++++D  +F  S
Subjt:  SDMIESTVEIFMNDFSVFGGS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCAGAGTCAGCACTTAAGGGAACAACGTCTCTACTATCCCTAATTCAGGTAGGAGCGAATTCCTTCTTGCATGACTATGT
CCCCAACTATCTACCCGTTCCTATCCCTGAAATGGGAGCTCAAGTCGTCATTCTTGATGATGACGTAGAGGATTCATTGTCTTTTACAAAAGAAATGAATGATGCTGTTA
ACGATCAGAGGATTAAAGCCATTTACTTGAAAATGGAATCTTTGTATTTCAATTCCATCTGGGATCTTGTACATCTGCTTGATGGGACATTGATGACTCTTGTGTTTACA
AAGACTGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGGCCTGTTCCAGATGTGGAACCACCTTATGTGCCACCCCCACCTTATGTACCACCTCTAGC
TTTTCCACAAAGGCAAAAGCCTAAGAACCAGGATGGTGAATTTAAGAAGATTTTAGAGATTCTTAACCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAA
TGCCAAATAATGTTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTAAGGAATGTAGTGCTATTCTTAAGAATGGG
CTACCAACCAAGACTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGAATTAGGTGCAAGCAATAACCTTATGCC
TCTTTCGGTCTATCTTAAGCTAAGTATTGGTGAAACTAGGCCTACCACAATCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCT
TAGTGAAGGTAGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGATTATGAGGTTGATAAAGATGTCTCAATTATTCTTGGTCGTCCATTTTTGGCCACTGGTAGG
GCATTGATTGATGTTCAAAAAGGGGAGTTAACAATGAGAGTTTATAATGAGGAAGTGAAATTTAATGATTCGACAAACAAACATTTGGAAGATCATGGAGAGGTTAGTGT
AGAGGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAATGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTATGATTCTTTAGATTTAGATCAAAGAAAAGCTCCTCCTATTA
AGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCAGATCATTTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTAATGTCGGAGCATGAAGAGACCTTAATTAAGTTACTGCAGCAATATCGCAAAGCAATAGATTGGACATTGGCTGACATTCAGGGAATTAGCCCAACTTTTTGTATGCA
TAAAATCACTCTAGATGAGGGATCCTTTAGGAGTGTTGAGCAACAGAGAAGGCTTAACCCTACAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTG
GCATTATTTATCCAATAGCAGACAGCAATTGGGTAAGCCCTGCCCAATGTGTTCCTAAGAAAGGAGGTGACACTGTGGTGAGCAATAAAAACAATGAGTTGATCCCAACC
AGGACAGTAACTGGCTGGAGGGTTTATATGGATTACAGGAAGCTTAATAAAGCTACCCGTATGGACCATTTCCCTCTACCATTTATTGACCAAATGTTGGACAAATTGGC
TGGTCAGGCCTACTACTATTTCTTAGATGGTTATTCTGGGTATAACAAGATTACCATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGCGCTTATGGGACGTTTG
CTTTTAGGCGAATGTCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATGATTGAGTCCACTGTTGAGATATTTATGAAC
GATTTTTCAGTGTTTGGAGGGTCTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCAGAGTCAGCACTTAAGGGAACAACGTCTCTACTATCCCTAATTCAGGTAGGAGCGAATTCCTTCTTGCATGACTATGT
CCCCAACTATCTACCCGTTCCTATCCCTGAAATGGGAGCTCAAGTCGTCATTCTTGATGATGACGTAGAGGATTCATTGTCTTTTACAAAAGAAATGAATGATGCTGTTA
ACGATCAGAGGATTAAAGCCATTTACTTGAAAATGGAATCTTTGTATTTCAATTCCATCTGGGATCTTGTACATCTGCTTGATGGGACATTGATGACTCTTGTGTTTACA
AAGACTGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATCTGGGCCTGTTCCAGATGTGGAACCACCTTATGTGCCACCCCCACCTTATGTACCACCTCTAGC
TTTTCCACAAAGGCAAAAGCCTAAGAACCAGGATGGTGAATTTAAGAAGATTTTAGAGATTCTTAACCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAA
TGCCAAATAATGTTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTAAGGAATGTAGTGCTATTCTTAAGAATGGG
CTACCAACCAAGACTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGAATTAGGTGCAAGCAATAACCTTATGCC
TCTTTCGGTCTATCTTAAGCTAAGTATTGGTGAAACTAGGCCTACCACAATCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCT
TAGTGAAGGTAGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGATTATGAGGTTGATAAAGATGTCTCAATTATTCTTGGTCGTCCATTTTTGGCCACTGGTAGG
GCATTGATTGATGTTCAAAAAGGGGAGTTAACAATGAGAGTTTATAATGAGGAAGTGAAATTTAATGATTCGACAAACAAACATTTGGAAGATCATGGAGAGGTTAGTGT
AGAGGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAATGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTATGATTCTTTAGATTTAGATCAAAGAAAAGCTCCTCCTATTA
AGCCATCCCTAATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCAGATCATTTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTAATGTCGGAGCATGAAGAGACCTTAATTAAGTTACTGCAGCAATATCGCAAAGCAATAGATTGGACATTGGCTGACATTCAGGGAATTAGCCCAACTTTTTGTATGCA
TAAAATCACTCTAGATGAGGGATCCTTTAGGAGTGTTGAGCAACAGAGAAGGCTTAACCCTACAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTG
GCATTATTTATCCAATAGCAGACAGCAATTGGGTAAGCCCTGCCCAATGTGTTCCTAAGAAAGGAGGTGACACTGTGGTGAGCAATAAAAACAATGAGTTGATCCCAACC
AGGACAGTAACTGGCTGGAGGGTTTATATGGATTACAGGAAGCTTAATAAAGCTACCCGTATGGACCATTTCCCTCTACCATTTATTGACCAAATGTTGGACAAATTGGC
TGGTCAGGCCTACTACTATTTCTTAGATGGTTATTCTGGGTATAACAAGATTACCATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGCGCTTATGGGACGTTTG
CTTTTAGGCGAATGTCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATGATTGAGTCCACTGTTGAGATATTTATGAAC
GATTTTTCAGTGTTTGGAGGGTCTTTTTAG
Protein sequenceShow/hide protein sequence
MRGRDPFVQAPESALKGTTSLLSLIQVGANSFLHDYVPNYLPVPIPEMGAQVVILDDDVEDSLSFTKEMNDAVNDQRIKAIYLKMESLYFNSIWDLVHLLDGTLMTLVFT
KTGQGAGGSNNDAGASGPVPDVEPPYVPPPPYVPPLAFPQRQKPKNQDGEFKKILEILNQLHINIPLVEAIEQMPNNVKFLKDILTKKKRLGEFETVSLTKECSAILKNG
LPTKTKDPGSFTIPVSIGGKELGRALCELGASNNLMPLSVYLKLSIGETRPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEVDKDVSIILGRPFLATGR
ALIDVQKGELTMRVYNEEVKFNDSTNKHLEDHGEVSVEDLEVCSLERKNEKEVFRCEDVYDSLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASD
LMSEHEETLIKLLQQYRKAIDWTLADIQGISPTFCMHKITLDEGSFRSVEQQRRLNPTMKEVVKKEVIKWLDAGIIYPIADSNWVSPAQCVPKKGGDTVVSNKNNELIPT
RTVTGWRVYMDYRKLNKATRMDHFPLPFIDQMLDKLAGQAYYYFLDGYSGYNKITIAPEDQEKTTFTCAYGTFAFRRMSFGLCNAPTTFQRCMLAIFSDMIESTVEIFMN
DFSVFGGSF