; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008813 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008813
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:30581494..30586239
RNA-Seq ExpressionLag0008813
SyntenyLag0008813
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.1e-17555.92Show/hide
Query:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA
        P    R + +  + QF KFLE+ K++H NIP  EA+EQMP++ KF+KDIL+KK+RLG++ETV+LTEECSAI++N+LPPK KDPGSFTIP +IG    GRA
Subjt:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC
        LCD               LG+GEA+PT++TL LAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV 
Subjt:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC

Query:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLI
        ++++ FNVFKAMK+P+E ++C  + + +      +I +      E        E +EEDL+V  ++  +  +  +   V ESL   +R  P   +KPS+ 
Subjt:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLI

Query:  EAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVI
        + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G+SPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+I
Subjt:  EAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVI

Query:  KWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAP
        KWLDAGIIYPI+DS+WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR                                              I IAP
Subjt:  KWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+L+LN
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.3e-17657.12Show/hide
Query:  QFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-----------
        QF KFLE+ K++H NIP  EA+EQMP++ KF+KDIL+KK+RLG++ET +LTEEC+AI++N+LPPK KDPGSFTIP +IG    GRALCD           
Subjt:  QFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-----------

Query:  ----LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
            LG+GEA+PT++TL LAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+
Subjt:  ----LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLSDHL
        P+E ++C  + + ++     +I +      E        E +EEDL+V  ++  N  + F+   V ESL   +R  P   +KPS+ + PTL+LKPL +HL
Subjt:  PDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLSDHL

Query:  KYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADS
         YVYLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G+SPSFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLDAGIIYPI+DS
Subjt:  KYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADS

Query:  NWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTTFTCPYGT
        +WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR                                              I IAPEDQEKTTFTCPYGT
Subjt:  NWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTTFTCPYGT

Query:  FAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        FAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+LVLN
Subjt:  FAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

PIN26668.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.4e-17154.91Show/hide
Query:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI
        K +    E +E  A   R + +    QF KFLE+ K++H N P  EA+EQMP++ KF+K IL+KK+RLG++ETV+LTEECSAI++N+LPPK KDPGSFTI
Subjt:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI

Query:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI
        P +IG    GRALCD               LG+GEA+PT++TL LA+ S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LI
Subjt:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI

Query:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKH-SEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP
        DVQKG+LTMRV ++++ FNVFKAMK+P+E ++C  + + ++     +I D   +   +   E +EED +V  ++  +  + F+   V ESL   +R AP 
Subjt:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKH-SEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP

Query:  --IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMK
          +KPS+ E+PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++ + ++ AIGWT+ADI+G+S SFCMHKI LE+    S+E QRRLNP MK
Subjt:  --IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMK

Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWRITIAPE------------------------------------D
        EVVKKE+IKW+DAGIIYPI+DS+WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR+ +                                       D
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWRITIAPE------------------------------------D

Query:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        QEKTTFTCPYGTFAFRR+PFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+LVLN
Subjt:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]1.3e-17656.63Show/hide
Query:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI
        K +   +E +  P    R + KNQD QF +FLE+LKQ+H NIPL+EA+EQMPN+ KFLKDIL KK+RLGEFE V+LT+E SAIL  +LP K  DPGSFTI
Subjt:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI

Query:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI
        PV IGGK +G ALCD               LGIGEARP TVTL LAD SITY EGKIEDVLV+VDKFIFP DFIILDYEADK++PIILGRPFL+TGRALI
Subjt:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI

Query:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPI
        DV  GELT+RV +++V  ++F ++KYP ++E+CS++RI +                    ++  +++Q   L  + E EL R         +  R   P+
Subjt:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPI

Query:  KPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV
        +PS+++AP L+LK L  HLKY YLGE ETLP+ +A+DL  E E  LI++L+ ++KAIGWTLADI+G+SPS+CMHKI LEEG   SIE QRRLNPAMKEVV
Subjt:  KPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV

Query:  KKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGW----------------------------------------------R
        KKE+IKWLDAGIIYPIAD + +SPV CVPKKGG+TVV N +NELIPTRT+TGW                                              +
Subjt:  KKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGW----------------------------------------------R

Query:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        ITI P+DQ+KTTFTCPYGTF+FRRMPFGLCNAP TFQRCM+AIF D+IE+ VEVFMDDFS+F +     L NL QVL+RCEDT+LVLN
Subjt:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]3.9e-17655.73Show/hide
Query:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA
        P    R + K ++  F+KF++I K++H NIPLVEA++QMPN+ KFLKD+LT +++  EF+ V L EECSAILKN++P K KDPGSFTIP+SIGGK+LGRA
Subjt:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC
        LCD               LGIGEARPTTVTL LAD S TYPEGKIED+L++VDKFIFP DFIILDYEAD DVPIILGRPFL TGR L+DV KG +T+R+ 
Subjt:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC

Query:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDL
        +++V+FN+  +MKYP   E+CS +  L      T   D      EE    ++   Q+ +L   +          FESL+ + RK+ P++PS+ EAP LDL
Subjt:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDL

Query:  KPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGI
        KPL  +LKY YLG+ +TLPII+++ L S  E+ L++ L++++ AIGWTLADI+G+SPS CMHKI LEEG  +SIEQQRRLNP MKEVV+KE++KWLDAGI
Subjt:  KPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGI

Query:  IYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTT
        IYPIA+S+ VSP+ CVPKKGG+TV++N++NELI TR V GWR                                              ITI+PEDQEKTT
Subjt:  IYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTT

Query:  FTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        FTCPYG FAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +E+FMDDFS++G S ++CL NLG+VL+RCE+ +LVLN
Subjt:  FTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

TrEMBL top hitse value%identityAlignment
A0A2G9HWC5 DNA-directed DNA polymerase2.9e-16955.29Show/hide
Query:  TRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-
        T   P+ QDG+ +     LK +H NIP  EA+EQMP++ KF+KDIL+KK+RLG++ETV+LTEE SAI++N+LPPK KDPGSFTIP +IG    GRALCD 
Subjt:  TRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-

Query:  --------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEV
                      LG+GEA+PT++TL LAD S+TYP+G IED+LVKVDKFIFP D ++LD E D ++ IILGRPFLATGR LIDVQKGELTMRV ++++
Subjt:  --------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEV

Query:  KFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLD
         FNVFKAMK+P+E ++C  + + ++     +I +      E        E +EEDL+V  ++  +  + F+   V ESL+    ++  +KPS+ E PTL+
Subjt:  KFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLD

Query:  LKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG
        LKPL  HL YVYLGE +TLP+I++S L     E L+++L+ +  AIGWT+ADI+G+SPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAG
Subjt:  LKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKT
        IIYPI+DS+WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR                                              I IAPEDQEK 
Subjt:  IIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKT

Query:  TFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        TFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +E+FMDDFS++G S   CL+NL  +LKRCEDT+LVLN
Subjt:  TFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

A0A2G9HYA0 Reverse transcriptase5.5e-17655.92Show/hide
Query:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA
        P    R + +  + QF KFLE+ K++H NIP  EA+EQMP++ KF+KDIL+KK+RLG++ETV+LTEECSAI++N+LPPK KDPGSFTIP +IG    GRA
Subjt:  PADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRA

Query:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC
        LCD               LG+GEA+PT++TL LAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV 
Subjt:  LCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVC

Query:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLI
        ++++ FNVFKAMK+P+E ++C  + + +      +I +      E        E +EEDL+V  ++  +  +  +   V ESL   +R  P   +KPS+ 
Subjt:  NEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLI

Query:  EAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVI
        + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G+SPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+I
Subjt:  EAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVI

Query:  KWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAP
        KWLDAGIIYPI+DS+WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR                                              I IAP
Subjt:  KWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAP

Query:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+L+LN
Subjt:  EDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

A0A2G9HYD8 Reverse transcriptase6.5e-17757.12Show/hide
Query:  QFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-----------
        QF KFLE+ K++H NIP  EA+EQMP++ KF+KDIL+KK+RLG++ET +LTEEC+AI++N+LPPK KDPGSFTIP +IG    GRALCD           
Subjt:  QFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCD-----------

Query:  ----LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
            LG+GEA+PT++TL LAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+
Subjt:  ----LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLSDHL
        P+E ++C  + + ++     +I +      E        E +EEDL+V  ++  N  + F+   V ESL   +R  P   +KPS+ + PTL+LKPL +HL
Subjt:  PDEMEDCSFIRILESTVIETTIQDSASKHSEEH-----GEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLSDHL

Query:  KYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADS
         YVYLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+G+SPSFCMHKI LE+    S+E QRRLN  MKEVVKKE+IKWLDAGIIYPI+DS
Subjt:  KYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADS

Query:  NWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTTFTCPYGT
        +WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR                                              I IAPEDQEKTTFTCPYGT
Subjt:  NWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWR----------------------------------------------ITIAPEDQEKTTFTCPYGT

Query:  FAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        FAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+LVLN
Subjt:  FAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

A0A2G9IA86 DNA-directed DNA polymerase4.1e-17154.91Show/hide
Query:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI
        K +    E +E  A   R + +    QF KFLE+ K++H N P  EA+EQMP++ KF+K IL+KK+RLG++ETV+LTEECSAI++N+LPPK KDPGSFTI
Subjt:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI

Query:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI
        P +IG    GRALCD               LG+GEA+PT++TL LA+ S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LI
Subjt:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI

Query:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKH-SEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP
        DVQKG+LTMRV ++++ FNVFKAMK+P+E ++C  + + ++     +I D   +   +   E +EED +V  ++  +  + F+   V ESL   +R AP 
Subjt:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKH-SEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPP

Query:  --IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMK
          +KPS+ E+PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++ + ++ AIGWT+ADI+G+S SFCMHKI LE+    S+E QRRLNP MK
Subjt:  --IKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMK

Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWRITIAPE------------------------------------D
        EVVKKE+IKW+DAGIIYPI+DS+WVSPV CVPKKGG+TVV N  NELIPTRTVTGWR+ +                                       D
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWRITIAPE------------------------------------D

Query:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        QEKTTFTCPYGTFAFRR+PFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFS++G S   CL+NL  VLKRCEDT+LVLN
Subjt:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

A0A6J1DV77 uncharacterized protein LOC1110238186.5e-17756.63Show/hide
Query:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI
        K +   +E +  P    R + KNQD QF +FLE+LKQ+H NIPL+EA+EQMPN+ KFLKDIL KK+RLGEFE V+LT+E SAIL  +LP K  DPGSFTI
Subjt:  KHIPKNLEFREDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTI

Query:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI
        PV IGGK +G ALCD               LGIGEARP TVTL LAD SITY EGKIEDVLV+VDKFIFP DFIILDYEADK++PIILGRPFL+TGRALI
Subjt:  PVSIGGKELGRALCD---------------LGIGEARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALI

Query:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPI
        DV  GELT+RV +++V  ++F ++KYP ++E+CS++RI +                    ++  +++Q   L  + E EL R         +  R   P+
Subjt:  DVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPI

Query:  KPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV
        +PS+++AP L+LK L  HLKY YLGE ETLP+ +A+DL  E E  LI++L+ ++KAIGWTLADI+G+SPS+CMHKI LEEG   SIE QRRLNPAMKEVV
Subjt:  KPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYRKAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV

Query:  KKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGW----------------------------------------------R
        KKE+IKWLDAGIIYPIAD + +SPV CVPKKGG+TVV N +NELIPTRT+TGW                                              +
Subjt:  KKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGW----------------------------------------------R

Query:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        ITI P+DQ+KTTFTCPYGTF+FRRMPFGLCNAP TFQRCM+AIF D+IE+ VEVFMDDFS+F +     L NL QVL+RCEDT+LVLN
Subjt:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.4e-1139.77Show/hide
Query:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVL
        +I + PE   KT F+  +G + + RMPFGL NAPATFQRCM  I   ++     V++DD  +F  S+   L +LG V ++    +L L
Subjt:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVL

P10394 Retrovirus-related Pol polyprotein from transposon 4122.8e-0734.83Show/hide
Query:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN
        +I +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS +  S   ++MDD  + G S +  L NL +V  +C + +L L+
Subjt:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN

P20825 Retrovirus-related Pol polyprotein from transposon 2975.0e-0937.5Show/hide
Query:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVL
        +I +  E   KT F+   G + + RMPFGL NAPATFQRCM  I   ++     V++DD  +F  S+   L+++  V  +  D +L L
Subjt:  RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.3e-1229.78Show/hide
Query:  LLQQYRKAIGWTL----ADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGV----------
        L Q+YR+ I   L    ADI  +      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV+ VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGV----------

Query:  ---------------TVVSNKDNELIPTR--TVTGW-RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLF
                        ++S   N  I T     +G+ +I + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +F
Subjt:  ---------------TVVSNKDNELIPTR--TVTGW-RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLF

Query:  GRSIQSCLDNLGQVLKRCEDTHLVL
          S +    +L  VL+R ++ +L++
Subjt:  GRSIQSCLDNLGQVLKRCEDTHLVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.3e-1229.78Show/hide
Query:  LLQQYRKAIGWTL----ADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGV----------
        L Q+YR+ I   L    ADI  +      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV+ VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGV----------

Query:  ---------------TVVSNKDNELIPTR--TVTGW-RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLF
                        ++S   N  I T     +G+ +I + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +F
Subjt:  ---------------TVVSNKDNELIPTR--TVTGW-RITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLF

Query:  GRSIQSCLDNLGQVLKRCEDTHLVL
          S +    +L  VL+R ++ +L++
Subjt:  GRSIQSCLDNLGQVLKRCEDTHLVL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.2e-0432.35Show/hide
Query:  MHPSKAPGPDGFPTLFYRRYWSKVSKTIICNVLDILNVKRPVRKWNETYIALIPKIQQPKAVVDYRPI
        M  +KAPGPD F   F+   W  V  + I  V +       ++++N T I LIPK+     +  +RP+
Subjt:  MHPSKAPGPDGFPTLFYRRYWSKVSKTIICNVLDILNVKRPVRKWNETYIALIPKIQQPKAVVDYRPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCCTTCAAAAGCTCCAGGACCAGACGGTTTTCCTACTCTATTTTATCGGAGATATTGGTCAAAGGTAAGTAAAACCATTATCTGTAATGTTTTGGATATTCTTAA
TGTCAAGCGGCCTGTACGGAAGTGGAATGAAACTTACATTGCCTTGATTCCTAAAATTCAACAACCTAAAGCAGTTGTTGATTATAGGCCCATCGGATCAAGCTCCACAA
AGAGAATTACAGGAATAAAACCTGGTAGACACTGTCCACCCATTACTCACCTTTTCTTTGTAGATGATCGTCTACTGTTCTGTAGTGGTAAACTAGAAGAAGTATGGCAT
GTTCGACGTTTATTAAGCATTTATGAGAATGCTTCAGGACAAGCTGTCAATTTCAATAAATCAGCATTATGTTTTTCTCCAAATGTAAATGAGGAGTTTCGATGTGTTTT
ATCTGATTTCTTGGCCTTGCCAATTGTTTTTGATTTGGGCAGATATCTAGACGTTCCAACTACATTCACAAGGAGACGAAGTGATGATTGTAATCCAATCAAAGAACGAG
TTTGGAGAACTCTTCAAGGGTGGAAAGTTATGGATGCACATACTCAGATTGTAACGTGTGTTACCTTCACGTCATTGGAGTTGAAACACATTCCAAAAAATCTAGAGTTT
CGGGAGGATCCTGCTGATAAGACTAGGCAAAGGCCCAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGATGCATAAAAATATCCCTTTAGTAGA
AGCTATTGAGCAAATGCCTAACCATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTA
TTCTTAAGAATGAGCTACCACCCAAGGCTAAGGATCCGGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTCTGTGATCTAGGTATTGGT
GAAGCTAGGCCTACCACAGTCACACTCCCACTAGCTGATATGTCTATCACATATCCAGAAGGTAAAATTGAGGATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGT
TGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAGAAAGGGGAATTAA
CAATGAGAGTCTGTAATGAGGAAGTAAAGTTTAATGTGTTTAAAGCCATGAAATATCCGGACGAAATGGAAGATTGTTCTTTCATTAGGATTCTGGAGAGCACAGTTATT
GAGACAACAATACAGGATTCGGCTAGTAAGCATTCGGAAGAGCATGGAGAGGTTAGTGAAGAAGATTTGCAGGTTTGTTTGTTAGAAAGAAAAAATGAAAAAGAATTGTT
TAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCGATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGCCCTTGTCGGATC
ATCTAAAGTATGTGTATCTTGGGGAAGGAGAGACGTTGCCCATTATTGTTGCATCAGATTTATTGTCGGAGCATGAAGAGGCCTTAATAAAATTGTTGCAACAATACCGC
AAGGCTATAGGTTGGACATTGGCTGACATTCAGGGAGTTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAG
GCTTAACCCTGCAATGAAAGAGGTTGTAAAAAAAGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTGAGCCCTGTCTTATGTG
TTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTAATCCCAACCAGGACAGTAACTGGCTGGAGGATTACCATTGCTCCTGAGGATCAGGAAAAA
ACCACTTTTACCTGCCCGTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGGTCTTTGCAATGCTCCAGCAACATTTCAACGTTGTATGTTGGCAATTTTTTCTGATAT
GATTGAGTCTACTGTTGAGGTATTCATGGACGATTTTTCATTGTTTGGACGGTCTATTCAGAGTTGTTTAGATAATTTAGGACAGGTGTTAAAGAGGTGTGAGGATACCC
ATCTAGTTCTTAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACCCTTCAAAAGCTCCAGGACCAGACGGTTTTCCTACTCTATTTTATCGGAGATATTGGTCAAAGGTAAGTAAAACCATTATCTGTAATGTTTTGGATATTCTTAA
TGTCAAGCGGCCTGTACGGAAGTGGAATGAAACTTACATTGCCTTGATTCCTAAAATTCAACAACCTAAAGCAGTTGTTGATTATAGGCCCATCGGATCAAGCTCCACAA
AGAGAATTACAGGAATAAAACCTGGTAGACACTGTCCACCCATTACTCACCTTTTCTTTGTAGATGATCGTCTACTGTTCTGTAGTGGTAAACTAGAAGAAGTATGGCAT
GTTCGACGTTTATTAAGCATTTATGAGAATGCTTCAGGACAAGCTGTCAATTTCAATAAATCAGCATTATGTTTTTCTCCAAATGTAAATGAGGAGTTTCGATGTGTTTT
ATCTGATTTCTTGGCCTTGCCAATTGTTTTTGATTTGGGCAGATATCTAGACGTTCCAACTACATTCACAAGGAGACGAAGTGATGATTGTAATCCAATCAAAGAACGAG
TTTGGAGAACTCTTCAAGGGTGGAAAGTTATGGATGCACATACTCAGATTGTAACGTGTGTTACCTTCACGTCATTGGAGTTGAAACACATTCCAAAAAATCTAGAGTTT
CGGGAGGATCCTGCTGATAAGACTAGGCAAAGGCCCAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGATGCATAAAAATATCCCTTTAGTAGA
AGCTATTGAGCAAATGCCTAACCATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTGCTA
TTCTTAAGAATGAGCTACCACCCAAGGCTAAGGATCCGGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTCTGTGATCTAGGTATTGGT
GAAGCTAGGCCTACCACAGTCACACTCCCACTAGCTGATATGTCTATCACATATCCAGAAGGTAAAATTGAGGATGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGT
TGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAGAAAGGGGAATTAA
CAATGAGAGTCTGTAATGAGGAAGTAAAGTTTAATGTGTTTAAAGCCATGAAATATCCGGACGAAATGGAAGATTGTTCTTTCATTAGGATTCTGGAGAGCACAGTTATT
GAGACAACAATACAGGATTCGGCTAGTAAGCATTCGGAAGAGCATGGAGAGGTTAGTGAAGAAGATTTGCAGGTTTGTTTGTTAGAAAGAAAAAATGAAAAAGAATTGTT
TAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCGATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGCCCTTGTCGGATC
ATCTAAAGTATGTGTATCTTGGGGAAGGAGAGACGTTGCCCATTATTGTTGCATCAGATTTATTGTCGGAGCATGAAGAGGCCTTAATAAAATTGTTGCAACAATACCGC
AAGGCTATAGGTTGGACATTGGCTGACATTCAGGGAGTTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAG
GCTTAACCCTGCAATGAAAGAGGTTGTAAAAAAAGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTGAGCCCTGTCTTATGTG
TTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTAATCCCAACCAGGACAGTAACTGGCTGGAGGATTACCATTGCTCCTGAGGATCAGGAAAAA
ACCACTTTTACCTGCCCGTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGGTCTTTGCAATGCTCCAGCAACATTTCAACGTTGTATGTTGGCAATTTTTTCTGATAT
GATTGAGTCTACTGTTGAGGTATTCATGGACGATTTTTCATTGTTTGGACGGTCTATTCAGAGTTGTTTAGATAATTTAGGACAGGTGTTAAAGAGGTGTGAGGATACCC
ATCTAGTTCTTAATTAG
Protein sequenceShow/hide protein sequence
MHPSKAPGPDGFPTLFYRRYWSKVSKTIICNVLDILNVKRPVRKWNETYIALIPKIQQPKAVVDYRPIGSSSTKRITGIKPGRHCPPITHLFFVDDRLLFCSGKLEEVWH
VRRLLSIYENASGQAVNFNKSALCFSPNVNEEFRCVLSDFLALPIVFDLGRYLDVPTTFTRRRSDDCNPIKERVWRTLQGWKVMDAHTQIVTCVTFTSLELKHIPKNLEF
REDPADKTRQRPKNQDGQFKKFLEILKQMHKNIPLVEAIEQMPNHAKFLKDILTKKKRLGEFETVSLTEECSAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGIG
EARPTTVTLPLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVI
ETTIQDSASKHSEEHGEVSEEDLQVCLLERKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLSDHLKYVYLGEGETLPIIVASDLLSEHEEALIKLLQQYR
KAIGWTLADIQGVSPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVLCVPKKGGVTVVSNKDNELIPTRTVTGWRITIAPEDQEK
TTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSLFGRSIQSCLDNLGQVLKRCEDTHLVLN