; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018032 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018032
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA-directed DNA polymerase
Genome locationChr03:29313980..29337371
RNA-Seq ExpressionHG10018032
SyntenyHG10018032
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR002298 - DNA polymerase A
IPR002421 - 5'-3' exonuclease
IPR020045 - DNA polymerase I-like, H3TH domain
IPR020046 - 5'-3' exonuclease, alpha-helical arch, N-terminal
IPR029060 - PIN-like domain superfamily
IPR036279 - 5'-3' exonuclease, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603161.1 DNA polymerase I, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8958.1Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLA+RSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

KAG7033480.1 DNA polymerase I, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-8958.1Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLA+RSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

XP_022933106.1 uncharacterized protein LOC111439872 isoform X1 [Cucurbita moschata]5.7e-9058.38Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

XP_022967800.1 uncharacterized protein LOC111467206 [Cucurbita maxima]5.7e-9058.38Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

XP_023543411.1 uncharacterized protein LOC111803303 isoform X1 [Cucurbita pepo subsp. pepo]5.7e-9058.38Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

TrEMBL top hitse value%identityAlignment
A0A6J1D5H5 DNA-directed DNA polymerase4.2e-8656.42Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRH  YPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVA G    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYG LEPSQFVDV+SLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNA QAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSK+LAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSL+TAIGAYAEGFSADPIIRR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

A0A6J1D5H7 DNA-directed DNA polymerase4.2e-8656.42Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRH  YPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVA G    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYG LEPSQFVDV+SLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNA QAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSK+LAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSL+TAIGAYAEGFSADPIIRR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

A0A6J1EY78 DNA-directed DNA polymerase2.8e-9058.38Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

A0A6J1F3S3 DNA-directed DNA polymerase1.8e-8959.09Show/hide
Query:  GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVV
        GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++          
Subjt:  GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVV

Query:  QNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNA
              +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                           
Subjt:  QNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNA

Query:  KFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLA
                                                       GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAILSKDLA
Subjt:  KFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLA

Query:  ILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        ILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  ILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

A0A6J1HVF9 DNA-directed DNA polymerase2.8e-9058.38Show/hide
Query:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD
        + F  +GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAG    + S  +        SL LL ++    
Subjt:  KGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRD

Query:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI
                    +E             MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIP                                     
Subjt:  WHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTI

Query:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI
                                                             GVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERI+KMLVTNAEQAI
Subjt:  IPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAI

Query:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR
        LSKDLAILRSDLP+YMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPI+RR
Subjt:  LSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRR

SwissProt top hitse value%identityAlignment
O52225 DNA polymerase I, thermostable3.1e-1424.68Show/hide
Query:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH
        F  +  +FRH  Y AYK+ R PTP+   + L  +K  +  + +  +E PG EADDV+GTLA ++   G       E RI+  D             RD+ 
Subjt:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH

Query:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIP
         ++ +  +       ++   G LV       +D  EKYGV  P ++VD  +L GD+SDNIP                                       
Subjt:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIP

Query:  IPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILS
                                                           GV GIG   A++L+  +G++ENLL+++D+V+ + +++ +  + E   LS
Subjt:  IPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILS

Query:  KDLAILRSDLPV
         DLA +R+DLP+
Subjt:  KDLAILRSDLPV

P00582 DNA polymerase I3.1e-1434.36Show/hide
Query:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH
        F  +G TFR   +  YKS+RPP PD +   ++ L A +K+M + ++ V GVEADDVIGTLA  +         +  GR V+  TG           +D  
Subjt:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH

Query:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDV
         +V  N T +             +     G E+   KYGV  P   +D ++L+GD SDNIP V
Subjt:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDV

P43741 DNA polymerase I1.1e-1427.94Show/hide
Query:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH
        F  +G TFR   +  YKS+RPP PD + + +Q L   I+++ + ++ V G+EADDVIGTLAL++          S G+ V+  TG       ++ L D +
Subjt:  FFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWH

Query:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIP
        ++++              N+  L        +   EKYG+  P   +D ++L+GD +DNIP V  V        T +G  Q IG      A+      +P
Subjt:  MIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIP

Query:  IPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLE
        I  AK                L+ E+ + D  +T+++I     DV +++      +G     QLI  F   E
Subjt:  IPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLE

P52026 DNA polymerase I3.7e-1524.33Show/hide
Query:  STFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVVQ
        +TFRH  +  YK  R  TP  + +    L+  +K+  +   E+   EADD+IGT+A R+   GF   + S  R              L+ L    + V  
Subjt:  STFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVVQ

Query:  NRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNAK
         +  + +             + S+  E   EKYG L P Q VD+  L+GDKSDNIP                                            
Subjt:  NRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNAK

Query:  FNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLAI
                                                      GV GIG   AV+L+ +FGT+EN+L  +D+++ E++K+ L    + A+LSK LA 
Subjt:  FNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLAI

Query:  LRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIG
        +  D PV +   T  D+++K   ++ EK  +L   +G
Subjt:  LRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIG

Q04957 DNA polymerase I5.7e-1624.93Show/hide
Query:  STFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVVQ
        +TFRH  +  YK  R  TP  + +    L+  +++  +   E+   EADD+IGTLA R+   GF   + S  R              L+ L   H+ V  
Subjt:  STFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVVQ

Query:  NRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNAK
         +  + +             +  +  E   EKYG L P Q VD+  L+GDKSDNIP                                            
Subjt:  NRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNAK

Query:  FNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLAI
                                                      GV GIG   AV+L+ +FGT+EN+L  +D+++ E++K+ L  + E A+LSK LA 
Subjt:  FNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLAI

Query:  LRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIG
        +R D PV +    + D +  + ED  EK  +L   +G
Subjt:  LRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIG

Arabidopsis top hitse value%identityAlignment
AT3G52050.1 5'-3' exonuclease family protein4.7e-7447.61Show/hide
Query:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV
        +G  FRHT YPAYKSNRPPTPDTIVQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA+RS++AGF   + S  +        SL LL L          
Subjt:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV

Query:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN
            TP   G            M SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIP                                          
Subjt:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN

Query:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL
                                                        GVDGIGNV+AV+LI+RFGTLENLLQ VD++++ +IK+ L+ +A+QAILSK L
Subjt:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL

Query:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF
        A+LRSDLP Y+VPF T+DL FKKPEDNGEK +SLL AI  YAEGFSADP+IRR F
Subjt:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF

AT3G52050.2 5'-3' exonuclease family protein4.7e-7447.61Show/hide
Query:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV
        +G  FRHT YPAYKSNRPPTPDTIVQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA+RS++AGF   + S  +        SL LL L          
Subjt:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV

Query:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN
            TP   G            M SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIP                                          
Subjt:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN

Query:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL
                                                        GVDGIGNV+AV+LI+RFGTLENLLQ VD++++ +IK+ L+ +A+QAILSK L
Subjt:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL

Query:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF
        A+LRSDLP Y+VPF T+DL FKKPEDNGEK +SLL AI  YAEGFSADP+IRR F
Subjt:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF

AT3G52050.3 5'-3' exonuclease family protein4.7e-7447.61Show/hide
Query:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV
        +G  FRHT YPAYKSNRPPTPDTIVQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA+RS++AGF   + S  +        SL LL L          
Subjt:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV

Query:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN
            TP   G            M SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIP                                          
Subjt:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN

Query:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL
                                                        GVDGIGNV+AV+LI+RFGTLENLLQ VD++++ +IK+ L+ +A+QAILSK L
Subjt:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL

Query:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF
        A+LRSDLP Y+VPF T+DL FKKPEDNGEK +SLL AI  YAEGFSADP+IRR F
Subjt:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF

AT3G52050.4 5'-3' exonuclease family protein4.7e-7447.61Show/hide
Query:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV
        +G  FRHT YPAYKSNRPPTPDTIVQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA+RS++AGF   + S  +        SL LL L          
Subjt:  RGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIV

Query:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN
            TP   G            M SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIP                                          
Subjt:  VQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPN

Query:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL
                                                        GVDGIGNV+AV+LI+RFGTLENLLQ VD++++ +IK+ L+ +A+QAILSK L
Subjt:  AKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDL

Query:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF
        A+LRSDLP Y+VPF T+DL FKKPEDNGEK +SLL AI  YAEGFSADP+IRR F
Subjt:  AILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPF

AT3G52050.5 5'-3' exonuclease family protein1.5e-4341.4Show/hide
Query:  GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVV
        G  FRHT YPAYKSNRPPTPDTIVQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA+RS++AGF   + S  +        SL LL L           
Subjt:  GSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVV

Query:  QNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNA
           TP   G            M SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIP                                           
Subjt:  QNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNA

Query:  KFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKM
                                                       GVDGIGNV+AV+LI+RFGTLENLLQ VD++++ +IK++
Subjt:  KFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIKKM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAAAAATGGGTTCAAGAATCATGTTGTTGTCCTTCAAGAACATCAAGGCAACCCACATCGAGTCCAAAATCGAGTGGCCATGTTGCAAGAACATCAAGATAGGAT
TACAAATCTTAATGTTTTAGGTTCTTATTTGACTATGTTCCCAATAGAACCAATTTTGACTATTGTTGCCTGTCTCAATGTGAGAGACCCTTCTCTAAGGCTATTTAAGA
AGGATGTTGTAGAGACTGCAAAATCCCAATTTTCACAAAATCATAGTGATCACCTTGTCATTATCCGAGCCTATGGGGGACAGAAAGAAGTTGAGAAGAACTTTGTCAAT
AGGGGTTTGGGCAGTGCGGCGGAGGTGATATTGGTGGCGAAGACGGTGGCTAGTGCTGCTGGACAGAAACACAATGAGAGAGAGAGAGAGAGTTGTGAGAGAGAGAATGG
AAGAGAGAAGAAGAGCGCTTATTGGCGGACCGGATATGCCCATCAGCCAAATTATTATGGTTTAATGCTACCAAGGCGATTGTTTCATATATTTAGTTGGCAAGAAATAA
AAGGATTTTTTAAGAGAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCTACACCTGATACCATTGTCCAGGGGCTCCAGTACTTGAAG
GCATCCATAAAGTCCATGTCTGTTAAGGTGATTGAGGTACCTGGAGTTGAAGCTGATGATGTAATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTTCATTTTCTC
TATGGACTCTGAGGGTAGAATAGTTATATGTGATACTGGTATGTCACTAATGTTGCTCTCCTTATCGTATTTGAGGGATTGGCATATGATTGTTGTCCAAAACAGAACAC
CTGTCTATGAGGGTAGGTGCATGGTCTTCAATAGTGGTTGCCTGGTTGGGATGGTTTCTTTTGGGTTGGAGGATTTTGCTGAAAAATATGGAGTTCTGGAACCTTCTCAG
TTTGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGATGTAATTGAAGTATGTTGCGCTCTTGCCCAAGTTCCAACATTAATTGGAAGATATCAAGA
TATTGGAGGCAAAAAATCTTGCTTAGCCCATACTGCTACAAGAACGATTATTCCAATTCCAAATGCAAAATTCAATTTAACGAAAGAAGTTCCATCCATTTCAAGTCCAA
AAAGCCATTTGATTGAGGAAGAGTATGATGATGATTCTTTGTTTACAATGTCTTCTATCGACTTGATGGTGCTTGATGTTCTTATATCGCTAAATGCAGGAGTCGATGGA
ATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACGTTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATTAAGAAGATGTTGGTAAC
AAATGCCGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGACCTTCCAGTCTACATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGG
ATAATGGGGAGAAATTCACAAGCCTCTTAACTGCGATTGGTGCATATGCAGAAGGGTTCTCAGCTGATCCAATAATCAGGAGACCTTTTAGCAAAGAAAATAGGAAACGC
AAGAAGTTTCAGCTTAACATGTTATGGAGTATCCCTTGCTATGAAGGACTGAACTGGCTTCTCCCGTTGATTGATTATTTGAACATTCCAACCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAAAAATGGGTTCAAGAATCATGTTGTTGTCCTTCAAGAACATCAAGGCAACCCACATCGAGTCCAAAATCGAGTGGCCATGTTGCAAGAACATCAAGATAGGAT
TACAAATCTTAATGTTTTAGGTTCTTATTTGACTATGTTCCCAATAGAACCAATTTTGACTATTGTTGCCTGTCTCAATGTGAGAGACCCTTCTCTAAGGCTATTTAAGA
AGGATGTTGTAGAGACTGCAAAATCCCAATTTTCACAAAATCATAGTGATCACCTTGTCATTATCCGAGCCTATGGGGGACAGAAAGAAGTTGAGAAGAACTTTGTCAAT
AGGGGTTTGGGCAGTGCGGCGGAGGTGATATTGGTGGCGAAGACGGTGGCTAGTGCTGCTGGACAGAAACACAATGAGAGAGAGAGAGAGAGTTGTGAGAGAGAGAATGG
AAGAGAGAAGAAGAGCGCTTATTGGCGGACCGGATATGCCCATCAGCCAAATTATTATGGTTTAATGCTACCAAGGCGATTGTTTCATATATTTAGTTGGCAAGAAATAA
AAGGATTTTTTAAGAGAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCTACACCTGATACCATTGTCCAGGGGCTCCAGTACTTGAAG
GCATCCATAAAGTCCATGTCTGTTAAGGTGATTGAGGTACCTGGAGTTGAAGCTGATGATGTAATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTTCATTTTCTC
TATGGACTCTGAGGGTAGAATAGTTATATGTGATACTGGTATGTCACTAATGTTGCTCTCCTTATCGTATTTGAGGGATTGGCATATGATTGTTGTCCAAAACAGAACAC
CTGTCTATGAGGGTAGGTGCATGGTCTTCAATAGTGGTTGCCTGGTTGGGATGGTTTCTTTTGGGTTGGAGGATTTTGCTGAAAAATATGGAGTTCTGGAACCTTCTCAG
TTTGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGATGTAATTGAAGTATGTTGCGCTCTTGCCCAAGTTCCAACATTAATTGGAAGATATCAAGA
TATTGGAGGCAAAAAATCTTGCTTAGCCCATACTGCTACAAGAACGATTATTCCAATTCCAAATGCAAAATTCAATTTAACGAAAGAAGTTCCATCCATTTCAAGTCCAA
AAAGCCATTTGATTGAGGAAGAGTATGATGATGATTCTTTGTTTACAATGTCTTCTATCGACTTGATGGTGCTTGATGTTCTTATATCGCTAAATGCAGGAGTCGATGGA
ATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACGTTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATTAAGAAGATGTTGGTAAC
AAATGCCGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGACCTTCCAGTCTACATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGG
ATAATGGGGAGAAATTCACAAGCCTCTTAACTGCGATTGGTGCATATGCAGAAGGGTTCTCAGCTGATCCAATAATCAGGAGACCTTTTAGCAAAGAAAATAGGAAACGC
AAGAAGTTTCAGCTTAACATGTTATGGAGTATCCCTTGCTATGAAGGACTGAACTGGCTTCTCCCGTTGATTGATTATTTGAACATTCCAACCATATGA
Protein sequenceShow/hide protein sequence
MSKNGFKNHVVVLQEHQGNPHRVQNRVAMLQEHQDRITNLNVLGSYLTMFPIEPILTIVACLNVRDPSLRLFKKDVVETAKSQFSQNHSDHLVIIRAYGGQKEVEKNFVN
RGLGSAAEVILVAKTVASAAGQKHNERERESCERENGREKKSAYWRTGYAHQPNYYGLMLPRRLFHIFSWQEIKGFFKRGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLK
ASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGFIFSMDSEGRIVICDTGMSLMLLSLSYLRDWHMIVVQNRTPVYEGRCMVFNSGCLVGMVSFGLEDFAEKYGVLEPSQ
FVDVMSLVGDKSDNIPDVIEVCCALAQVPTLIGRYQDIGGKKSCLAHTATRTIIPIPNAKFNLTKEVPSISSPKSHLIEEEYDDDSLFTMSSIDLMVLDVLISLNAGVDG
IGNVNAVQLITRFGTLENLLQHVDQVEDERIKKMLVTNAEQAILSKDLAILRSDLPVYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIIRRPFSKENRKR
KKFQLNMLWSIPCYEGLNWLLPLIDYLNIPTI