; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G03540 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G03540
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA glycosylase superfamily protein
Genome locationClcChr09:2703755..2707106
RNA-Seq ExpressionClc09G03540
SyntenyClc09G03540
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]3.6e-18784.17Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VV DT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDP AVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKD-GEI
           VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECIETQTAEKGE+D GE+
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKD-GEI

Query:  KLTVNEKMPEALKNLEL
        KL  NEKMPEALKNLEL
Subjt:  KLTVNEKMPEALKNLEL

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]6.6e-18984.38Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK
           VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIETQTAEKGE+DGE+K
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK

Query:  LTVNEKMPEALKNLEL
        L  NEKMPEALKNLEL
Subjt:  LTVNEKMPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]4.9e-17681.25Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVE V   +V VV DT   LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK
           VIDE GSF+VYIWNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GEKDGEIK
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK

Query:  LTVNEKMPEALKNLEL
          +NEK+PEALKNLEL
Subjt:  LTVNEKMPEALKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata]2.4e-17580.1Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVE V A  VVA  +T GCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGM
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCK                                          
Subjt:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGM

Query:  QTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEI
            VIDE GSFNVY+WNFVNHKP +SQFRYPRQVPDKTSKA+VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFQECIE  T EKGE+DG+I
Subjt:  QTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEI

Query:  KLTVNEKMPEALKNLEL
        K T+ EK+PEALKNLEL
Subjt:  KLTVNEKMPEALKNLEL

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida]4.5e-19084.73Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKA-----VESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL
        SRASSARGTRQRGPNLRRKQ+STVKGA K+     VES A V VVADT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKA-----VESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL

Query:  TWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLS
        TWPAILNKR+LFREIFLDFDPN VSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCK                                        
Subjt:  TWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLS

Query:  GMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDG
              VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDG
Subjt:  GMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDG

Query:  EIKLTVNEKMPEALKNLEL
        EIKLTVNEKMPEALKNLEL
Subjt:  EIKLTVNEKMPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein1.7e-18784.17Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VV DT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDP AVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKD-GEI
           VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECIETQTAEKGE+D GE+
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKD-GEI

Query:  KLTVNEKMPEALKNLEL
        KL  NEKMPEALKNLEL
Subjt:  KLTVNEKMPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]3.2e-18984.38Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK
           VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIETQTAEKGE+DGE+K
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK

Query:  LTVNEKMPEALKNLEL
        L  NEKMPEALKNLEL
Subjt:  LTVNEKMPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase3.2e-18984.38Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK
           VIDE GSFNVY+WNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIETQTAEKGE+DGE+K
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK

Query:  LTVNEKMPEALKNLEL
        L  NEKMPEALKNLEL
Subjt:  LTVNEKMPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223412.4e-17681.25Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVE V   +V VV DT   LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ
        AILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCK                                           
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQ

Query:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK
           VIDE GSF+VYIWNFVNHKPI+SQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GEKDGEIK
Subjt:  THTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIK

Query:  LTVNEKMPEALKNLEL
          +NEK+PEALKNLEL
Subjt:  LTVNEKMPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610811.2e-17580.1Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVE V A  VVA  +T GCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGM
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCK                                          
Subjt:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGM

Query:  QTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEI
            VIDE GSFNVY+WNFVNHKP +SQFRYPRQVPDKTSKA+VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFQECIE  T EKGE+DG+I
Subjt:  QTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEI

Query:  KLTVNEKMPEALKNLEL
        K T+ EK+PEALKNLEL
Subjt:  KLTVNEKMPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 18.1e-3334.07Show/hide
Query:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG

Query:  RQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRG
        R   +++ N                                                 F  ++W+FVNH+P V+Q     ++P  TS ++ +SK L KRG
Subjt:  RQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRG

Query:  FRSVGPTVIYTFMQVAGLTNDHLISC
        F+ VG T+ Y+FMQ  GL NDH++ C
Subjt:  FRSVGPTVIYTFMQVAGLTNDHLISC

P44321 DNA-3-methyladenine glycosylase1.0e-2732.3Show/hide
Query:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR

Query:  QMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELG-SFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRG
                                                        +  +++ G +F+ +IW+FVNHKPIV+     R VP KT  ++ +SK L KRG
Subjt:  QMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELG-SFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRG

Query:  FRSVGPTVIYTFMQVAGLTNDHLISC
        F  +G T  Y FMQ  GL +DHL  C
Subjt:  FRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.4e-3233.08Show/hide
Query:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKV
        +  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP+ V+  +E K+         + +  K+
Subjt:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKV

Query:  RAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVIS
         A I N +    V                                                E GSF+ YIW FV  KPI++ F     +P  T  ++ I+
Subjt:  RAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF-----QECIETQTAEKGEKDGEIKLTVNEK
        KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+       +C +     +G       LT N K
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF-----QECIETQTAEKGEKDGEIKLTVNEK

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein1.0e-8646.96Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTVPSVLRQQDRHQAILNLSMNA
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E   ++SKD++      P SP     QC ++ S + +++      + S +A
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTVPSVLRQQDRHQAILNLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL
        S S ++S  S  S +S  +  R+ G     ++ S  K  +K      A G           +KRCAW+TP  DPCY AFHDEEWGVPVHDDKKLFELLCL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL

Query:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC
        SGALAEL+W  IL++RH+ RE+F+DFDP AV++LN+KK+ APG+AA SLLSE+K+R+I++N R + K                                 
Subjt:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC

Query:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQEC---IET
                     +I E GS   Y+WNFVN+KP  SQFRY RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+Q+C    ET
Subjt:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQEC---IET

Query:  QTAEKGEKDGE
         T  K +K  E
Subjt:  QTAEKGEKDGE

AT1G75090.1 DNA glycosylase superfamily protein4.2e-6138.32Show/hide
Query:  PLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGV
        P+K +++ R  + S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G   +    S   G +K    VA+V V
Subjt:  PLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGV

Query:  VADTAGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLL
        V D +  +    KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL +R  FR++F +FDP+A+++  EK++++       +L
Subjt:  VADTAGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLL

Query:  SELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSK
        SE K+RAI+EN + + KV                                                E GSF+ Y W FVNHKP+ + +RY RQVP K+ K
Subjt:  SELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQEC-IETQTAEKGEKDGEIKLTVNEKM
        AE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+QEC +ET+   K   + E KL ++  +
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQEC-IETQTAEKGEKDGEIKLTVNEKM

AT1G80850.1 DNA glycosylase superfamily protein1.5e-8247Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS   +
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAI
        S  S    +   G  + R+  S    +             A    C + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELL LSGALAEL+W  I
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAI

Query:  LNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTH
        L+KR LFRE+F+DFDP A+S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK                                             
Subjt:  LNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTH

Query:  TVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI
         +I   GSF+ YIWNFVN KP  SQFRYPRQVP KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+
Subjt:  TVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI

AT5G57970.1 DNA glycosylase superfamily protein1.9e-9349.87Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL
        S SSDAS DSF+SRAS+ R  R      R K   +         SV + G +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL L
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL

Query:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC
        SGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ K                                 
Subjt:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC

Query:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI
                     VI+E GSF+ YIW+FV +K IVS+FRY RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI

AT5G57970.2 DNA glycosylase superfamily protein1.9e-9349.87Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL
        S SSDAS DSF+SRAS+ R  R      R K   +         SV + G +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL L
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL

Query:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC
        SGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ K                                 
Subjt:  SGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALC

Query:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI
                     VI+E GSF+ YIW+FV +K IVS+FRY RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  LRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACAGGGAACAAAGCGCGTACAGTAGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAACTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCGCCGCCTCAATGCGTTACAGTTCCATCGGTTTTGAGAC
AACAGGACCGCCACCAAGCGATTCTCAATCTGTCGATGAATGCTTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGCCGGGCATCTAGTGCAAGGGGTACAAGA
CAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAGGGGGCTGACAAGGCCGTTGAAAGTGTGGCGGCGGTGGGGGTGGTGGCGGATACAGCTGGTTGCTT
AGAGTCCAAAAAAAGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTTG
AACTGCTTTGCCTATCGGGCGCGTTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATCTTTTTGGACTTTGACCCAAATGCCGTTTCA
AAATTAAACGAGAAAAAGATGGTTGCTCCAGGAAGTGCTGCTACTTCTTTACTGTCAGAACTCAAGGTTCGAGCTATCATTGAAAACGGTCGTCAAATGTGCAAGGTAGA
TGCTAACTCTCGCCTCTCCTTTCGGATTGTATATCTGTTTGTTTTGTTGCATAAAACAGAAATGTTGTCTATGTTGAATCTGAATGCGCTCTGTTTGAGAATTCTTGTCT
TGTCTGGCATGCAAACACACACAGTAATTGATGAACTTGGTTCCTTCAACGTGTACATCTGGAACTTTGTGAACCATAAACCGATCGTCAGTCAGTTCCGGTACCCACGT
CAAGTCCCAGATAAGACGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGG
GTTAACTAATGACCATCTCATCAGTTGCTTTAGGTTTCAAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAAAGATGGTGAAATCAAGCTTACTGTTAATGAGA
AAATGCCAGAGGCTTTGAAAAACTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
TATTTTTCCATTTCTCTCTCTCTCTCTCTCGTTTCTCTTTTGATTTTCATACTCTCAAACACAATCATGGCCGTTGCGAATTTTTCATCATCCACTTCCAGATAAGTTTT
TCGCCCTTTCAGTTCTTTTTTTTCATCAATGTCGCTCCATTTTTCAACCCCACTTAGGGTTCTGCCTCTTCTTCTCTCTCATTCACTCCCAAACCCTACTGTTTCAAACC
CCTTTTGACCTTTTCTGCTCCAATGAACTAAAGGGGTTCCTTTTTTTGTTTCTTTAATTTGACCCCTGGTTGATTTCTACACCCCAAATTGGGGTTCTGGAGATTATTAT
CTGTGTTTCTGTTTGTGCATTTTTTAATTGAGAAGAAAAAAATTTCATTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGG
TACTTGGGCCTACAGGGAACAAAGCGCGTACAGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAACTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAA
AGGGTGCCATTGTCGCCGCCTCAATGCGTTACAGTTCCATCGGTTTTGAGACAACAGGACCGCCACCAAGCGATTCTCAATCTGTCGATGAATGCTTCGTGTTCTTCTGA
TGCGTCGTCTGATTCGTTTAATAGCCGGGCATCTAGTGCAAGGGGTACAAGACAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAGGGGGCTGACAAGG
CCGTTGAAAGTGTGGCGGCGGTGGGGGTGGTGGCGGATACAGCTGGTTGCTTAGAGTCCAAAAAAAGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCT
TTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTTGAACTGCTTTGCCTATCGGGCGCGTTGGCTGAACTTACATGGCCTGCCATCCTCAACAA
AAGACATCTATTTAGGGAAATCTTTTTGGACTTTGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAGATGGTTGCTCCAGGAAGTGCTGCTACTTCTTTACTGTCAG
AACTCAAGGTTCGAGCTATCATTGAAAACGGTCGTCAAATGTGCAAGGTAGATGCTAACTCTCGCCTCTCCTTTCGGATTGTATATCTGTTTGTTTTGTTGCATAAAACA
GAAATGTTGTCTATGTTGAATCTGAATGCGCTCTGTTTGAGAATTCTTGTCTTGTCTGGCATGCAAACACACACAGTAATTGATGAACTTGGTTCCTTCAACGTGTACAT
CTGGAACTTTGTGAACCATAAACCGATCGTCAGTCAGTTCCGGTACCCACGTCAAGTCCCAGATAAGACGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAG
GGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATCAGTTGCTTTAGGTTTCAAGAATGTATAGAGACACAA
ACAGCAGAGAAAGGAGAAAAAGATGGTGAAATCAAGCTTACTGTTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAAAGAAACCCATTGGTAGCCTTGAA
CCTTGCCTCAGTGTAATTAGCTTCCAGAGTTCTTTTTTTCTTTTC
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVS
KLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVDANSRLSFRIVYLFVLLHKTEMLSMLNLNALCLRILVLSGMQTHTVIDELGSFNVYIWNFVNHKPIVSQFRYPR
QVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECIETQTAEKGEKDGEIKLTVNEKMPEALKNLEL