CuGenDBv2

CmoCh08G005810 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh08G005810
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DNA glycosylase superfamily protein
Genome location: Cmo_Chr08:3554457..3558319
RNA-Seq Expression: CmoCh08G005810
Synteny: CmoCh08G005810
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
    IPR005019 - Methyladenine glycosylase
    IPR011257 - DNA glycosylase


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia] | e value: 9.0e-206 | % identity: 99.46
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRF ECIETTEKGERDGDIKPTIIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia] | e value: 7.4e-184 | % identity: 90.79
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RK G KPLKKLEKPHQEAESKDKRVPLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVV   +TV  LEPKKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSF+VY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIET E+GE+DG+IKP I EKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata] | e value: 6.2e-207 | % identity: 100
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

XP_023004117.1 uncharacterized protein LOC111497544 [Cucurbita maxima] | e value: 2.1e-202 | % identity: 97.83
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSSTVK+AEKA+EKVGAESVVA  NTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQM KVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRF ECIETTEKGERDGDIKP+IIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo] | e value: 2.9e-204 | % identity: 98.64
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSS+VKRAEKAVEKVGAESVVA  NTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRF ECIETTEKGERDGDIKPTIIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing] | e value: 8.8e-183 | % identity: 89.49
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVK A+KAVEKVG ESV    +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        P IL KRHLFRE FLDFDP  VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIE--TTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIE--TTEKGERDGDIKPTIIEKIPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase | e value: 8.8e-183 | % identity: 89.49
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVK A+KAVEKVG ESV    +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        P IL KRHLFRE FLDFDP  VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIE--TTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIE--TTEKGERDGDIKPTIIEKIPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC111022341 | e value: 3.6e-184 | % identity: 90.79
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RK G KPLKKLEKPHQEAESKDKRVPLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVV   +TV  LEPKKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSF+VY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIET E+GE+DG+IKP I EKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC111461081 | e value: 3.0e-207 | % identity: 100
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

A0A6J1KPI7 uncharacterized protein LOC111497544 | e value: 1.0e-202 | % identity: 97.83
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRKSSSTVK+AEKA+EKVGAESVVA  NTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS
        PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQM KVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKA+VIS
Subjt:  PTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRF ECIETTEKGERDGDIKP+IIEKIPEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 4.7e-40 | % identity: 44.44
Query:  KRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENG
        +RC WV+   DP Y A+HD EWGVP  D KKLFE++CL G  A L+W T+LKKR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENG

Query:  RQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSC
        R   ++      F  +VW+FVNH+P ++Q     ++P  TS +D +SK L KRGF+ VG T+ Y+FMQ  GL NDH+V C
Subjt:  RQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSC

P44321 DNA-3-methyladenine glycosylase | e value: 1.9e-33 | % identity: 39.66
Query:  RCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGR
        RC WV   +   Y  +HD+EWG P  D +KLFE +CL G  A L+W T+LKKR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGR

Query:  QMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSC
            +     +F+ ++W+FVNHKP ++     R VP KT  +  +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 1.2e-40 | % identity: 45.7
Query:  KKRCAWVTSNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAI
        K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W TILKKR  FR  F DFDP+ V+  +E K+         + +  K+ A 
Subjt:  KKRCAWVTSNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAI

Query:  IENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFR
        I N +    V  EFGSF+ Y+W FV  KP I+ F     +P  T  +D I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+
Subjt:  IENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFR

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein | e value: 2.7e-91 | % identity: 52.94
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQE---AESKDKR-----VPLSP----PQCVTTVPSVLRQQDRHQAILTLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RK    P  KLEKP  E    +SKD++      P SP     QC +   S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQE---AESKDKR-----VPLSP----PQCVTTVPSVLRQQDRHQAILTLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAE--KAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLF
        AS SSDASS   +S  S A  +  +    R  S S+ ++    K  EKV  +            + +KRCAW+T   DPCY AFHDEEWGVPVHDDKKLF
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAE--KAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLF

Query:  ELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYP
        ELLCLSGALAEL+W  IL +RH+ RE F+DFDP AV++LN+KK+ APG+AA SLLSE K+R+I++N R + K+I E GS   Y+WNFVN+KPT SQFRY 
Subjt:  ELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYP

Query:  RQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQEC---IETT------EKGERDGD
        RQVP KTSKA+ ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL+ CFR+Q+C    ETT      +K ER+ D
Subjt:  RQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQEC---IETT------EKGERDGD

AT1G75090.1 DNA glycosylase superfamily protein | e value: 5.6e-65 | % identity: 44.69
Query:  PLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAES
        P+K +++      S   R  ++  + +T  P +  +  +  A      N S S+D SS S +S   S+  T   G     K ++  KR    VEK+   +
Subjt:  PLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAES

Query:  VVAATNTVGCLEPK-----KRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPG
        VVA+   V  + PK     KRC W+T N+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL++R  FR+ F +FDP+A+++  EK++++  
Subjt:  VVAATNTVGCLEPK-----KRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPG

Query:  SAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCF
             +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP  + +RY RQVP K+ KA+ ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CF
Subjt:  SAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCF

Query:  RFQECIETTEK
        R+QEC   TE+
Subjt:  RFQECIETTEK

AT1G80850.1 DNA glycosylase superfamily protein | e value: 1.1e-89 | % identity: 52.68
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSD--
        MS PPR+RS++ +D + R VLGP GNK +         KPL K  K     ++K+       PQC    P +LR+         +SM AS SSDASS   
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSD--

Query:  ----SFNSRASSARGTRQRGPNLRRKSSSTVKR--AEKAVEKVGAESVVAATNTVGCL-EPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCL
            S  S +S  R  R+ G      SSS+++R   E+  EK              C  + +KRCAW+T  +D CY AFHDEEWGVPVHDDK+LFELL L
Subjt:  ----SFNSRASSARGTRQRGPNLRRKSSSTVKR--AEKAVEKVGAESVVAATNTVGCL-EPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCL

Query:  SGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPD
        SGALAEL+W  IL KR LFRE F+DFDP A+S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK+I  FGSF+ Y+WNFVN KPT SQFRYPRQVP 
Subjt:  SGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPD

Query:  KTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKG
        KTSKA++ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+   E G
Subjt:  KTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEKG

AT5G57970.1 DNA glycosylase superfamily protein | e value: 9.5e-97 | % identity: 54.21
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPP---------QCVTTVPSVLRQQDRHQAIL--TLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K+  K L+KLE+        D++   + P         +      S+LR   RH+  L   LS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPP---------QCVTTVPSVLRQQDRHQAIL--TLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R KS  +  R+          S  A  +     E KKRC WVT N+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQV
         LSGALAE TWPTIL KR  FRE F DFDPNA+ K+NEKK++ PGS A++LLS+ K+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K  +S+FRY RQV
Subjt:  CLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQV

Query:  PDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEK
        P KT KA+VISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   E+
Subjt:  PDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEK

AT5G57970.2 DNA glycosylase superfamily protein | e value: 9.5e-97 | % identity: 54.21
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPP---------QCVTTVPSVLRQQDRHQAIL--TLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K+  K L+KLE+        D++   + P         +      S+LR   RH+  L   LS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPP---------QCVTTVPSVLRQQDRHQAIL--TLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R KS  +  R+          S  A  +     E KKRC WVT N+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQV
         LSGALAE TWPTIL KR  FRE F DFDPNA+ K+NEKK++ PGS A++LLS+ K+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K  +S+FRY RQV
Subjt:  CLSGALAELTWPTILKKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQV

Query:  PDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEK
        P KT KA+VISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   E+
Subjt:  PDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVSCFRFQECIETTEK


Sequences
CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTACCGGGAACAAAGCACGAACTGTAGAGACTAGAAAATCCGG
GGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCACCAAGAAGCTGAATCAAAGGACAAGAGGGTGCCATTGTCGCCGCCTCAATGTGTTACTACAGTGCCATCGGTTTTGA
GGCAACAGGACCGTCACCAGGCGATTCTCACCCTCTCGATGAATGCATCGTGTTCTTCTGATGCATCGTCTGATTCGTTTAATAGTCGAGCATCTAGTGCTAGAGGTACG
AGGCAGCGCGGTCCGAATTTGAGGAGAAAGTCTAGTAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTGCTGAAAGTGTGGTGGCGGCGACGAATACAGT
CGGTTGCTTAGAACCCAAAAAACGATGCGCTTGGGTAACTTCTAATACAGATCCATGTTATGCTGCTTTTCATGATGAAGAATGGGGAGTTCCAGTTCACGATGATAAAA
AATTGTTCGAACTGCTTTGCCTATCGGGTGCTTTAGCTGAACTTACGTGGCCTACCATCCTCAAAAAAAGACATCTATTTAGGGAAACCTTCTTGGACTTTGATCCAAAT
GCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCACCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACCCAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTG
CAAGGTAATTGATGAATTTGGATCCTTCAACGTGTACGTTTGGAACTTTGTCAACCACAAACCTACCATCAGTCAATTCCGATATCCCCGGCAGGTTCCTGATAAGACGT
CGAAAGCAGATGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGTGTGGGACCAACAGTCATCTACACATTCATGCAGGTGGCCGGGTTAACCAACGACCATCTC
GTCAGTTGCTTTAGATTCCAAGAATGTATCGAGACAACAGAGAAAGGAGAAAGAGATGGTGACATCAAGCCTACTATCATCGAGAAAATACCAGAGGCTCTGAAAAACTT
GGAACTATAA
mRNA sequence
GTTCCACTTTCCCTTATTTTCTTTCTCCCTCTCTCTTTCTCTCTCTAAATTCTCTCTCTTAGTTTCCCGTTTGATTTTCATACTCGCAAACACAATCATGGCCGTTCCGA
ATTTCTCATCATCCACTTCCAGATAAGTTTTTCGCTCTCTGAGCTTTTTTTTGTTCATCAATGTGGCTCCATTTCTCGGCCCCACCTAGGGTTCTGTCTCTTCCCTTCTC
ACAGTTCACTCCCCAACCCTACTGTTCAACTTCAAACCCATTTCCACCTTGTTGCCTCCAATGAACTAGTGGGATTCTCTTGTTTTTGCTTCTTTAATCTGAATCCTGGT
TGATTTTTACGCCGCCCCAATTTTCGCTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTACCGGGAACA
AAGCACGAACTGTAGAGACTAGAAAATCCGGGGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCACCAAGAAGCTGAATCAAAGGACAAGAGGGTGCCATTGTCGCCGCCT
CAATGTGTTACTACAGTGCCATCGGTTTTGAGGCAACAGGACCGTCACCAGGCGATTCTCACCCTCTCGATGAATGCATCGTGTTCTTCTGATGCATCGTCTGATTCGTT
TAATAGTCGAGCATCTAGTGCTAGAGGTACGAGGCAGCGCGGTCCGAATTTGAGGAGAAAGTCTAGTAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTG
CTGAAAGTGTGGTGGCGGCGACGAATACAGTCGGTTGCTTAGAACCCAAAAAACGATGCGCTTGGGTAACTTCTAATACAGATCCATGTTATGCTGCTTTTCATGATGAA
GAATGGGGAGTTCCAGTTCACGATGATAAAAAATTGTTCGAACTGCTTTGCCTATCGGGTGCTTTAGCTGAACTTACGTGGCCTACCATCCTCAAAAAAAGACATCTATT
TAGGGAAACCTTCTTGGACTTTGATCCAAATGCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCACCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACCCAAGGTGC
GAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGATCCTTCAACGTGTACGTTTGGAACTTTGTCAACCACAAACCTACCATCAGTCAATTC
CGATATCCCCGGCAGGTTCCTGATAAGACGTCGAAAGCAGATGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGTGTGGGACCAACAGTCATCTACACATTCAT
GCAGGTGGCCGGGTTAACCAACGACCATCTCGTCAGTTGCTTTAGATTCCAAGAATGTATCGAGACAACAGAGAAAGGAGAAAGAGATGGTGACATCAAGCCTACTATCA
TCGAGAAAATACCAGAGGCTCTGAAAAACTTGGAACTATAAAGAAGAAGCCATGGTCGTCCTTGAACCTTGCCTCAGTGTAATTAACTTCCAGAGTTCTTTTCTTCTGCC
TTTTTTGTAATGTCCTGTAAATTCCATGATGGGGTAAATGTTAGCAATATTTTGTGTATAAACTGACTTGGATACAGAAGACAGCTAGAATCAGTTCTGTGACGTTGGGA
TCTTGATCGTCTCGTATGTGGCACTGCACGATAATGTTGAAGGTTGAAGGTAGGCCAAATGGCTGTTTGATTGATATTCAAATGTTGTGAATGTGAAAATTTAAGCCTTA
ATTTGTTTTGTATAGAAAATTGGCAATGAAGCAATTGTTTATCCTATTCTATTATTGTTCCTTTCCTCAACTTATTTGCAAAAGAATAAAGGAGAGCCTCTGTTATGAGT
CTTCTTTTGTAGGTGGAAAGTAA
Protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKSGVKPLKKLEKPHQEAESKDKRVPLSPPQCVTTVPSVLRQQDRHQAILTLSMNASCSSDASSDSFNSRASSARGT
RQRGPNLRRKSSSTVKRAEKAVEKVGAESVVAATNTVGCLEPKKRCAWVTSNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPTILKKRHLFRETFLDFDPN
AVSKLNEKKMVAPGSAATSLLSEPKVRAIIENGRQMCKVIDEFGSFNVYVWNFVNHKPTISQFRYPRQVPDKTSKADVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL
VSCFRFQECIETTEKGERDGDIKPTIIEKIPEALKNLEL