; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G026670 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G026670
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationchr02:33153212..33156682
RNA-Seq ExpressionLsi02G026670
SyntenyLsi02G026670
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]2.8e-19494.88Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVEKVGVESVA V DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDP AVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKD-DEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLI CFRF ECIETQTAEKGE+D  E+KL  NEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKD-DEIKLTVNEKMPEALKNLEL

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]5.1e-19695.14Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVEKVGVESVA VADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRF ECIETQTAEKGE+D E+KL  NEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]1.3e-18391.62Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRV LSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV  V DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCKVIDEFGSF+V+IWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRF ECIE  TAE+GEKD EIK  +NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]5.5e-18290.03Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QE ESKDKRV LSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTW
        NSRASSARGTRQRGPNLRRK SS+VK A+KAVEKVG ESV AVA+TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTW

Query:  PSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P+IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNV++WNFVNHKP ISQFRYPRQVPDKTSKAEVIS
Subjt:  PSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGL+NDHL+SCFRF ECIE  T EKGE+D +IK T+ EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida]7.9e-19795.71Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVES---VAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAEL
        SRASSARGTRQRGPNLRRKQ+STVKGA K+VEKVGVES   VA VADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAEL
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVES---VAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAEL

Query:  TWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEV
        TWP+ILNKR+LFREIFLDFDPN VSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEV
Subjt:  TWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRFQECIETQTAEKGEKD EIKLTVNEKMPEALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein1.4e-19494.88Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVEKVGVESVA V DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDP AVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKD-DEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLI CFRF ECIETQTAEKGE+D  E+KL  NEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKD-DEIKLTVNEKMPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]2.5e-19695.14Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVEKVGVESVA VADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRF ECIETQTAEKGE+D E+KL  NEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase2.5e-19695.14Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRV LSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVKGADKAVEKVGVESVA VADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNV++WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRF ECIETQTAEKGE+D E+KL  NEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223416.3e-18491.62Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRV LSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV  V DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWP

Query:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        +ILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCKVIDEFGSF+V+IWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  SILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGL+NDHLISCFRF ECIE  TAE+GEKD EIK  +NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610813.5e-18289.76Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QE ESKDKRV LSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTW
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVEKVG ESV A  +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELL LSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTW

Query:  PSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P+IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNV++WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGL+NDHL+SCFRFQECIE  T EKGE+D +IK T+ EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.1e-3641.67Show/hide
Query:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++ L G  A L+W ++L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG

Query:  RQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISC
        R   ++      F  F+W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C
Subjt:  RQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISC

P44321 DNA-3-methyladenine glycosylase6.4e-3239.66Show/hide
Query:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE + L G  A L+W ++L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR

Query:  QMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISC
            +     +F+ FIW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]9.2e-3940.45Show/hide
Query:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKV
        +  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W +IL KR  FR  F DFDP+ V+  +E K+         + +  K+
Subjt:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKV

Query:  RAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRF-----QECIE
         A I N +    V  EFGSF+ +IW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+       +C +
Subjt:  RAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRF-----QECIE

Query:  TQTAEKGEKDDEIKLTVNEK
             +G       LT N K
Subjt:  TQTAEKGEKDDEIKLTVNEK

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein8.1e-9152.45Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQE---VESKDKRVS-----LSP----PQCVTVPSVLRQQDRHQAILNLSMNA
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E   ++SKD++        SP     QC ++ S + +++      + S +A
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQE---VESKDKRVS-----LSP----PQCVTVPSVLRQQDRHQAILNLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVK-GADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL
        S S ++S  S  S +S  +  R+ G       SST K    K  EKV  +  A         + +KRCAW+TP  DPCY AFHDEEWGVPVHDDKKLFEL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVK-GADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL

Query:  LSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQ
        L LSGALAEL+W  IL++RH+ RE+F+DFDP AV++LN+KK+ APG+AA SLLSE+K+R+I++N R + K+I E GS   ++WNFVN+KP  SQFRY RQ
Subjt:  LSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQ

Query:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQEC---IETQTAEKGEKDDE
        VP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGL+NDHLI CFR+Q+C    ET T  K +K +E
Subjt:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQEC---IETQTAEKGEKDDE

AT1G75090.1 DNA glycosylase superfamily protein6.3e-6743.87Show/hide
Query:  PLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVG--VE
        P+K +++ R  + S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G         T       VEK+   V 
Subjt:  PLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVG--VE

Query:  SVAAVADTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAA
        SVA V D    +    KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WPSIL +R  FR++F +FDP+A+++  EK++++     
Subjt:  SVAAVADTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAA

Query:  TSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQ
          +LSE K+RAI+EN + + KV  EFGSF+ + W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+Q
Subjt:  TSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQ

Query:  EC-IETQTAEKGEKDDEIKLTVNEKM
        EC +ET+   K   + E KL ++  +
Subjt:  EC-IETQTAEKGEKDDEIKLTVNEKM

AT1G80850.1 DNA glycosylase superfamily protein2.0e-8954.39Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSP--PQCVTV-PSVLRQQDRHQAILNLSMNASCSSDASSD
        MS PPR+RS++ +D + R VLGP GNK +    +KP  KP+KK       V  K K ++ +   PQC  + P +LR+         +SM AS SSDASS 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSP--PQCVTV-PSVLRQQDRHQAILNLSMNASCSSDASSD

Query:  SFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAEL
        S  S   S   T      LRR  S +V  +      +  E     +D     + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELLSLSGALAEL
Subjt:  SFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAEL

Query:  TWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEV
        +W  IL+KR LFRE+F+DFDP A+S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK+I  FGSF+ +IWNFVN KP  SQFRYPRQVP KTSKAE+
Subjt:  TWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI
        ISKDLV+RGFRSV PTVIY+FMQ AGL+NDHL  CFR  +C+
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI

AT5G57970.1 DNA glycosylase superfamily protein3.6e-9955.56Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++ S + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S  A+       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  SLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQV
         LSGALAE TWP+IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ +IW+FV +K I+S+FRY RQV
Subjt:  SLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG++NDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI

AT5G57970.2 DNA glycosylase superfamily protein3.6e-9955.56Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++ S + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S  A+       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  SLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQV
         LSGALAE TWP+IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ +IW+FV +K I+S+FRY RQV
Subjt:  SLSGALAELTWPSILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG++NDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLSNDHLISCFRFQECI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTTCCGAATTTTTCATCATCCACTTCCAGTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTAC
AGGGAACAAAGCGCGTATAGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGTCATTGT
CACCGCCTCAATGCGTAACAGTTCCATCGGTTTTGAGGCAACAGGACCGGCACCAGGCGATTCTCAATCTTTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGAT
TCGTTTAATAGCCGTGCATCTAGTGCAAGGGGTACGAGACAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAAGGGGCTGACAAGGCCGTTGAAAAGGT
TGGTGTTGAAAGTGTGGCGGCAGTGGCGGATACAGTTGGTTGCTTAGAATCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATG
ATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTTGAACTTCTTAGCCTATCAGGCGCATTGGCTGAACTTACATGGCCTTCCATCCTCAACAAAAGACAT
CTATTTAGGGAAATCTTTTTGGACTTCGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGTGCTGCTACATCTTTACTGTCAGAACTCAA
GGTTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTTCATTTGGAACTTTGTGAACCATAAACCTATCATCAGTC
AGTTCCGGTACCCACGTCAAGTCCCGGATAAGACGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACA
TTCATGCAGGTGGCCGGGTTAAGTAATGACCATCTCATCAGTTGCTTTAGGTTTCAAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAAAGATGATGAAATCAA
GCTTACTGTTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
GGAAAGTCGAACAAAATTTTGGTTCTCAACCCTCGAAAAATTTTCTCCCACTTTCTCATATTTTTCCATTTCTCTCTCTCTCTATTGTTTCTCTTTTGATTTTCATACTC
TCAAACACAATCATGGCCGTTCCGAATTTTTCATCATCCACTTCCAGTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGT
ACTTGGGCCTACAGGGAACAAAGCGCGTATAGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAA
GGGTGTCATTGTCACCGCCTCAATGCGTAACAGTTCCATCGGTTTTGAGGCAACAGGACCGGCACCAGGCGATTCTCAATCTTTCGATGAATGCCTCGTGTTCTTCTGAT
GCGTCGTCTGATTCGTTTAATAGCCGTGCATCTAGTGCAAGGGGTACGAGACAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAAGGGGCTGACAAGGC
CGTTGAAAAGGTTGGTGTTGAAAGTGTGGCGGCAGTGGCGGATACAGTTGGTTGCTTAGAATCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATG
CTGCTTTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTTGAACTTCTTAGCCTATCAGGCGCATTGGCTGAACTTACATGGCCTTCCATCCTC
AACAAAAGACATCTATTTAGGGAAATCTTTTTGGACTTCGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGTGCTGCTACATCTTTACT
GTCAGAACTCAAGGTTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTTCATTTGGAACTTTGTGAACCATAAAC
CTATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACA
GTCATCTATACATTCATGCAGGTGGCCGGGTTAAGTAATGACCATCTCATCAGTTGCTTTAGGTTTCAAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAAAGA
TGATGAAATCAAGCTTACTGTTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAAAAAAAACCCATTGGTAGCCTTGAACCTTGCCTCAGTGTAATTAGCT
TCCAGAG
Protein sequenceShow/hide protein sequence
MAVPNFSSSTSSEMSGPPRIRSMNVADSDSRPVLGPTGNKARIVETRKPGVKPLKKLEKPRQEVESKDKRVSLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSD
SFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVEKVGVESVAAVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLSLSGALAELTWPSILNKRH
LFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVFIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYT
FMQVAGLSNDHLISCFRFQECIETQTAEKGEKDDEIKLTVNEKMPEALKNLEL