; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G001690 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G001690
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationCmo_Chr09:764732..766318
RNA-Seq ExpressionCmoCh09G001690
SyntenyCmoCh09G001690
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591330.1 hypothetical protein SDJN03_13676, partial [Cucurbita argyrosperma subsp. sororia]1.8e-16482.38Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESV+RDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMK
        NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECD  K
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMK

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata]1.2e-17684.02Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
        NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS

XP_022976629.1 uncharacterized protein LOC111476975 [Cucurbita maxima]4.6e-16881.19Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESV+RDNISIGSSCSSDSLSSN S KLLNPK     VKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
        NKKPITNRFRYARQ+PVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLV+CFRYQECDGMKLRVEDQ SELLTGALETRS
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS

XP_023535246.1 uncharacterized protein LOC111796735 [Cucurbita pepo subsp. pepo]1.5e-17181.7Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESR ILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESV+RDN SIGSSCSSDSL S+YSTKLLNPKVKPCDVKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVT+TTP LSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIA+FTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
        NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVE QRSELLTGALETRS
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]5.8e-14771.79Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKL SHAKPVLESR ILGPGGNRDRAPEKPKCKQ+TLK +EKQN+ALP I ESV+RDN+S+GSSCSSDS+SSNYS KLL PKVKP  VKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGD N T  +P LS+PGKRC WIT +SDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KRDIFR                   KV NDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPS IAQFT+NEFTTLK N IQLLSEPKLRAIVENANQVLK                                           IQQEFG+FSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALE
        NKKPI N FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRYQECD       KLRVED+RSE LTGALE
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALE

TrEMBL top hitse value%identityAlignment
A0A6J1CHU1 uncharacterized protein LOC1110116231.9e-14369.82Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVA
        M VA K  SHAKPVLESRAILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SV+RDN+S+GSSCSSDSLSSNYS KLLNPKVKP  VKPVKAVA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVA

Query:  AGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFND
        AG + + TTT+PR SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FR                   KVFND
Subjt:  AGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFND

Query:  FDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSF
        FDPS+IA+FT+NEF TLK NGIQ+L+EPKLRAIVENANQVLK                                           IQQEFG+FSNYCWSF
Subjt:  FDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSF

Query:  VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALE
        VNKKPI NRFRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECD      MK RVED R EL  GA E
Subjt:  VNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALE

A0A6J1F6S0 uncharacterized protein LOC1114426745.8e-17784.02Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
        NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS

A0A6J1FIT9 uncharacterized protein LOC1114461257.9e-14269.72Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESVVRDN+S+GSSCSSDSLSSNYS KLLN K KP   KPVK VAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGD N TTT+P LSV GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPS+IA FT+ EFTTLK N  QLLS+ KLRAIVENANQVLK                                           IQQEFG+FSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALETRS
        NKKPI NR+RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRYQECD      MKLRVE++RSELL  ALE  S
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALETRS

A0A6J1IHE9 uncharacterized protein LOC1114769752.2e-16881.19Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESV+RDNISIGSSCSSDSLSSN S KLLNPK     VKPVKAVAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFR                   KVFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLK                                           IQQEFGTFSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS
        NKKPITNRFRYARQ+PVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLV+CFRYQECDGMKLRVEDQ SELLTGALETRS
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS

A0A6J1J188 uncharacterized protein LOC1114804121.8e-14169.72Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESVVRDNIS+GSSCSSDSLSSNYS KLLN K KP   KPVK VAA
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAA

Query:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF
        GGD N TTT+P L V GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFR+                   VFNDF
Subjt:  GGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDF

Query:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV
        DPS+IAQFT+ EFTTLK N  QLLS+ KLRAIVENANQVLK                                           IQQEFG+FSNYCWSFV
Subjt:  DPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFV

Query:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALETRS
        NKKPI NR+RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRYQECD      MKLRVE++RSELL  ALE  S
Subjt:  NKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDG-----MKLRVEDQRSELLTGALETRS

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.0e-2931.02Show/hide
Query:  KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLK
        +RCGW++   DPLYIA+HD EWGVP  D +KLFE++ L    A L+W  +L KR+ +R+ FH                    FDP  +A   + +   L 
Subjt:  KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLK

Query:  ENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPV
        ++   +    K++AI+ NA   L                                           Q++Q    F ++ WSFVN +P   +     ++P 
Subjt:  ENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPV

Query:  KTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
         T  ++ +SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C  Y
Subjt:  KTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase1.2e-2531.54Show/hide
Query:  RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKE
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L KR+ +R  FH                    FDP  IA+ T  +     +
Subjt:  RCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKE

Query:  NGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVK
        N   +    KL AIV+NA   L                                            +++    FS++ WSFVN KPI N     R VP K
Subjt:  NGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVK

Query:  TPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC
        T  ++ +SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Subjt:  TPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.1e-3132.68Show/hide
Query:  RCGWITPYSD---PLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTT
        RC W T   +    LY  +HD EWG P+H+D+KLFE LVL    A L+W  IL KR+ FR                     F+DFDP  +A + +++   
Subjt:  RCGWITPYSD---PLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTT

Query:  LKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQV
        L  N   + +  K+ A + NA    K F+                                        +Q+EFG+F  Y W FV  KPI N F     +
Subjt:  LKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQV

Query:  PVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMK
        P  TP ++ ++KDL +RGF+ VG T +Y+ MQ  G+VNDHL +CF+     GM+
Subjt:  PVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMK

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein6.8e-6137.63Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNR-DRAP-----EKPKCKQETLKNSEKQNK--ALPAIPESVVRDNISIGSSC---SSDSLSSNYSTKLLNPKVKP
        MSV  +  S      E R++LGP GN+  R P     EKP  ++  + + +++ K    PA P + ++   S+ SS    +S S++++YS+   +     
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNR-DRAP-----EKPKCKQETLKNSEKQNK--ALPAIPESVVRDNISIGSSC---SSDSLSSNYSTKLLNPKVKP

Query:  CDVKPVKAVAAGGDPNV------TTTTPRLSV--------------PGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILC
        C+  P+   ++     V       ++T +LSV                KRC WITP +DP Y+AFHDEEWGVPVHDD+KLFELL LS ALAEL+W  IL 
Subjt:  CDVKPVKAVAAGGDPNV------TTTTPRLSV--------------PGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILC

Query:  KRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRL
        +R I R                   +VF DFDP  +A+    + T      I LLSE K+R+I++N+  V K                            
Subjt:  KRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRL

Query:  KAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC
                       I  E G+   Y W+FVN KP  ++FRY RQVPVKT KAEF+SKDL+RRGFR V PTV+YSFMQ  G+ NDHL+ CFRYQ+C
Subjt:  KAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC

AT1G75090.1 DNA glycosylase superfamily protein6.1e-7846.07Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKAL--PAIPESVVRDNISIGSSCSS-DSLSSNYSTKLLNPKVKPCDVKPVKA
        MS+ +KL S  KP+ ESRAIL   GNR +  +    K+  L     ++ A   P    SV  D+ S  SS S   S+++  S K+  P  K   V+ +  
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKAL--PAIPESVVRDNISIGSSCSS-DSLSSNYSTKLLNPKVKPCDVKPVKA

Query:  VAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVF
        V A     V   +P++  P KRC WITP SDP+Y+ FHDEEWGVPV DD+KLFELLV SQALAE +WP IL +RD FR                   K+F
Subjt:  VAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVF

Query:  NDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCW
         +FDPS IAQFT+    +L+ NG  +LSE KLRAIVENA  VLK                                           ++QEFG+FSNYCW
Subjt:  NDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCW

Query:  SFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECD
         FVN KP+ N +RY RQVPVK+PKAE++SKD+++RGFRCVGPTV+YSF+Q +GIVNDHL  CFRYQEC+
Subjt:  SFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECD

AT1G80850.1 DNA glycosylase superfamily protein8.9e-6139.63Show/hide
Query:  MSVATKLHSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCD--
        MS   ++ S      E R++LGP GN+       +  +KP   K + L  +EK  +  P  P  + R+ IS+ +S SSD+ SS  S+ L           
Subjt:  MSVATKLHSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCD--

Query:  VKPVKAVAAGGDPNVTTTTPRLSVPG-------KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKL
        ++   +V++        T  R            KRC WITP SD  YIAFHDEEWGVPVHDD++LFELL LS ALAEL+W  IL KR +FR         
Subjt:  VKPVKAVAAGGDPNVTTTTPRLSVPG-------KRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKL

Query:  EYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQ
                  +VF DFDP  I++ T  + T+ +     LLSE KLR+I+ENANQV K                                           
Subjt:  EYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQ

Query:  IQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC
        I   FG+F  Y W+FVN+KP  ++FRY RQVPVKT KAE +SKDL+RRGFR V PTV+YSFMQ  G+ NDHL  CFR+ +C
Subjt:  IQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC

AT5G57970.1 DNA glycosylase superfamily protein1.5e-6043.55Show/hide
Query:  NISIGSSCSSDSLSSNYSTKLL------NPKVKPCDVKPVKAVAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVL
        N S  S  S DS  S  ST  L        + K    KP   V+ G       + P  S   KRC W+TP SDP YI FHDEEWGVPVHDD++LFELLVL
Subjt:  NISIGSSCSSDSLSSNYSTKLL------NPKVKPCDVKPVKAVAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVL

Query:  SQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKN
        S ALAE TWP IL KR  FR                   +VF DFDP+ I +  + +          LLS+ KLRA++ENA Q+LK              
Subjt:  SQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKN

Query:  RRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDH
                                     + +E+G+F  Y WSFV  K I ++FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ  GI NDH
Subjt:  RRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDH

Query:  LVNCFRYQEC
        L +CFR+  C
Subjt:  LVNCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein1.5e-6043.55Show/hide
Query:  NISIGSSCSSDSLSSNYSTKLL------NPKVKPCDVKPVKAVAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVL
        N S  S  S DS  S  ST  L        + K    KP   V+ G       + P  S   KRC W+TP SDP YI FHDEEWGVPVHDD++LFELLVL
Subjt:  NISIGSSCSSDSLSSNYSTKLL------NPKVKPCDVKPVKAVAAGGDPNVTTTTPRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVL

Query:  SQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKN
        S ALAE TWP IL KR  FR                   +VF DFDP+ I +  + +          LLS+ KLRA++ENA Q+LK              
Subjt:  SQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENGIQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKN

Query:  RRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDH
                                     + +E+G+F  Y WSFV  K I ++FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ  GI NDH
Subjt:  RRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLLRRGFRCVGPTVVYSFMQVTGIVNDH

Query:  LVNCFRYQEC
        L +CFR+  C
Subjt:  LVNCFRYQEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTGGCTACGAAGCTCCATTCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGTAA
ACAGGAGACTTTGAAGAACTCAGAGAAGCAGAACAAGGCGCTTCCGGCGATTCCTGAATCGGTTGTTCGAGACAATATCTCCATCGGGAGCTCCTGCTCATCCGATTCTT
TATCAAGCAACTATTCGACCAAATTGTTGAATCCTAAAGTGAAGCCCTGCGATGTGAAGCCTGTGAAGGCTGTTGCTGCCGGAGGTGATCCAAACGTAACCACAACGACG
CCTAGGCTCTCGGTTCCGGGGAAACGCTGTGGTTGGATAACGCCTTATTCTGACCCCCTTTACATCGCTTTCCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAG
GAAGCTGTTCGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTTTGCAAGAGAGATATATTTAGGTCTACTTTTCATTTTTGCCCAAAAC
TTGAATACAAATCAGTCCCTAAAAGAACATTGAAGGTTTTTAATGATTTTGACCCATCTACCATCGCACAGTTCACACAGAATGAGTTTACGACACTAAAAGAAAATGGC
ATCCAGCTCCTATCTGAACCAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGCACTTCCTATTTTTTGTTAAGCCAAAGCCAACCCATAAGAACAGGCG
TTTGGTTGTCACAACTTCCATCCCTTTCAAGCGGTTGAAAGCTGAATCTTTACGTTTCCATTGCCTTTTGACTTCTCCACAGATTCAACAGGAATTTGGTACCTTCAGCA
ACTATTGTTGGAGCTTTGTCAACAAGAAGCCTATAACAAACAGATTTCGATATGCCCGTCAAGTACCAGTAAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATTTGCTA
AGAAGAGGCTTTCGTTGTGTTGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTACCGGAATTGTTAACGATCACTTAGTCAATTGCTTCAGATATCAAGAGTGCGATGG
TATGAAACTAAGAGTAGAAGATCAGCGATCGGAGTTGCTCACCGGAGCTTTGGAGACTAGATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTGGCTACGAAGCTCCATTCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGTAA
ACAGGAGACTTTGAAGAACTCAGAGAAGCAGAACAAGGCGCTTCCGGCGATTCCTGAATCGGTTGTTCGAGACAATATCTCCATCGGGAGCTCCTGCTCATCCGATTCTT
TATCAAGCAACTATTCGACCAAATTGTTGAATCCTAAAGTGAAGCCCTGCGATGTGAAGCCTGTGAAGGCTGTTGCTGCCGGAGGTGATCCAAACGTAACCACAACGACG
CCTAGGCTCTCGGTTCCGGGGAAACGCTGTGGTTGGATAACGCCTTATTCTGACCCCCTTTACATCGCTTTCCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAG
GAAGCTGTTCGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTTTGCAAGAGAGATATATTTAGGTCTACTTTTCATTTTTGCCCAAAAC
TTGAATACAAATCAGTCCCTAAAAGAACATTGAAGGTTTTTAATGATTTTGACCCATCTACCATCGCACAGTTCACACAGAATGAGTTTACGACACTAAAAGAAAATGGC
ATCCAGCTCCTATCTGAACCAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGCACTTCCTATTTTTTGTTAAGCCAAAGCCAACCCATAAGAACAGGCG
TTTGGTTGTCACAACTTCCATCCCTTTCAAGCGGTTGAAAGCTGAATCTTTACGTTTCCATTGCCTTTTGACTTCTCCACAGATTCAACAGGAATTTGGTACCTTCAGCA
ACTATTGTTGGAGCTTTGTCAACAAGAAGCCTATAACAAACAGATTTCGATATGCCCGTCAAGTACCAGTAAAGACGCCGAAAGCAGAGTTCATGAGCAAGGATTTGCTA
AGAAGAGGCTTTCGTTGTGTTGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTACCGGAATTGTTAACGATCACTTAGTCAATTGCTTCAGATATCAAGAGTGCGATGG
TATGAAACTAAGAGTAGAAGATCAGCGATCGGAGTTGCTCACCGGAGCTTTGGAGACTAGATCTTGA
Protein sequenceShow/hide protein sequence
MSVATKLHSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKNSEKQNKALPAIPESVVRDNISIGSSCSSDSLSSNYSTKLLNPKVKPCDVKPVKAVAAGGDPNVTTTT
PRLSVPGKRCGWITPYSDPLYIAFHDEEWGVPVHDDRKLFELLVLSQALAELTWPLILCKRDIFRSTFHFCPKLEYKSVPKRTLKVFNDFDPSTIAQFTQNEFTTLKENG
IQLLSEPKLRAIVENANQVLKHFLFFVKPKPTHKNRRLVVTTSIPFKRLKAESLRFHCLLTSPQIQQEFGTFSNYCWSFVNKKPITNRFRYARQVPVKTPKAEFMSKDLL
RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDGMKLRVEDQRSELLTGALETRS