CuGenDBv2

CmoCh01G019530 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G019530
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DNA glycosylase superfamily protein
Genome location: Cmo_Chr01:13896146..13898959
RNA-Seq Expression: CmoCh01G019530
Synteny: CmoCh01G019530
Gene Ontology terms:
  GO:0006284 - base-excision repair (biological process)
  GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
  IPR005019 - Methyladenine glycosylase
  IPR011257 - DNA glycosylase
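For scripted use of this record, the genome location string can be split into its parts. The following is a minimal sketch, assuming the `chromosome:start..end` layout shown in this record and assuming 1-based inclusive coordinates (the page does not state the convention). `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("Cmo_Chr01:13896146..13898959")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2814 bp on Cmo_Chr01.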


Homology
GenBank top hits | e value | % identity | Alignment
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia] | 4.5e-166 | 79.12
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KVFNDFDPSSIA FTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVK+DMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata] | 1.4e-167 | 79.61
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

XP_022981194.1 uncharacterized protein LOC111480412 [Cucurbita maxima] | 2.9e-165 | 78.62
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDN+SVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGL VAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR+                           
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                     VFNDFDPSSIA FTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] | 1.2e-166 | 79.36
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KVFNDFDPSSIA FTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | 1.8e-138 | 66.99
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA
        MSVATKLQSHA+PVLESR ILGPGGNRDRAPEKPKCKQ+ LK+T KQN+ALP++SESV+RDNVSVGSSCSSDS+SSNYSAKLL  K KP   KPVK VAA
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA

Query:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDI
        GGD NAT  SP LS+ GKRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKR IFR                         
Subjt:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDI

Query:  DGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSG
                       KV NDFDPS+IA FTE EFTTLKVNA QLLS+ KLRAIVENANQVLK+                         FG          
Subjt:  DGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSG

Query:  SKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLI
              +NYCWSFVNKKPIRN +RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRYQECDA +KDD KLRVE++RSE L 
Subjt:  SKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLI

Query:  RALEKSSLT
         ALEK  LT
Subjt:  RALEKSSLT

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LG22 Uncharacterized protein | 8.6e-131 | 65.31
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSH +P LE RAILGPGGNRDRAP+ PKCK E LK+T KQ+KALP +SESV+RDNVSVGSSCSSDSLSSNYSAKLL    KP  VK V+AGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        +NATTTSP LS+ GKRCDWIT +SDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPLILSKR +FR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KV NDFDPSSIA FTE EFTTLKVN  QLLS+ KLRAIV+NANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRR
           +NYCWSFVNKKPIRNR+RY RQVPVKTPKAEFMSKD+++RGFRCVGPTVVYSF+QV+GIVNDHLV CFRY+ECD  VKDD KLRVE++R
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRR

A0A6J1F6S0 uncharacterized protein LOC111442674 | 4.9e-134 | 65.85
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESVVRDN+S+GSSCSSDSLSSNYS KLLN K KP   KPVK VAA
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA

Query:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDI
        GGD N TTT+P LSV GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFR                         
Subjt:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDI

Query:  DGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSG
                       KVFNDFDPS+IA FT+ EFTTLK N  QLLS+ KLRAIVENANQVLK+                         FG          
Subjt:  DGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSG

Query:  SKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLI
              +NYCWSFVNKKPI NR+RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRYQECD      MKLRVE++RSELL 
Subjt:  SKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLI

Query:  RALEKSS
         ALE  S
Subjt:  RALEKSS

A0A6J1FIT9 uncharacterized protein LOC111446125 | 6.8e-168 | 79.61
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

A0A6J1IHE9 uncharacterized protein LOC111476975 | 1.0e-131 | 65.35
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESV+RDN+S+GSSCSSDSLSSN SAKLLN   K KPVK VAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
         N TTT+P LSV GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFR                            
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                    KVFNDFDPS+IA FT+ EFTTLK N  QLLS+ KLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPI NR+RY RQ+PVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLVDCFRYQECD      MKLRVE++ SELL  AL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSS
        E  S
Subjt:  EKSS

A0A6J1J188 uncharacterized protein LOC111480412 | 1.4e-165 | 78.62
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDN+SVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC
        ANATTTSPGL VAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR+                           
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGC

Query:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL
                     VFNDFDPSSIA FTEAEFTTLKVNATQLLSDQKLRAIVENANQVLK+                         FG             
Subjt:  PSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKL

Query:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
           +NYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL
Subjt:  NLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRAL

Query:  EKSSLTT
        EKSSLTT
Subjt:  EKSSLTT

SwissProt top hits | e value | % identity | Alignment
P05100 DNA-3-methyladenine glycosylase 1 | 1.4e-24 | 29.73
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKV
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR  +R+                                         
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKV

Query:  FNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKK
        F+ FDP  +A   E +   L  +A  +    K++AI+ NA   L++                             N  PF          ++ WSFVN +
Subjt:  FNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKK

Query:  PIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
        P   +     ++P  T  ++ +SK L KRGF+ VG T+ YSF+Q  G+VNDH+V C  Y
Subjt:  PIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

P44321 DNA-3-methyladenine glycosylase | 1.1e-21 | 30.98
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVF
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L KR  +R                                        + F
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVF

Query:  NDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKP
        + FDP  IA  T  +      N+  +    KL AIV+NA    K Y  + K     G                 NF            +++ WSFVN KP
Subjt:  NDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKP

Query:  IRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDC
        I N     R VP KT  ++ +SK L KRGF  +G T  Y+F+Q  G+V+DHL DC
Subjt:  IRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | 1.4e-24 | 29.85
Query:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHL
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL KR  FR +                                      
Subjt:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHL

Query:  KVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVN
          F+DFDP  +A++ E +   L  N   + +  K+ A + NA   + V                         FG  +               Y W FV 
Subjt:  KVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVN

Query:  KKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASV
         KPI N +     +P  TP ++ ++KDL KRGF+ VG T +Y+ +Q  G+VNDHL  CF+   C++S+
Subjt:  KKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASV

Arabidopsis top hits | e value | % identity | Alignment
AT1G15970.1 DNA glycosylase superfamily protein | 7.4e-58 | 35.76
Query:  MSVATKLQSHAEPVLESRAILGPGGNR-DRAPEKPKCKQEILKRTV---KQNKA---------------LPVVSESVVRDN-VSVGSSCSSDSLSSNYSA
        MSV  + +S      E R++LGP GN+  R P   K ++ ++++T+   K  KA                  +  S++R N  S+ +S SSD+ SS+  +
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNR-DRAPEKPKCKQEILKRTV---KQNKA---------------LPVVSESVVRDN-VSVGSSCSSDSLSSNYSA

Query:  KLLNLKAKPKPVKTVAAGGDANAT-----------TTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRH
          L++ +     K V   G  ++T            +    +   KRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS+RH
Subjt:  KLLNLKAKPKPVKTVAAGGDANAT-----------TTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRH

Query:  IFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYL
        I R                                        +VF DFDP ++A   + + T     A  LLS+ K+R+I++N+  V K+         
Subjt:  IFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYL

Query:  FPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVD
          GS+ K                             Y W+FVN KP ++++RY RQVPVKT KAEF+SKDL++RGFR V PTV+YSF+Q +G+ NDHL+ 
Subjt:  FPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVD

Query:  CFRYQEC--DASVKDDMKLRVENRR
        CFRYQ+C  DA      K + +N R
Subjt:  CFRYQEC--DASVKDDMKLRVENRR

AT1G75090.1 DNA glycosylase superfamily protein | 2.1e-73 | 43.44
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKAL--PVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTV--A
        MS+ +KL+S  +P+ ESRAIL   GNR +  +    K+  L   V ++ A   P  + SV  D+ S  SS S  S  +  ++  +   +K   V+ +   
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKAL--PVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTV--A

Query:  AGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVD
            A     SP +    KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WP IL +R  FR                        
Subjt:  AGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVD

Query:  IDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPS
                        K+F +FDPS+IA FTE    +L+VN   +LS+QKLRAIVENA  VLKV                         FG         
Subjt:  IDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPS

Query:  GSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMK
               +NYCW FVN KP+RN YRYGRQVPVK+PKAE++SKD+M+RGFRCVGPTV+YSFLQ SGIVNDHL  CFRYQEC+   + + K
Subjt:  GSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMK

AT1G80850.1 DNA glycosylase superfamily protein | 8.7e-59 | 37.37
Query:  MSVATKLQSHAEPVLESRAILGPGGNR------DRAPEKPKC-KQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVK
        MS   +++S      E R++LGP GN+       +  +KP   K + L  T K  +  P+    + R+ +S+ +S SSD+ SS+  +  L++ +     +
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNR------DRAPEKPKC-KQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVK

Query:  TVAAGGDANATTT-------------SPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFY
         +   G  +++++             S       KRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +FR        
Subjt:  TVAAGGDANATTT-------------SPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFY

Query:  FPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPS
                                        +VF DFDP +I+  T  + T+ ++ AT LLS+QKLR+I+ENANQV K+                    
Subjt:  FPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPS

Query:  ITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
          I AFG  +               Y W+FVN+KP ++++RY RQVPVKT KAE +SKDL++RGFR V PTV+YSF+Q +G+ NDHL  CFR+ +C
Subjt:  ITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.1 DNA glycosylase superfamily protein | 1.8e-56 | 42.11
Query:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS
        N S  S  S DS  S  S   L          K+ P   ++V + G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVLS
Subjt:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS

Query:  QALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVEN
         ALAE TWP ILSKR  FR                                        +VF DFDP++I    E +       A+ LLSD KLRA++EN
Subjt:  QALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVEN

Query:  ANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVV
        A Q+LKV                      I  +G  +               Y WSFV  K I +++RY RQVP KTPKAE +SKDL++RGFR VGPTVV
Subjt:  ANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVV

Query:  YSFLQVSGIVNDHLVDCFRYQEC
        YSF+Q +GI NDHL  CFR+  C
Subjt:  YSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein | 1.8e-56 | 42.11
Query:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS
        N S  S  S DS  S  S   L          K+ P   ++V + G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVLS
Subjt:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS

Query:  QALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVEN
         ALAE TWP ILSKR  FR                                        +VF DFDP++I    E +       A+ LLSD KLRA++EN
Subjt:  QALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVEN

Query:  ANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVV
        A Q+LKV                      I  +G  +               Y WSFV  K I +++RY RQVP KTPKAE +SKDL++RGFR VGPTVV
Subjt:  ANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVV

Query:  YSFLQVSGIVNDHLVDCFRYQEC
        YSF+Q +GI NDHL  CFR+  C
Subjt:  YSFLQVSGIVNDHLVDCFRYQEC
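The % identity values listed for each hit can be related to the alignment blocks. In BLAST-style pairwise output, the middle line repeats the residue where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. The following is a minimal sketch, assuming that convention and assuming identity is computed as identical positions over alignment length (the page does not say how its values were derived). `midline_stats` is a hypothetical helper.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for a BLAST-style middle line.

    The middle line must keep its full width, including leading and
    trailing spaces, or the length will be undercounted.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Final block of the Benincasa hispida alignment (query segment 'RALEKSSLT').
identities, positives, length = midline_stats(" ALEK  LT")
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
```

To apply this to a whole hit, the middle lines of all its blocks would be concatenated first. Whitespace in a scraped page may have been trimmed, so counts taken from this text are only approximate.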


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCATGCTGAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCGGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCAAAATGCAA
ACAGGAGATCTTGAAGAGGACAGTGAAGCAGAATAAGGCGCTTCCAGTGGTTTCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAACTATTCGGCCAAATTGTTGAATCTGAAAGCGAAGCCCAAGCCTGTGAAGACTGTCGCTGCCGGCGGTGACGCTAACGCAACCACAACGTCGCCTGGGCTC
TCGGTTGCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATTGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAAGAAGCTGTT
TGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTCAGCAAGAGACACATTTTCAGGTCTTCTTCTCTTTTTTTCTTTTACTTTCCCCGTT
TCTATCGCTTCGTTCTTCAAATTGGAATCTGGGTCGATATTGATGGTTGCCCAAGCTCACCGTTAACATATATTGTCCTCCATTTGAAAGTGTTCAATGATTTTGACCCA
TCTTCCATCGCACATTTCACAGAAGCTGAGTTTACGACACTAAAAGTAAATGCCACGCAGCTCCTGTCTGATCAAAAGCTTCGTGCAATCGTGGAGAACGCTAACCAAGT
ACTCAAGGTATATTGTGTTTTGCTTAAGCTTTATCTCTTCCCTGGCTCTATTGTTAAGCTAAATCCATCCATAACAATACACGCGTTTGGTTGTCTCAACTTCCATCCCT
TTCCAAGTGGTTCAAAGCTGAATCTCTCTACCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATCCGAAACCGATATCGATATGGTCGTCAAGTACCGGTAAAGACT
CCTAAAGCGGAGTTCATGAGCAAGGATTTGATGAAGAGAGGATTCCGTTGTGTCGGGCCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACTT
GGTCGACTGCTTCAGGTATCAAGAGTGCGACGCAAGCGTCAAAGATGACATGAAATTAAGAGTAGAAAATCGGAGATCAGAGTTGCTTATTCGAGCTTTGGAGAAGTCTT
CCTTGACGACCTGA
mRNA sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCATGCTGAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCGGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCAAAATGCAA
ACAGGAGATCTTGAAGAGGACAGTGAAGCAGAATAAGGCGCTTCCAGTGGTTTCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAACTATTCGGCCAAATTGTTGAATCTGAAAGCGAAGCCCAAGCCTGTGAAGACTGTCGCTGCCGGCGGTGACGCTAACGCAACCACAACGTCGCCTGGGCTC
TCGGTTGCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATTGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAAGAAGCTGTT
TGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATTCTCAGCAAGAGACACATTTTCAGGTCTTCTTCTCTTTTTTTCTTTTACTTTCCCCGTT
TCTATCGCTTCGTTCTTCAAATTGGAATCTGGGTCGATATTGATGGTTGCCCAAGCTCACCGTTAACATATATTGTCCTCCATTTGAAAGTGTTCAATGATTTTGACCCA
TCTTCCATCGCACATTTCACAGAAGCTGAGTTTACGACACTAAAAGTAAATGCCACGCAGCTCCTGTCTGATCAAAAGCTTCGTGCAATCGTGGAGAACGCTAACCAAGT
ACTCAAGGTATATTGTGTTTTGCTTAAGCTTTATCTCTTCCCTGGCTCTATTGTTAAGCTAAATCCATCCATAACAATACACGCGTTTGGTTGTCTCAACTTCCATCCCT
TTCCAAGTGGTTCAAAGCTGAATCTCTCTACCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATCCGAAACCGATATCGATATGGTCGTCAAGTACCGGTAAAGACT
CCTAAAGCGGAGTTCATGAGCAAGGATTTGATGAAGAGAGGATTCCGTTGTGTCGGGCCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACTT
GGTCGACTGCTTCAGGTATCAAGAGTGCGACGCAAGCGTCAAAGATGACATGAAATTAAGAGTAGAAAATCGGAGATCAGAGTTGCTTATTCGAGCTTTGGAGAAGTCTT
CCTTGACGACCTGA
Protein sequence
MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGDANATTTSPGL
SVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRSSSLFFFYFPRFYRFVLQIGIWVDIDGCPSSPLTYIVLHLKVFNDFDP
SSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKVYCVLLKLYLFPGSIVKLNPSITIHAFGCLNFHPFPSGSKLNLSTNYCWSFVNKKPIRNRYRYGRQVPVKT
PKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRALEKSSLTT
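As a consistency check, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and using only the first 33 and last 12 nucleotides of the CDS for brevity. `translate` is a hypothetical helper; an established library such as Biopython could be used instead.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGTCTGTGGCTACGAAGCTCCAATCGCATGCT"))  # first 33 nt of the CDS
print(translate("TTGACGACCTGA"))                       # last 12 nt of the CDS
```

The two fragments translate to MSVATKLQSHA and LTT, which agree with the start and end of the protein sequence in this record.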