; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg07386 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg07386
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionRING-CH-type domain-containing protein
Genome locationCarg_Chr01:1472727..1474486
RNA-Seq ExpressionCarg07386
SyntenyCarg07386
Gene Ontology termsGO:0016567 - protein ubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR022143 - Protein of unknown function DUF3675
IPR033275 - E3 ubiquitin-protein ligase MARCH-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606942.1 putative E3 ubiquitin ligase SUD1, partial [Cucurbita argyrosperma subsp. sororia]2.7e-9584.96Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERR
        RYRYRDQDSDDDEDDDISR     RR
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERR

KAG7036646.1 hypothetical protein SDJN02_00266, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-107100Show/hide
Query:  MGEAILYVDDFRFETSYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDSACSTTTD
        MGEAILYVDDFRFETSYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDSACSTTTD
Subjt:  MGEAILYVDDFRFETSYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDSACSTTTD

Query:  RGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRYRDQDSDDDEDDDISRFEDDERRLHH
        RGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRYRDQDSDDDEDDDISRFEDDERRLHH
Subjt:  RGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRYRDQDSDDDEDDDISRFEDDERRLHH

Query:  IV
        IV
Subjt:  IV

XP_022949158.1 uncharacterized protein LOC111452590 [Cucurbita moschata]1.5e-9883.98Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEK SIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQE AAEPASPPADDGASDS CSTTTDRGASYCKSVALTFTLVLL+RHFYDVIAVGAGDYPFTLATVLLLR SGIIFPMYVIIR ITTVQNSI RSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RYRYRDQDSDD+EDDDISRFEDDERRLHHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

XP_022998108.1 uncharacterized protein LOC111492856 [Cucurbita maxima]2.9e-9783.98Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEKGS VCEICLQNYEPGYTAPSKKPQLGDPGVSISDG QIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDV+AVGAGDYPFTLATVLLLRASGIIFPMYVIIR ITTVQ +IRRSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RYRYRDQDS DDEDDDIS FEDDERRLHHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

XP_023525490.1 uncharacterized protein LOC111789080 [Cucurbita pepo subsp. pepo]1.7e-10286.58Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDV+AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQN+IRRSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

TrEMBL top hitse value%identityAlignment
A0A0A0KT99 RING-CH-type domain-containing protein1.6e-7767.24Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGD-PGVSISDGAQI
        M E  +YV++F F+TSYD                             FAHRDCIQRWC+EKGS VCEICLQNYEPGYTAPSKKP   D P V++ DG +I
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGD-PGVSISDGAQI

Query:  PRSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSR
        PRSE EE AEPAS P DD AS SACSTT DRGAS CKSVALTFTLVLLVRHFYDV+AVG  DYPFTLATVL+LRASGIIFPMYVIIR +T +QNS+RR+R
Subjt:  PRSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSR

Query:  YRYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        Y+YRYR+ +  DD+DDDIS FEDD+RRLHHIV
Subjt:  YRYRYRDQDSDDDEDDDISRFEDDERRLHHIV

A0A1S4DZR9 uncharacterized protein LOC1034947379.4e-7868.38Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        M E  +YV++FRF+TSYD                             FAHRDCIQRWC+EKGS VCEICLQNYEPGYTAPSKKP   D GV+I DG +IP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSE EEAAEPASP   DGAS SACSTT +RGAS CKSVALTFTLVLLVRHFYDV+AVG  +YPFTLATVL+LRASGIIFPMYVIIR IT +QNS+RR+RY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYR---DQDSDDDEDDDISRFEDDERRLHHIV
        +Y+YR   D D DDDE+DDIS FEDD+RRLHHIV
Subjt:  RYRYR---DQDSDDDEDDDISRFEDDERRLHHIV

A0A6J1F9U4 uncharacterized protein LOC1114421376.1e-7765.37Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MG+  LYV++F+F+TSY                              FAHRDCIQRWCTEKGS VCEICLQNYEPGYT+PSKKPQ GDPGV+I D  QIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        R+E+EEAAEPASPPAD GASDS CSTTTDRGAS CKSVALTFTL+LL RHFY+V+ +    YPFTLATVL+LRASGIIFPMYVIIR I+ +QNS R++RY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RY+  ++   D +DDDISRFE+D+RR+HHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

A0A6J1GB95 uncharacterized protein LOC1114525907.4e-9983.98Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEK SIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQE AAEPASPPADDGASDS CSTTTDRGASYCKSVALTFTLVLL+RHFYDVIAVGAGDYPFTLATVLLLR SGIIFPMYVIIR ITTVQNSI RSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RYRYRDQDSDD+EDDDISRFEDDERRLHHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

A0A6J1KFW1 uncharacterized protein LOC1114928561.4e-9783.98Show/hide
Query:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP
        MGEAILYVDDFRFETSYD                             FAHRDCIQRWCTEKGS VCEICLQNYEPGYTAPSKKPQLGDPGVSISDG QIP
Subjt:  MGEAILYVDDFRFETSYD-----------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIP

Query:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY
        RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDV+AVGAGDYPFTLATVLLLRASGIIFPMYVIIR ITTVQ +IRRSRY
Subjt:  RSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRY

Query:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV
        RYRYRDQDS DDEDDDIS FEDDERRLHHIV
Subjt:  RYRYRDQDSDDDEDDDISRFEDDERRLHHIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02610.1 RING/FYVE/PHD zinc finger superfamily protein5.9e-3236.16Show/hide
Query:  MGEAILYVDDFRFETSYD-------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQ
        MG+ +L++D+   ++S++                         FAHRDCIQRWC EKG+ +CEICLQ Y+PGYT  SK  +  +  V+I D   I R E 
Subjt:  MGEAILYVDDFRFETSYD-------------------------FAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQ

Query:  EEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRY
                    + +    C++  DRGAS C+ +AL F+++LL++H +D +  G  +YP+T+ TVL L+A GI+ PM VIIR IT +Q S+   RY+   
Subjt:  EEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRY

Query:  RDQDSDDDEDDDISRFEDDERRLH
         ++D+   E++D    E++E++ H
Subjt:  RDQDSDDDEDDDISRFEDDERRLH

AT2G02960.1 RING/FYVE/PHD zinc finger superfamily protein1.3e-1834.81Show/hide
Query:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT
        S  +AHR C+QRWC EKG+I+CEIC Q Y+PGYTAP    Q  +  + I  G  I   +  +    A   A+    +S     + ++  GA++C+S AL 
Subjt:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT

Query:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR
           +LL+RH   +     G  D P ++ +++LLRA+G + P Y++   I+ +Q   +R
Subjt:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR

AT2G02960.2 RING/FYVE/PHD zinc finger superfamily protein1.3e-1834.81Show/hide
Query:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT
        S  +AHR C+QRWC EKG+I+CEIC Q Y+PGYTAP    Q  +  + I  G  I   +  +    A   A+    +S     + ++  GA++C+S AL 
Subjt:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT

Query:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR
           +LL+RH   +     G  D P ++ +++LLRA+G + P Y++   I+ +Q   +R
Subjt:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR

AT2G02960.4 RING/FYVE/PHD zinc finger superfamily protein1.3e-1834.81Show/hide
Query:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT
        S  +AHR C+QRWC EKG+I+CEIC Q Y+PGYTAP    Q  +  + I  G  I   +  +    A   A+    +S     + ++  GA++C+S AL 
Subjt:  SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDS---ACSTTTDRGASYCKSVALT

Query:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR
           +LL+RH   +     G  D P ++ +++LLRA+G + P Y++   I+ +Q   +R
Subjt:  FTLVLLVRHFYDVI--AVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRR

AT4G02075.1 RING/FYVE/PHD zinc finger superfamily protein1.3e-3137.72Show/hide
Query:  MGEAILYVDDFR-------------------FET------SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQ
        MG+ IL++DD +                   FE       +  FAHR+CIQRWC EKG+  CEICLQ Y+ GYTA  K+ +L +  V+I    +  R  +
Subjt:  MGEAILYVDDFR-------------------FET------SYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQ

Query:  EEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRY
           +   S         S C++  DRGAS+C+S+  T ++ LL++H +DVI  G  +YPF++ TVL L+A GI+ PM++IIR I+T+Q ++RR   R++Y
Subjt:  EEAAEPASPPADDGASDSACSTTTDRGASYCKSVALTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRY

Query:  RDQDSDD----DEDDDISRFEDDERRLH
         + + +D    D+DDD+   ED+E++ H
Subjt:  RDQDSDD----DEDDDISRFEDDERRLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAAGCGATTCTGTACGTGGACGACTTCCGATTCGAAACGTCCTACGATTTCGCCCACAGAGACTGTATTCAGAGATGGTGCACCGAAAAGGGCAGCATAGTCTG
TGAAATTTGCCTCCAGAATTACGAGCCGGGCTACACAGCGCCTTCTAAGAAGCCACAACTCGGCGATCCAGGTGTCAGCATTAGTGATGGTGCGCAAATCCCGAGAAGCG
AGCAAGAAGAGGCGGCGGAGCCCGCGTCTCCGCCCGCTGATGACGGCGCGTCGGACTCCGCCTGCTCCACCACCACTGACCGCGGTGCCTCCTATTGTAAATCGGTTGCT
CTCACCTTCACTTTAGTATTGTTGGTGAGACATTTCTATGATGTTATTGCTGTTGGCGCTGGAGATTATCCATTTACGCTCGCCACGGTGCTTCTTTTAAGAGCAAGCGG
GATCATTTTCCCTATGTACGTGATAATTCGAGGTATCACCACCGTTCAAAACAGCATTCGTCGAAGTCGTTATCGGTACCGATATCGAGATCAGGATTCTGATGACGACG
AAGATGATGATATTTCGAGGTTTGAAGACGATGAAAGAAGGCTCCATCATATTGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAAGCGATTCTGTACGTGGACGACTTCCGATTCGAAACGTCCTACGATTTCGCCCACAGAGACTGTATTCAGAGATGGTGCACCGAAAAGGGCAGCATAGTCTG
TGAAATTTGCCTCCAGAATTACGAGCCGGGCTACACAGCGCCTTCTAAGAAGCCACAACTCGGCGATCCAGGTGTCAGCATTAGTGATGGTGCGCAAATCCCGAGAAGCG
AGCAAGAAGAGGCGGCGGAGCCCGCGTCTCCGCCCGCTGATGACGGCGCGTCGGACTCCGCCTGCTCCACCACCACTGACCGCGGTGCCTCCTATTGTAAATCGGTTGCT
CTCACCTTCACTTTAGTATTGTTGGTGAGACATTTCTATGATGTTATTGCTGTTGGCGCTGGAGATTATCCATTTACGCTCGCCACGGTGCTTCTTTTAAGAGCAAGCGG
GATCATTTTCCCTATGTACGTGATAATTCGAGGTATCACCACCGTTCAAAACAGCATTCGTCGAAGTCGTTATCGGTACCGATATCGAGATCAGGATTCTGATGACGACG
AAGATGATGATATTTCGAGGTTTGAAGACGATGAAAGAAGGCTCCATCATATTGTCTAA
Protein sequenceShow/hide protein sequence
MGEAILYVDDFRFETSYDFAHRDCIQRWCTEKGSIVCEICLQNYEPGYTAPSKKPQLGDPGVSISDGAQIPRSEQEEAAEPASPPADDGASDSACSTTTDRGASYCKSVA
LTFTLVLLVRHFYDVIAVGAGDYPFTLATVLLLRASGIIFPMYVIIRGITTVQNSIRRSRYRYRYRDQDSDDDEDDDISRFEDDERRLHHIV