; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22024 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22024
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationCarg_Chr19:6581915..6590540
RNA-Seq ExpressionCarg22024
SyntenyCarg22024
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]1.6e-12999.57Show/hide
Query:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
Subjt:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCE+CREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
        HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC

KAG6572025.1 hypothetical protein SDJN03_28753, partial [Cucurbita argyrosperma subsp. sororia]8.2e-9496.55Show/hide
Query:  MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHC
        MATNNHN+DLQLSLALRTAAAATVDAAS+RHLIILMRT NNLEFHRRSL HMKSQTPRETGPIEPPYPWSTNQRA VHTLNYMTSNQILTITGDVKCHHC
Subjt:  MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHC

Query:  QRIYEIEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEM
        QRIY+IEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEM
Subjt:  QRIYEIEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEM

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]9.9e-249100Show/hide
Query:  MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHC
        MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHC
Subjt:  MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHC

Query:  QRIYEIEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEMLRRIILFITLISLCATKLILLAAIRE
        QRIYEIEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEMLRRIILFITLISLCATKLILLAAIRE
Subjt:  QRIYEIEYDIVSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEMLRRIILFITLISLCATKLILLAAIRE

Query:  TPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCE
        TPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCE
Subjt:  TPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCE

Query:  RFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTG
        RFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTG
Subjt:  RFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTG

Query:  AKDRLIFLTYLALCKQLQPSNRLFNC
        AKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  AKDRLIFLTYLALCKQLQPSNRLFNC

XP_022953023.1 mucin-16-like [Cucurbita moschata]2.3e-12899.13Show/hide
Query:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        AIRET NQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
Subjt:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCE+CREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
        HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima]6.2e-11890.87Show/hide
Query:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        AI +T +QP  I Q + ETPNQ   I Q +E+TPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIV IKGDVRC
Subjt:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        KKCER+YEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCE+CREENCVEPMIPDEEDDNQF RINWLFLLLGQLIGRLKLKQLKYFCAHTYN
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
        HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC

TrEMBL top hitse value%identityAlignment
A0A0A0LAK2 Uncharacterized protein2.4e-5955.91Show/hide
Query:  PNQ-PVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCE
        P+Q P  +P    + P+   T ++A E+      T     N  +   PR +R+R +ADT RIEPPYPWS EQ A IH LEYLQ+NNI TIKG+V+CK+C+
Subjt:  PNQ-PVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCE

Query:  RFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTG
        R  EIEY+LM+KF+E+ +FIERE+ NMHDRAP CW NP L NC++C +E CVEP+IP   D    ++INWLFLLLG  +GRLKL QLK+FC  T  HRTG
Subjt:  RFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTG

Query:  AKDRLIFLTYLALCKQLQPS
        AKDRL++ TY  LCKQLQP+
Subjt:  AKDRLIFLTYLALCKQLQPS

A0A1S3AZB1 protein PAF1 homolog3.1e-6260.09Show/hide
Query:  PQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCERFYEIEYN
        P   ++T NQ   I +   QT NQ   IP         +P+RR  RT+AD  RIEPPYPWS E+ A IH LEYL++NNI+TIKG+V+CK+C+R  EIEY 
Subjt:  PQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCERFYEIEYN

Query:  LMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTGAKDRLIFL
        L++KFDEI RFIERE+DNMHDRAP  W NPIL NC +C +E CVEP+I +       S INWLFLLLG  +G LKL QLKYFC  T  HRTGAKDRLI+L
Subjt:  LMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTGAKDRLIFL

Query:  TYLALCKQLQPSN
        TYLALCKQLQP++
Subjt:  TYLALCKQLQPSN

A0A6J1C690 probable serine/threonine-protein kinase samkC2.9e-6050.89Show/hide
Query:  PNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCER
        P  P+  P T     +QS    +    +    +T P ++          RR R +     IEPPYPWS   RA +H+L+YLQ N I+TI GDV+C +C++
Subjt:  PNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCER

Query:  FYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTGA
         Y+IEY+L+ KFDEIA FIE+ +D +HDRAP  W NP LPNC++C +E+C+ P+IP E++D+ +  INWLFLLLGQ+IG L LK LKYFC +T NHRT A
Subjt:  FYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTGA

Query:  KDRLIFLTYLALCKQLQPSNRLFN
        KDRL++LTYL+LCKQLQPS  LF+
Subjt:  KDRLIFLTYLALCKQLQPSNRLFN

A0A6J1GM83 mucin-16-like1.1e-12899.13Show/hide
Query:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        AIRET NQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
Subjt:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCE+CREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
        HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC

A0A6J1I8I0 uncharacterized protein KIAA0754-like3.0e-11890.87Show/hide
Query:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        AI +T +QP  I Q + ETPNQ   I Q +E+TPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIV IKGDVRC
Subjt:  AIRETPNQPVAIPQTVEETPNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        KKCER+YEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCE+CREENCVEPMIPDEEDDNQF RINWLFLLLGQLIGRLKLKQLKYFCAHTYN
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
        HRTGAKDRLIFLTYLALCKQLQPSNRLFNC
Subjt:  HRTGAKDRLIFLTYLALCKQLQPSNRLFNC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein4.9e-4441.41Show/hide
Query:  PVAIPQTVEETPNQSTTIHQAIEQT---PNQSTTIPQATNGHSTSRPRRRRSRTRADTR--RIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKC
        P+ +    ++TPN    +      +   P  S   P       T   R  RSR+    +   I PP+PW+  +R  I +LEYL+SN I TI G+V+C+ C
Subjt:  PVAIPQTVEETPNQSTTIHQAIEQT---PNQSTTIPQATNGHSTSRPRRRRSRTRADTR--RIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKC

Query:  ERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRT
        E+ Y++ YNL  +F E+ +F   E+  M DRA   W  P    CE C  E  V+P+I + +     S+INWLFLLLGQ +G   L+QLK FC H+ NHRT
Subjt:  ERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRT

Query:  GAKDRLIFLTYLALCKQLQPSNRLFNC
        GAKDR+++LTY+ LCK LQP + LF C
Subjt:  GAKDRLIFLTYLALCKQLQPSNRLFNC

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.2e-3439.46Show/hide
Query:  PNQPVAIP---QTVEET--PNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        PN  V  P   Q  EE   P Q   +      TP +    P      ++ RP     R   D R I PPYPW+ ++   I +   L SNNI  I G V C
Subjt:  PNQPVAIP---QTVEET--PNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN
        K C+R   +EYNL  KF E+  +I+  ++ M  RAP  W  P L  C  C+ E  ++P++ + +++     INWLFLLLGQ++G   L QL+YFC     
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYN

Query:  HRTGAKDRLIFLTYLALCKQLQP
        HRTG+KDR++++TYL+LCKQL P
Subjt:  HRTGAKDRLIFLTYLALCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.3e-2036.13Show/hide
Query:  PNQPVAIP---QTVEET--PNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC
        PN  V  P   Q  EE   P Q   +      TP +    P      ++ RP     R   D R I PPYPW+ ++   I +   L SNNI  I G V C
Subjt:  PNQPVAIP---QTVEET--PNQSTTIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRC

Query:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQL
        K C+R   +EYNL  KF E+  +I+  ++ M  RAP  W  P L  C  C+ E  ++P++ + +++     INWLFLLLGQ++G   L QL
Subjt:  KKCERFYEIEYNLMNKFDEIARFIERERDNMHDRAPICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTAACAACCATAACAACGACCTTCAACTCTCTCTCGCCCTCCGGACGGCCGCCGCCGCCACGGTCGATGCAGCCTCCGATCGACACTTAATAATCTTAATGAG
AACCGCCAACAATTTAGAATTTCACCGAAGATCTCTCCGTCATATGAAATCTCAAACACCGAGAGAGACGGGGCCGATCGAGCCACCATATCCATGGTCGACGAACCAAA
GAGCGGCGGTTCATACACTAAATTATATGACATCGAACCAAATCCTAACGATCACTGGGGATGTCAAGTGCCACCATTGTCAAAGAATTTACGAGATCGAATACGACATC
GTTTCGAAGTTCAACGAGATCGGGAGCTTCGTAGAGAACAACATGGAGTCACTCCAGGACCGGACGCCGAGGTCGTGGATATGGCCGGATTATCCGACGTGTCGGTTTTG
CGGGACGGAAAAAGGAGTGAGGCCGGTGATTCCAAAGGAATGTGAGAAGATCAATTGGGTGTTCTTGCTTTTGGGAGAAATGTTACGAAGGATCATCTTGTTTATCACAC
TTATATCACTTTGTGCCACCAAATTGATCCTTCTGGCCGCCATTAGGGAAACTCCGAACCAACCGGTGGCGATCCCTCAGACCGTAGAGGAAACTCCAAACCAATCAACG
ACAATTCATCAAGCCATTGAGCAAACTCCAAACCAATCAACGACAATCCCTCAAGCCACAAATGGACACTCTACATCAAGACCGAGACGAAGACGAAGTAGAACGAGAGC
AGACACTAGGAGGATCGAGCCACCGTATCCATGGTCAGCCGAGCAACGAGCGTCAATCCACAATCTCGAGTACCTCCAATCAAACAACATCGTTACGATCAAAGGCGATG
TGAGGTGCAAAAAATGCGAGAGATTTTACGAAATCGAGTACAATTTGATGAACAAGTTCGATGAGATAGCAAGATTCATAGAAAGAGAAAGAGATAACATGCATGATAGA
GCTCCAATTTGTTGGAAAAACCCTATTTTACCAAATTGTGAGTATTGCAGGGAAGAAAACTGCGTAGAGCCGATGATACCCGACGAGGAGGACGACAACCAATTCAGTAG
AATCAATTGGCTATTCTTGCTTTTGGGGCAATTGATAGGACGTTTAAAGCTCAAACAACTCAAATACTTCTGTGCTCATACCTATAATCATCGAACTGGGGCTAAGGATC
GTCTCATTTTTCTCACTTATCTTGCTTTGTGTAAGCAACTTCAACCCTCCAATCGTCTCTTCAATTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTAACAACCATAACAACGACCTTCAACTCTCTCTCGCCCTCCGGACGGCCGCCGCCGCCACGGTCGATGCAGCCTCCGATCGACACTTAATAATCTTAATGAG
AACCGCCAACAATTTAGAATTTCACCGAAGATCTCTCCGTCATATGAAATCTCAAACACCGAGAGAGACGGGGCCGATCGAGCCACCATATCCATGGTCGACGAACCAAA
GAGCGGCGGTTCATACACTAAATTATATGACATCGAACCAAATCCTAACGATCACTGGGGATGTCAAGTGCCACCATTGTCAAAGAATTTACGAGATCGAATACGACATC
GTTTCGAAGTTCAACGAGATCGGGAGCTTCGTAGAGAACAACATGGAGTCACTCCAGGACCGGACGCCGAGGTCGTGGATATGGCCGGATTATCCGACGTGTCGGTTTTG
CGGGACGGAAAAAGGAGTGAGGCCGGTGATTCCAAAGGAATGTGAGAAGATCAATTGGGTGTTCTTGCTTTTGGGAGAAATGTTACGAAGGATCATCTTGTTTATCACAC
TTATATCACTTTGTGCCACCAAATTGATCCTTCTGGCCGCCATTAGGGAAACTCCGAACCAACCGGTGGCGATCCCTCAGACCGTAGAGGAAACTCCAAACCAATCAACG
ACAATTCATCAAGCCATTGAGCAAACTCCAAACCAATCAACGACAATCCCTCAAGCCACAAATGGACACTCTACATCAAGACCGAGACGAAGACGAAGTAGAACGAGAGC
AGACACTAGGAGGATCGAGCCACCGTATCCATGGTCAGCCGAGCAACGAGCGTCAATCCACAATCTCGAGTACCTCCAATCAAACAACATCGTTACGATCAAAGGCGATG
TGAGGTGCAAAAAATGCGAGAGATTTTACGAAATCGAGTACAATTTGATGAACAAGTTCGATGAGATAGCAAGATTCATAGAAAGAGAAAGAGATAACATGCATGATAGA
GCTCCAATTTGTTGGAAAAACCCTATTTTACCAAATTGTGAGTATTGCAGGGAAGAAAACTGCGTAGAGCCGATGATACCCGACGAGGAGGACGACAACCAATTCAGTAG
AATCAATTGGCTATTCTTGCTTTTGGGGCAATTGATAGGACGTTTAAAGCTCAAACAACTCAAATACTTCTGTGCTCATACCTATAATCATCGAACTGGGGCTAAGGATC
GTCTCATTTTTCTCACTTATCTTGCTTTGTGTAAGCAACTTCAACCCTCCAATCGTCTCTTCAATTGCTAA
Protein sequenceShow/hide protein sequence
MATNNHNNDLQLSLALRTAAAATVDAASDRHLIILMRTANNLEFHRRSLRHMKSQTPRETGPIEPPYPWSTNQRAAVHTLNYMTSNQILTITGDVKCHHCQRIYEIEYDI
VSKFNEIGSFVENNMESLQDRTPRSWIWPDYPTCRFCGTEKGVRPVIPKECEKINWVFLLLGEMLRRIILFITLISLCATKLILLAAIRETPNQPVAIPQTVEETPNQST
TIHQAIEQTPNQSTTIPQATNGHSTSRPRRRRSRTRADTRRIEPPYPWSAEQRASIHNLEYLQSNNIVTIKGDVRCKKCERFYEIEYNLMNKFDEIARFIERERDNMHDR
APICWKNPILPNCEYCREENCVEPMIPDEEDDNQFSRINWLFLLLGQLIGRLKLKQLKYFCAHTYNHRTGAKDRLIFLTYLALCKQLQPSNRLFNC