; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg20526 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg20526
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionmyocardial zonula adherens protein-like
Genome locationCarg_Chr12:903699..905469
RNA-Seq ExpressionCarg20526
SyntenyCarg20526
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585310.1 hypothetical protein SDJN03_18043, partial [Cucurbita argyrosperma subsp. sororia]3.6e-14999.31Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
        MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK

Query:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
        MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVET LNDLVPGDTG TVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
Subjt:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA

Query:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED
        QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED
Subjt:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED

KAG7020228.1 hypothetical protein SDJN02_16911, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-150100Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
        MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK

Query:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
        MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
Subjt:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA

Query:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED
        QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED
Subjt:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED

XP_022131622.1 uncharacterized protein LOC111004753 [Momordica charantia]6.7e-5552.04Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
        +D YIKESI H+LGLPVS  TL LKL ASEA Q+ L  QCLSLQSKLN  I + ++++AEA +SAQ LKKAIE+N  LS+ENS L  Q K++K  A++ K
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK

Query:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
        MA+  A E   R R+L  E  +LT+DL FF+ QYQ RKF  +SMEE LV++ L   + G+T +   +FL+ N    SCRRLL+LS SL P+TQ+ALSLAA
Subjt:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA

Query:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFAR----KPPASS--SSVKVEKNE
        +IK LE+N  +LQ+ L  AQ EV  L+++I  L+ED+  +S +    R    KPPASS  +SVK+   E
Subjt:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFAR----KPPASS--SSVKVEKNE

XP_022951817.1 uncharacterized protein LOC111454550 [Cucurbita moschata]3.9e-9587.83Show/hide
Query:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT
        MMSAQGLKKAIERNERLSKENSRLEDQLK+MKVIASRGKMAIHG KEAEDRARKLN+EA KLTEDLNFFKEQ+QMRKFSFASMEEVLVETALNDLV  DT
Subjt:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT

Query:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK
        GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRAL LAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVK   
Subjt:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK

Query:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED
                       LRQNKRP ES KSED
Subjt:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED

XP_023538594.1 uncharacterized protein LOC111799313 [Cucurbita pepo subsp. pepo]3.5e-9687.39Show/hide
Query:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT
        MMSAQGLKKAIERNERLSKENSRLEDQLK+MKVIAS+G+MAIH AK+AEDRARKLN+EATKLTEDLNFFKEQ+QMRKFSFASMEEVLVETALNDLVPGDT
Subjt:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT

Query:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK
        G TVGQFLVANMNN+SCRRLLHLSQSLRPST RALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDS PDPDFARKPPASSSSVK   
Subjt:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK

Query:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED
                       LRQNKRPRESSKSED
Subjt:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED

TrEMBL top hitse value%identityAlignment
A0A2I4GQJ3 intracellular protein transport protein USO1-like4.4e-3642.21Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------
        +D YIKESIDH+LGLPVS +TL LKL+A E TQRRL  +CLSLQ+KL +     +  +AE+ M+AQ LKK +E N++L+ E   L  Q K+         
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------

Query:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEV----LVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLS
           + +   G  A   AKEAE R  +L +E  +L+++L F+K +Y MR    +S E +    L+E+ L  L+  D  +    FL AN  ++SC+ LL + 
Subjt:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEV----LVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLS

Query:  QSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDE
          LRPSTQ+ALSLAA++ TLE++  +L++ L+ A+ EV +L +E
Subjt:  QSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDE

A0A2N9IYL3 Uncharacterized protein7.5e-3641.5Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------
        +D YIK+SIDH+LGLPVSTQTL LKL+ S+  QRRL ++CLSLQ KL +        +AE+ M+AQ LKK +E N++L+ E + L  Q  +         
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------

Query:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFAS----MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLS
           + +   G  A   AKEAE R  +L +E  +L+E+L F+K + +MR    ++    +E  L+E+ L  L+  D  ++   FL AN +++SC+RLL + 
Subjt:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFAS----MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLS

Query:  QSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKE
          LRPSTQ+ LSLAA++KTLE++   L+I L+ A+ EV LL +E   L E+ +
Subjt:  QSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKE

A0A6J1BRI9 uncharacterized protein LOC1110047533.2e-5552.04Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK
        +D YIKESI H+LGLPVS  TL LKL ASEA Q+ L  QCLSLQSKLN  I + ++++AEA +SAQ LKKAIE+N  LS+ENS L  Q K++K  A++ K
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGK

Query:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA
        MA+  A E   R R+L  E  +LT+DL FF+ QYQ RKF  +SMEE LV++ L   + G+T +   +FL+ N    SCRRLL+LS SL P+TQ+ALSLAA
Subjt:  MAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAA

Query:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFAR----KPPASS--SSVKVEKNE
        +IK LE+N  +LQ+ L  AQ EV  L+++I  L+ED+  +S +    R    KPPASS  +SVK+   E
Subjt:  QIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFAR----KPPASS--SSVKVEKNE

A0A6J1GIQ2 uncharacterized protein LOC1114545501.9e-9587.83Show/hide
Query:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT
        MMSAQGLKKAIERNERLSKENSRLEDQLK+MKVIASRGKMAIHG KEAEDRARKLN+EA KLTEDLNFFKEQ+QMRKFSFASMEEVLVETALNDLV  DT
Subjt:  MMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDT

Query:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK
        GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRAL LAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVK   
Subjt:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEK

Query:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED
                       LRQNKRP ES KSED
Subjt:  NEWNGCNLFKQFVIVLRQNKRPRESSKSED

A0A6J1KKT5 uncharacterized protein LOC1114966135.2e-5382.43Show/hide
Query:  MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPD
        MEEVLVETALNDLV GDTGST+GQFLVANMNN+SCRRLLHLSQSLRPS+ RALSLAAQIKTLEENNRQLQI LSVAQGEV+LLSDEIGRLLEDKEDSSPD
Subjt:  MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKEDSSPD

Query:  PDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED
        PDFARKPPASSSSVK                  LRQNKRP ESSKSED
Subjt:  PDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G14680.1 unknown protein1.5e-2534.3Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------
        +D +IKE+IDH+LGLP+S   L  KL  +E +QRRL  Q LSL S+L +  +     ++EA M+AQ LKK +E N++L  E   L +Q K+         
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKR---------

Query:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQ-FLVANMNNDS-CRRLLHLSQS
           + +   G      A+EAE R R+L +E  K+++++          K    S E+ LV++ L   V  D   ++G+ FL AN   D  C  LL     
Subjt:  --MKVIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQ-FLVANMNNDS-CRRLLHLSQS

Query:  LRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDE
        L+P+TQ+ +SL + +K +E+    L + L+ A+ EV L+S++
Subjt:  LRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDE

AT4G09060.1 unknown protein1.8e-2634.52Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMK-------
        ++ YIK++IDH+LGLP+S ++L  KL  +E +QRRL  Q   L S+L +        ++EA M+AQ LKK +E N++L+ E   L  Q K+++       
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMK-------

Query:  ----VIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFAS--MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQS
             +   G  +   A+EAE R R+L DE  +++E++  FK Q    +    +  +EE L+++ L  L+  D       FL AN+++ SC+ LL     
Subjt:  ----VIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQYQMRKFSFAS--MEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQS

Query:  LRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKED
        L+PSTQ+ LSL ++ K  E+    + + L+ A+ EV L+S  + R L DKE+
Subjt:  LRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKED

AT4G09060.2 unknown protein5.0e-2431.29Show/hide
Query:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMK-------
        ++ YIK++IDH+LGLP+S ++L  KL  +E +QRRL  Q   L S+L +        ++EA M+AQ LKK +E N++L+ E   L  Q K+++       
Subjt:  MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMK-------

Query:  ----VIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQ----------------------------YQMRKFSFASMEEVLVETALNDLVPGDT
             +   G  +   A+EAE R R+L DE  +++E++  FK Q                            + + +     +EE L+++ L  L+  D 
Subjt:  ----VIASRGKMAIHGAKEAEDRARKLNDEATKLTEDLNFFKEQ----------------------------YQMRKFSFASMEEVLVETALNDLVPGDT

Query:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKED
              FL AN+++ SC+ LL     L+PSTQ+ LSL ++ K  E+    + + L+ A+ EV L+S  + R L DKE+
Subjt:  GSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQGEVILLSDEIGRLLEDKED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCATACATCAAAGAATCAATCGACCACACACTGGGCCTCCCCGTCTCCACCCAAACGCTGGCTCTGAAACTCGCCGCCTCCGAAGCCACCCAGCGCCGCCTCGA
AGCCCAATGCCTCTCTTTACAGTCTAAATTAAACCAACACATCCGCTCAACCAAAGCCGCGCAGGCGGAAGCGATGATGAGCGCGCAAGGCTTGAAGAAGGCGATCGAGA
GAAACGAACGATTAAGCAAAGAGAATTCGCGTTTGGAGGATCAATTAAAGAGAATGAAAGTCATTGCAAGTAGAGGGAAAATGGCTATCCATGGCGCCAAGGAGGCGGAA
GATCGTGCGCGTAAATTGAACGATGAGGCTACGAAATTAACCGAGGATTTGAATTTCTTTAAAGAACAATACCAAATGCGAAAGTTTTCCTTCGCTTCCATGGAGGAGGT
TCTAGTCGAGACGGCTTTAAATGATTTGGTTCCTGGGGATACTGGTTCGACCGTTGGGCAGTTCTTGGTGGCGAATATGAACAACGACTCGTGTCGAAGATTGCTACATT
TGTCGCAGAGCTTAAGGCCTTCAACTCAAAGAGCTCTGTCTCTAGCTGCTCAAATCAAGACCCTCGAGGAAAACAATCGACAGCTTCAGATTCGCCTCTCAGTAGCTCAA
GGGGAGGTGATACTATTGTCGGATGAGATTGGTCGGCTGTTGGAGGATAAAGAAGACAGCAGCCCAGACCCAGATTTTGCCAGGAAACCACCGGCTAGTAGTTCATCCGT
GAAGGTTGAGAAAAATGAATGGAATGGCTGTAACTTGTTTAAGCAGTTTGTGATTGTTTTGCGTCAAAACAAAAGACCAAGGGAATCGTCGAAGTCGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCATACATCAAAGAATCAATCGACCACACACTGGGCCTCCCCGTCTCCACCCAAACGCTGGCTCTGAAACTCGCCGCCTCCGAAGCCACCCAGCGCCGCCTCGA
AGCCCAATGCCTCTCTTTACAGTCTAAATTAAACCAACACATCCGCTCAACCAAAGCCGCGCAGGCGGAAGCGATGATGAGCGCGCAAGGCTTGAAGAAGGCGATCGAGA
GAAACGAACGATTAAGCAAAGAGAATTCGCGTTTGGAGGATCAATTAAAGAGAATGAAAGTCATTGCAAGTAGAGGGAAAATGGCTATCCATGGCGCCAAGGAGGCGGAA
GATCGTGCGCGTAAATTGAACGATGAGGCTACGAAATTAACCGAGGATTTGAATTTCTTTAAAGAACAATACCAAATGCGAAAGTTTTCCTTCGCTTCCATGGAGGAGGT
TCTAGTCGAGACGGCTTTAAATGATTTGGTTCCTGGGGATACTGGTTCGACCGTTGGGCAGTTCTTGGTGGCGAATATGAACAACGACTCGTGTCGAAGATTGCTACATT
TGTCGCAGAGCTTAAGGCCTTCAACTCAAAGAGCTCTGTCTCTAGCTGCTCAAATCAAGACCCTCGAGGAAAACAATCGACAGCTTCAGATTCGCCTCTCAGTAGCTCAA
GGGGAGGTGATACTATTGTCGGATGAGATTGGTCGGCTGTTGGAGGATAAAGAAGACAGCAGCCCAGACCCAGATTTTGCCAGGAAACCACCGGCTAGTAGTTCATCCGT
GAAGGTTGAGAAAAATGAATGGAATGGCTGTAACTTGTTTAAGCAGTTTGTGATTGTTTTGCGTCAAAACAAAAGACCAAGGGAATCGTCGAAGTCGGAAGATTGA
Protein sequenceShow/hide protein sequence
MDAYIKESIDHTLGLPVSTQTLALKLAASEATQRRLEAQCLSLQSKLNQHIRSTKAAQAEAMMSAQGLKKAIERNERLSKENSRLEDQLKRMKVIASRGKMAIHGAKEAE
DRARKLNDEATKLTEDLNFFKEQYQMRKFSFASMEEVLVETALNDLVPGDTGSTVGQFLVANMNNDSCRRLLHLSQSLRPSTQRALSLAAQIKTLEENNRQLQIRLSVAQ
GEVILLSDEIGRLLEDKEDSSPDPDFARKPPASSSSVKVEKNEWNGCNLFKQFVIVLRQNKRPRESSKSED