; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g31310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g31310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr8:22441398..22446761
RNA-Seq ExpressionMoc08g31310
SyntenyMoc08g31310
Gene Ontology termsGO:2000767 - positive regulation of cytoplasmic translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016624 - oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor (molecular function)
GO:0045182 - translation regulator activity (molecular function)
InterPro domainsIPR001017 - Dehydrogenase, E1 component
IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040581.1 uncharacterized protein E6C27_scaffold262G001550 [Cucumis melo var. makuwa]8.6e-10275.29Show/hide
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL +++    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

XP_004143799.1 uncharacterized protein LOC101211176 [Cucumis sativus]1.5e-10175.29Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N R F L QS RC+CFAPRFVA L NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNV-DGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL + +    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSLNV-DGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

XP_008465679.1 PREDICTED: uncharacterized protein LOC103503314 isoform X2 [Cucumis melo]7.3e-10174.9Show/hide
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ R  N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL +++    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

XP_022151325.1 uncharacterized protein LOC111019288 [Momordica charantia]1.7e-11398.06Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK
        MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR    NSASSAPEPRSWFGPNGQYIK
Subjt:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK

Query:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
        ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
Subjt:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL

Query:  KISRSL
        KISRSL
Subjt:  KISRSL

XP_038889908.1 uncharacterized protein LOC120079678 isoform X2 [Benincasa hispida]2.9e-10275Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSRF--ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL
        MSSS+CSSIRAL PH+ R  N RF   L QS RC+CFAPRFVACL NDDSVAIP+P+PLAFDP+EELYGLGVDL PRN+ASSAPEPRSWFGPNGQYIKEL
Subjt:  MSSSICSSIRALPPHHGRPNNSRF--ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI
        PCPSCRGRGYAPCTECGIERS+ADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI

Query:  SRSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        SRSL +++    +  K+     ++  L    +      KG     A  +KR+ EAL
Subjt:  SRSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

TrEMBL top hitse value%identityAlignment
A0A0A0KS13 Uncharacterized protein7.1e-10275.29Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N R F L QS RC+CFAPRFVA L NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNV-DGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL + +    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSLNV-DGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

A0A1S3CPF7 uncharacterized protein LOC103503314 isoform X23.5e-10174.9Show/hide
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ R  N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL +++    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

A0A5D3C105 Uncharacterized protein4.2e-10275.29Show/hide
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        RSL +++    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  RSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

A0A6J1DAV1 uncharacterized protein LOC1110192888.1e-11498.06Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK
        MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR    NSASSAPEPRSWFGPNGQYIK
Subjt:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK

Query:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
        ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
Subjt:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL

Query:  KISRSL
        KISRSL
Subjt:  KISRSL

A0A6J1KMZ5 uncharacterized protein LOC111496065 isoform X11.1e-9774.22Show/hide
Query:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACL-TNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL
        MSS +CSSIRAL P   R  N R F L QSKRC+ FAPRFVACL +NDDSVAIPKP PLAFDP EE+YGLGVDL PRNS SSAPEPRSWFGPNGQYI+EL
Subjt:  MSSSICSSIRALPPHHGRPNNSR-FALFQSKRCLCFAPRFVACL-TNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI

Query:  SRSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL
        SRSL +++    +  K+     ++  L    +      KG A      +KR+ EAL
Subjt:  SRSL-NVDGMDAVAVKQACKFAKEHVLKNGPIDEATSLKGGASDQAATKKRSREAL

SwissProt top hitse value%identityAlignment
P52901 Pyruvate dehydrogenase E1 component subunit alpha-1, mitochondrial4.5e-0578.57Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMDA AVKQACKFAK+H L+ GPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

P52902 Pyruvate dehydrogenase E1 component subunit alpha, mitochondrial8.3e-0789.29Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMDA+AVKQACKFAKEH LKNGPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

Q654V6 Pyruvate dehydrogenase E1 component subunit alpha-2, mitochondrial1.6e-0578.57Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMD +AVKQACKFAK+H L+NGPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

Q6Z5N4 Pyruvate dehydrogenase E1 component subunit alpha-1, mitochondrial2.0e-0578.57Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMD +AVKQACKFAKEH + NGPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

Q8H1Y0 Pyruvate dehydrogenase E1 component subunit alpha-2, mitochondrial8.3e-0789.29Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMDA+AVKQACKFAKEH LKNGPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

Arabidopsis top hitse value%identityAlignment
AT1G24180.1 Thiamin diphosphate-binding fold (THDP-binding) superfamily protein5.9e-0889.29Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMDA+AVKQACKFAKEH LKNGPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

AT1G59900.1 pyruvate dehydrogenase complex E1 alpha subunit3.2e-0678.57Show/hide
Query:  LNVDGMDAVAVKQACKFAKEHVLKNGPI
        L VDGMDA AVKQACKFAK+H L+ GPI
Subjt:  LNVDGMDAVAVKQACKFAKEHVLKNGPI

AT5G20220.1 zinc knuckle (CCHC-type) family protein2.1e-6161.96Show/hide
Query:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER
        NS+F  F   R +  +    +   ND SV+  + +   + +DPSEEL+  GVD  PR  +  + EPRSWFGPNGQYI+ELPCP+CRGRGY  C+ CGIER
Subjt:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER

Query:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSL
        SR DC  C GKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP+PEVG KISRSL
Subjt:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSL

AT5G20220.2 zinc knuckle (CCHC-type) family protein2.1e-6161.96Show/hide
Query:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER
        NS+F  F   R +  +    +   ND SV+  + +   + +DPSEEL+  GVD  PR  +  + EPRSWFGPNGQYI+ELPCP+CRGRGY  C+ CGIER
Subjt:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER

Query:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSL
        SR DC  C GKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP+PEVG KISRSL
Subjt:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTCCTCAATCTGTTCGTCAATTCGGGCATTGCCGCCGCACCACGGGCGCCCCAACAACTCCAGATTCGCATTGTTTCAATCCAAGAGATGTTTATGCTTCGCGCC
TCGCTTCGTTGCTTGCTTGACCAATGACGACTCTGTTGCAATCCCCAAACCCTCGCCGCTGGCCTTCGATCCTTCGGAGGAGTTGTACGGACTTGGCGTCGATTTAAACC
CAAGGAATTCAGCTTCCAGCGCACCTGAACCCAGGTCCTGGTTTGGTCCGAATGGTCAGTATATTAAAGAACTACCTTGCCCAAGTTGCCGAGGTAGGGGCTATGCGCCG
TGTACGGAATGTGGAATTGAAAGATCCCGAGCAGACTGTTCCGTGTGTAATGGGAAGGGTATAGTGACCTGCCACCAATGCTTGGGAGATCGTGTCATATGGGAAGAGTC
CATTGATGAACAACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTTGATAACCTGGAAATAAAGCTGGAAGAAAAGAAGAAATCAA
AGCGTGTATACCAATCACCTTCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAACGTGGATGGCATGGATGCTGTTGCTGTCAAACAGGCTTGCAAGTTTGCTAAG
GAACATGTTCTGAAGAATGGACCAATTGATGAAGCTACTTCTCTCAAAGGTGGAGCTTCTGACCAGGCTGCCACAAAGAAGAGATCAAGAGAAGCTCTTGTATCAGAAGA
TGTTGGTGAACTGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCTCCTCAATCTGTTCGTCAATTCGGGCATTGCCGCCGCACCACGGGCGCCCCAACAACTCCAGATTCGCATTGTTTCAATCCAAGAGATGTTTATGCTTCGCGCC
TCGCTTCGTTGCTTGCTTGACCAATGACGACTCTGTTGCAATCCCCAAACCCTCGCCGCTGGCCTTCGATCCTTCGGAGGAGTTGTACGGACTTGGCGTCGATTTAAACC
CAAGGAATTCAGCTTCCAGCGCACCTGAACCCAGGTCCTGGTTTGGTCCGAATGGTCAGTATATTAAAGAACTACCTTGCCCAAGTTGCCGAGGTAGGGGCTATGCGCCG
TGTACGGAATGTGGAATTGAAAGATCCCGAGCAGACTGTTCCGTGTGTAATGGGAAGGGTATAGTGACCTGCCACCAATGCTTGGGAGATCGTGTCATATGGGAAGAGTC
CATTGATGAACAACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTTGATAACCTGGAAATAAAGCTGGAAGAAAAGAAGAAATCAA
AGCGTGTATACCAATCACCTTCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAACGTGGATGGCATGGATGCTGTTGCTGTCAAACAGGCTTGCAAGTTTGCTAAG
GAACATGTTCTGAAGAATGGACCAATTGATGAAGCTACTTCTCTCAAAGGTGGAGCTTCTGACCAGGCTGCCACAAAGAAGAGATCAAGAGAAGCTCTTGTATCAGAAGA
TGTTGGTGAACTGTTGTAG
Protein sequenceShow/hide protein sequence
MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAP
CTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNVDGMDAVAVKQACKFAK
EHVLKNGPIDEATSLKGGASDQAATKKRSREALVSEDVGELL