CuGenDBv2

MC00g0668 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC00g0668
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold230:250996..254273
RNA-Seq Expression: MC00g0668
Synteny: MC00g0668
Gene Ontology terms:
GO:2000767 - positive regulation of cytoplasmic translation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0045182 - translation regulator activity (molecular function)
InterPro domains: IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0040581.1 uncharacterized protein E6C27_scaffold262G001550 [Cucumis melo var. makuwa] | E-value: 2.76e-148 | Identity: 88.94%
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL SLNAKTG+FSKRM+IIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

XP_004143799.1 uncharacterized protein LOC101211176 [Cucumis sativus] | E-value: 1.94e-145 | Identity: 88.94%
Query:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N RF  L QS RC+CFAPRFVA L NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL  LNAKTG+FSKRMKIIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

XP_008465679.1 PREDICTED: uncharacterized protein LOC103503314 isoform X2 [Cucumis melo] | E-value: 3.90e-145 | Identity: 88.51%
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ R  N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL SLNAKTG+FSKRM+IIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

XP_022151325.1 uncharacterized protein LOC111019288 [Momordica charantia] | E-value: 9.89e-162 | Identity: 97.48%
Query:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK
        MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR    NSASSAPEPRSWFGPNGQYIK
Subjt:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK

Query:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
        ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
Subjt:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL

Query:  KISRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        KISRSL SLNAKTGLFSKRMKIIHRDPTLH QRVAAIK
Subjt:  KISRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

XP_038889908.1 uncharacterized protein LOC120079678 isoform X2 [Benincasa hispida] | E-value: 1.54e-147 | Identity: 88.98%
Query:  MSSSICSSIRALPPHHGRPNNSRFALF--QSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL
        MSSS+CSSIRAL PH+ R  N RF L   QS RC+CFAPRFVACL NDDSVAIP+P+PLAFDP+EELYGLGVDL PRN+ASSAPEPRSWFGPNGQYIKEL
Subjt:  MSSSICSSIRALPPHHGRPNNSRFALF--QSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI
        PCPSCRGRGYAPCTECGIERS+ADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI

Query:  SRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        SRSL SLNAKTG+FSKRMKIIHRDP LH QRVAAIK
Subjt:  SRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KS13 Uncharacterized protein | E-value: 9.40e-146 | Identity: 88.94%
Query:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N RF  L QS RC+CFAPRFVA L NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL  LNAKTG+FSKRMKIIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

A0A1S3CPF7 uncharacterized protein LOC103503314 isoform X2 | E-value: 1.89e-145 | Identity: 88.51%
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ R  N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL SLNAKTG+FSKRM+IIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

A0A5D3C105 Uncharacterized protein | E-value: 1.34e-148 | Identity: 88.94%
Query:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP
        MSSS+C SI AL PH+ RP N   F L QS RC+CFAPRFVACL NDDSVAIPKP+PLAFDP+EELYGL VDL PRNSASSAPEPRSWFGPNGQYIKELP
Subjt:  MSSSICSSIRALPPHHGRPNN-SRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELP

Query:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS
        CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKIS
Subjt:  CPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKIS

Query:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        RSL SLNAKTG+FSKRM+IIHRDP LH QRVAAIK
Subjt:  RSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

A0A6J1DAV1 uncharacterized protein LOC111019288 | E-value: 4.79e-162 | Identity: 97.48%
Query:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK
        MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR    NSASSAPEPRSWFGPNGQYIK
Subjt:  MSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPR----NSASSAPEPRSWFGPNGQYIK

Query:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
        ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL
Subjt:  ELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGL

Query:  KISRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        KISRSL SLNAKTGLFSKRMKIIHRDPTLH QRVAAIK
Subjt:  KISRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

A0A6J1GHY9 uncharacterized protein LOC111454354 isoform X2 | E-value: 5.95e-143 | Identity: 88.98%
Query:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLT-NDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL
        MSS +CSSIRAL P   R  N RF  L QSKRC+ FAPRFVACL+ NDDSVAIPKP PLAFDP EE+YGLGVDL PRNS SSAPEPRSWFGPNGQYI+EL
Subjt:  MSSSICSSIRALPPHHGRPNNSRF-ALFQSKRCLCFAPRFVACLT-NDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKEL

Query:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI
        PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDE+PWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSP PEVGLKI
Subjt:  PCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKI

Query:  SRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
        SRSL SLNAKTGLFSKRMKIIHRDPTLH QRVAAIK
Subjt:  SRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK

SwissProt top hits (E-value, % identity, alignment)
No hits found
Arabidopsis top hits (E-value, % identity, alignment)
AT5G20220.1 zinc knuckle (CCHC-type) family protein | E-value: 3.7e-78 | Identity: 66.2%
Query:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER
        NS+F  F   R +  +    +   ND SV+  + +   + +DPSEEL+  GVD  PR  +  + EPRSWFGPNGQYI+ELPCP+CRGRGY  C+ CGIER
Subjt:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER

Query:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNSLNAKTGLFSKRMKI
        SR DC  C GKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP+PEVG KISRSL SLNAKTGLFSKRMKI
Subjt:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNSLNAKTGLFSKRMKI

Query:  IHRDPTLHGQRVAAIK
        IHRDP LH QRVAAIK
Subjt:  IHRDPTLHGQRVAAIK

AT5G20220.2 zinc knuckle (CCHC-type) family protein | E-value: 3.7e-78 | Identity: 66.2%
Query:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER
        NS+F  F   R +  +    +   ND SV+  + +   + +DPSEEL+  GVD  PR  +  + EPRSWFGPNGQYI+ELPCP+CRGRGY  C+ CGIER
Subjt:  NSRFALFQSKRCLCFAPRFVACLTNDDSVAIPKPS--PLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIER

Query:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNSLNAKTGLFSKRMKI
        SR DC  C GKGI+TC +CLGD VIWEESIDE+PWEKARS+SP R+KEDDEVDNLEIK  +++KSKR+YQSP+PEVG KISRSL SLNAKTGLFSKRMKI
Subjt:  SRADCSVCNGKGIVTCHQCLGDRVIWEESIDEQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNSLNAKTGLFSKRMKI

Query:  IHRDPTLHGQRVAAIK
        IHRDP LH QRVAAIK
Subjt:  IHRDPTLHGQRVAAIK


Sequences
CDS sequence
GCAAATGTCGCGTTTGTTCCAACCACGTCCACAAGACCGCGACGGCGTGTGGCATTTTCTGACCACACATGCAAAGCCAAAATCTTTCCTTCCAACGCCCACCATTTCCA
GTTTGGTTCTTCAATTCGCGCTGTGGCCGTTCGTAGCAGCGTGCGTATTTCTTCTTCTTCCTCTTCCCATATAACGCATTCCCACTCCAAACACATTTATCAGATGTCCT
CCTCAATCTGTTCGTCAATTCGGGCATTGCCGCCGCACCACGGGCGCCCCAACAACTCCAGATTCGCATTGTTTCAATCCAAGAGATGTTTATGCTTCGCGCCTCGCTTC
GTTGCTTGCTTGACCAATGACGACTCTGTTGCAATCCCCAAACCCTCGCCGCTGGCCTTCGATCCTTCGGAGGAGTTGTACGGACTTGGCGTCGATTTAAACCCAAGGAA
TTCAGCTTCCAGCGCACCTGAACCCAGGTCCTGGTTTGGTCCGAATGGTCAGTATATTAAAGAACTACCTTGCCCAAGTTGCCGAGGTAGGGGCTATGCGCCGTGTACGG
AATGTGGAATTGAAAGATCCCGAGCAGACTGTTCCGTGTGTAATGGGAAGGGTATAGTGACCTGCCACCAATGCTTGGGAGATCGTGTCATATGGGAAGAGTCCATTGAT
GAACAACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTTGATAACCTGGAAATAAAGCTGGAAGAAAAGAAGAAATCAAAGCGTGT
ATACCAATCACCTTCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAACAGTCTCAATGCCAAAACAGGTCTATTTAGCAAGAGAATGAAGATTATCCATCGTGACC
CCACTCTTCATGGCCAGAGAGTTGCTGCAATTAAA
mRNA sequence
GCAAATGTCGCGTTTGTTCCAACCACGTCCACAAGACCGCGACGGCGTGTGGCATTTTCTGACCACACATGCAAAGCCAAAATCTTTCCTTCCAACGCCCACCATTTCCA
GTTTGGTTCTTCAATTCGCGCTGTGGCCGTTCGTAGCAGCGTGCGTATTTCTTCTTCTTCCTCTTCCCATATAACGCATTCCCACTCCAAACACATTTATCAGATGTCCT
CCTCAATCTGTTCGTCAATTCGGGCATTGCCGCCGCACCACGGGCGCCCCAACAACTCCAGATTCGCATTGTTTCAATCCAAGAGATGTTTATGCTTCGCGCCTCGCTTC
GTTGCTTGCTTGACCAATGACGACTCTGTTGCAATCCCCAAACCCTCGCCGCTGGCCTTCGATCCTTCGGAGGAGTTGTACGGACTTGGCGTCGATTTAAACCCAAGGAA
TTCAGCTTCCAGCGCACCTGAACCCAGGTCCTGGTTTGGTCCGAATGGTCAGTATATTAAAGAACTACCTTGCCCAAGTTGCCGAGGTAGGGGCTATGCGCCGTGTACGG
AATGTGGAATTGAAAGATCCCGAGCAGACTGTTCCGTGTGTAATGGGAAGGGTATAGTGACCTGCCACCAATGCTTGGGAGATCGTGTCATATGGGAAGAGTCCATTGAT
GAACAACCATGGGAGAAAGCACGCTCCACTTCTCCATTAAGAATGAAGGAAGATGATGAAGTTGATAACCTGGAAATAAAGCTGGAAGAAAAGAAGAAATCAAAGCGTGT
ATACCAATCACCTTCTCCTGAAGTTGGATTAAAGATCAGTCGATCATTAAACAGTCTCAATGCCAAAACAGGTCTATTTAGCAAGAGAATGAAGATTATCCATCGTGACC
CCACTCTTCATGGCCAGAGAGTTGCTGCAATTAAA
Protein sequence
ANVAFVPTTSTRPRRRVAFSDHTCKAKIFPSNAHHFQFGSSIRAVAVRSSVRISSSSSSHITHSHSKHIYQMSSSICSSIRALPPHHGRPNNSRFALFQSKRCLCFAPRF
VACLTNDDSVAIPKPSPLAFDPSEELYGLGVDLNPRNSASSAPEPRSWFGPNGQYIKELPCPSCRGRGYAPCTECGIERSRADCSVCNGKGIVTCHQCLGDRVIWEESID
EQPWEKARSTSPLRMKEDDEVDNLEIKLEEKKKSKRVYQSPSPEVGLKISRSLNSLNAKTGLFSKRMKIIHRDPTLHGQRVAAIK
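
The protein sequence above appears to correspond to a frame-0 translation of the listed CDS under the standard genetic code: the first seven codons give ANVAFVP, matching the start of the protein record. The following is a minimal Python sketch for checking that relationship. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration, not part of CuGenDB, and the use of the standard genetic code is an assumption.

```python
# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0.

    Whitespace and line breaks are removed first, a trailing partial
    codon is ignored, and unrecognized codons are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 21 bases of the CDS sequence listed above.
print(translate("GCAAATGTCGCGTTTGTTCCA"))  # ANVAFVP
```

Pasting the full multi-line CDS into `translate` and comparing the result with the protein record is one way to confirm that the two fields agree over their whole length.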