; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g2165 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g2165
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionC3H1-type domain-containing protein
Genome locationMC06:29065928..29070472
RNA-Seq ExpressionMC06g2165
SyntenyMC06g2165
Gene Ontology termsGO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000571 - Zinc finger, CCCH-type
IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013085 - U1-C, C2H2-type zinc finger
IPR036236 - Zinc finger C2H2 superfamily
IPR036855 - Zinc finger, CCCH-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158753.1 zinc finger CCCH domain-containing protein 3 [Momordica charantia]1.24e-114100Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

XP_022931617.1 zinc finger CCCH domain-containing protein 3 [Cucurbita moschata]1.06e-8580.13Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSH KAKALWFDSFRD NQ  S  F   +CNRF+ TGFCQYGDSC YFH  NNPQ SS+ PIAGFPE +NQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         NIPVNQF+EGSSLTG+L  +R+ TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

XP_022989487.1 zinc finger CCCH domain-containing protein 3 isoform X1 [Cucurbita maxima]9.86e-9184.11Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSHQKAKALWFDSFRD NQ FSD FG  VCNRF+ TGFCQYGDSCKYFH KNN Q SS+QPIAGFPE +NQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         N+PVNQF+EGSSLTG+L  +R+ TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

XP_023520794.1 zinc finger CCCH domain-containing protein 3 [Cucurbita pepo subsp. pepo]2.91e-8378.15Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSH KAKALWFDSFRD NQ  S  F   + NRF+ TGFC YGDSC YFH  NNPQ SS+ PIAGFPE +NQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         N+PVNQF+EGSSLTG+L  +R+ TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

XP_038879279.1 zinc finger CCCH domain-containing protein 3 [Benincasa hispida]8.03e-8577.07Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN-----PQNSSTQPIAGFPE
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSHQKAKALWFDSF+DPNQ     F  G CNRF+ TGFCQYGDSCKYFHP NN       NS++ PIAGFPE
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN-----PQNSSTQPIAGFPE

Query:  YSNQASNIPVNQFVEGSSLTGNLVSNRMG-TSWGNLPPSLMPPPEGGYPPLPIVDWG
         +NQ  N PVNQF +GSS TG+LVS+ MG TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  YSNQASNIPVNQFVEGSSLTGNLVSNRMG-TSWGNLPPSLMPPPEGGYPPLPIVDWG

TrEMBL top hitse value%identityAlignment
A0A1S3BQ25 zinc finger CCCH domain-containing protein 31.91e-8376.58Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAF--SDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN----PQNSSTQPIAGFP
        MPLGKYYCDYC+KQFQDTPFARKRHL SLSHQKAKALWFDSF+D NQ F  S  F   +CNRF+ TGFCQYGDSCKYFHPKNN      NSS+ PIAGFP
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAF--SDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN----PQNSSTQPIAGFP

Query:  EYSNQASNIPVNQFVEG-SSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        E +NQ  N+P N+FV+G SS TG+LVS+R+GTSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  EYSNQASNIPVNQFVEG-SSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

A0A5D3CFN3 Zinc finger CCCH domain-containing protein 31.91e-8376.58Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAF--SDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN----PQNSSTQPIAGFP
        MPLGKYYCDYC+KQFQDTPFARKRHL SLSHQKAKALWFDSF+D NQ F  S  F   +CNRF+ TGFCQYGDSCKYFHPKNN      NSS+ PIAGFP
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAF--SDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNN----PQNSSTQPIAGFP

Query:  EYSNQASNIPVNQFVEG-SSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        E +NQ  N+P N+FV+G SS TG+LVS+R+GTSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  EYSNQASNIPVNQFVEG-SSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

A0A6J1DWZ9 zinc finger CCCH domain-containing protein 35.99e-115100Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

A0A6J1EU59 zinc finger CCCH domain-containing protein 35.13e-8680.13Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSH KAKALWFDSFRD NQ  S  F   +CNRF+ TGFCQYGDSC YFH  NNPQ SS+ PIAGFPE +NQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         NIPVNQF+EGSSLTG+L  +R+ TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

A0A6J1JFY8 zinc finger CCCH domain-containing protein 3 isoform X14.78e-9184.11Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA
        MPLGKYYCDYCEKQFQDTPFARKRHL SLSHQKAKALWFDSFRD NQ FSD FG  VCNRF+ TGFCQYGDSCKYFH KNN Q SS+QPIAGFPE +NQA
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQA

Query:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         N+PVNQF+EGSSLTG+L  +R+ TSWGNLPPSLMPPPEGGYPPLP VDWG
Subjt:  SNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

SwissProt top hitse value%identityAlignment
Q0JP11 Zinc finger CCCH domain-containing protein 31.6e-4253.8Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGR-------------GVCNRFVRTGFCQYGDSCKYFHPKNNPQNSST
        MPLGKYYCDYCEKQFQDTP ARKRHL    H +A+ALW+D+ R   Q    G G              GVC  FVRTG C++GDSC+YFHPK  P N   
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGR-------------GVCNRFVRTGFCQYGDSCKYFHPKNNPQNSST

Query:  QPIAGFPEYSNQASNIPVNQ--FV-----EGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
         P         Q SNI  +Q  FV     +GSS +GN++     TSWGNLPPSL PPPEGGYPP P VDWG
Subjt:  QPIAGFPEYSNQASNIPVNQ--FV-----EGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

Q2TA39 Zinc finger matrin-type protein 51.3e-1732.54Show/hide
Query:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ------------PIA
        +Y+CDYC++ FQD    RK+HL+ L H KAK LW+D FRD      D   +  C +F+ TG C +G +C++ H  + + Q  S Q             +A
Subjt:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ------------PIA

Query:  GFPE------YSNQASNIPVNQFVEGSSLTGNLVSNRMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG
          PE         +A  +          +   +    +G  W     LPPSL  PP GG+P  P V WG
Subjt:  GFPE------YSNQASNIPVNQFVEGSSLTGNLVSNRMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG

Q6AXL8 Zinc finger matrin-type protein 56.0e-1831.76Show/hide
Query:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQ-----
        +YYCDYC++ FQD    RK+HL+ + H +AK  WFD+FRD     +D   + VC +FV+TG C +G SC++ H          Q I              
Subjt:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQ-----

Query:  ASNIPVNQF------------------VEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        +S   V+++                  +E    T N+       S  +LPPSL PPP GG+      +WG
Subjt:  ASNIPVNQF------------------VEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

Q9CQR5 Zinc finger matrin-type protein 57.3e-1630.54Show/hide
Query:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ--PIAGFPEYSNQAS
        +Y+CDYC++ FQD    RK+HL+ L H KAK +W+D FRD      D   +  C +F+ TG C +G +C++ H  + + Q  S Q        E+  + +
Subjt:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ--PIAGFPEYSNQAS

Query:  NIPVNQFVEGSSLTGNLVSN--------------RMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG
         +P     +        +S+              +    W     LPPSL  PP GG+  L  V WG
Subjt:  NIPVNQFVEGSSLTGNLVSN--------------RMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG

Q9UDW3 Zinc finger matrin-type protein 51.1e-1631.74Show/hide
Query:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ--PIAGFPEYSNQAS
        +Y+CDYC++ FQD    RK+HL+ L H KAK +W+D FRD      D   +  C +F+ TG C +G +C++ H  + + Q  S Q        E+   A 
Subjt:  KYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFH-PKNNPQNSSTQ--PIAGFPEYSNQAS

Query:  NIPVNQFVEGSSLTGNLVSN--------------RMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG
         +P     +        +S+              +    W     LPPSL  PP GG+P  P V WG
Subjt:  NIPVNQFVEGSSLTGNLVSN--------------RMGTSW---GNLPPSLMPPPEGGYPPLPIVDWG

Arabidopsis top hitse value%identityAlignment
AT2G47850.2 Zinc finger C-x8-C-x5-C-x3-H type family protein2.3e-0434.48Show/hide
Query:  FSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQP--IAGFPEYSNQASNIP
        + + FG   C  +++TG C++G SCK+ HPKN   + S  P  I G+P      + +P
Subjt:  FSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQP--IAGFPEYSNQASNIP

AT3G06410.1 Zinc finger C-x8-C-x5-C-x3-H type family protein9.2e-0631.88Show/hide
Query:  SHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIA----GFP
        +H + +       R    A  +  G  VC  F+RTG C++G SCKY HP+      S  P++    G+P
Subjt:  SHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIA----GFP

AT5G18550.1 Zinc finger C-x8-C-x5-C-x3-H type family protein1.1e-0634.78Show/hide
Query:  SHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIA----GFP
        +H + +A      R     F +  G+ VC  F+RTG C++G SCKY HP+      S  P++    GFP
Subjt:  SHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIA----GFP

AT5G26749.1 C2H2 and C2HC zinc fingers superfamily protein1.5e-4050.98Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDP--NQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSN
        MP GKYYCDYCEK+FQDT  ARKRHL S  H +AKALW+ S      N   S+   +G+CNRF+ + FC +GDSC+YFHP NN  ++      GF   + 
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDP--NQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSN

Query:  QASNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
           ++   Q ++GS+L  + VS R GT W +LPPSL PPPE GYP LP +DWG
Subjt:  QASNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG

AT5G26749.2 C2H2 and C2HC zinc fingers superfamily protein4.5e-3745.35Show/hide
Query:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDP--NQAFSDGFGRGVCNRFVRT-------------------GFCQYGDSCKYFHPK
        MP GKYYCDYCEK+FQDT  ARKRHL S  H +AKALW+ S      N   S+   +G+CNRF+ +                    FC +GDSC+YFHP 
Subjt:  MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDP--NQAFSDGFGRGVCNRFVRT-------------------GFCQYGDSCKYFHPK

Query:  NNPQNSSTQPIAGFPEYSNQASNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG
        NN  ++      GF   +    ++   Q ++GS+L  + VS R GT W +LPPSL PPPE GYP LP +DWG
Subjt:  NNPQNSSTQPIAGFPEYSNQASNIPVNQFVEGSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTGGGAAAGTACTATTGCGACTACTGCGAGAAGCAATTCCAAGACACCCCTTTCGCCAGAAAACGCCATCTTCACAGCCTCTCCCACCAGAAAGCTAAAGCTCT
CTGGTTCGACTCCTTCAGAGACCCCAATCAAGCTTTCTCTGATGGCTTTGGAAGAGGCGTCTGCAACCGATTCGTCAGAACAGGGTTTTGCCAGTACGGTGATTCTTGTA
AATATTTTCATCCCAAGAACAATCCGCAGAACTCGAGTACGCAACCCATTGCAGGTTTCCCGGAATATAGTAATCAAGCCTCAAATATTCCTGTGAATCAATTCGTCGAA
GGCAGCTCTCTGACAGGCAATTTAGTTAGCAACAGAATGGGAACATCATGGGGCAATTTACCTCCATCATTGATGCCTCCTCCAGAGGGTGGTTACCCACCTCTTCCCAT
TGTAGACTGGGGCTAA
mRNA sequenceShow/hide mRNA sequence
GGTATATCTCTTAGGATAAGCATTTAACGCCGATATTAAGGTTACTTTGAATCGCAGAATCGTGCTTTCGTCGAATGTTAGAAGATATCTTGTTGGGTGAGCTCTATCCG
AGAAGAGGGTCGATGAAACTCTACTCATCTTTCACGTACTCCTCAACTCATCGATATCAGACTCAATTCCACGTGGCATTGCAGGAGTGGTCCACGTGTCAGAACAGATT
TTCCCCTTAAGATCCTTAGAATTACAAAATAGCGATGAAAAATGAGGAGAAAATGATCGGAAACAGAACAGCTTAGAAAACCCTCGATAAGGGAAACTCGAAGACTCGCG
AAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGATGCCATTGGGAAAGTACTATTGCGACTACTGCGAGAAGCAATTCCAAGACACCCCTTTCGCCAGAAAACGCCATCT
TCACAGCCTCTCCCACCAGAAAGCTAAAGCTCTCTGGTTCGACTCCTTCAGAGACCCCAATCAAGCTTTCTCTGATGGCTTTGGAAGAGGCGTCTGCAACCGATTCGTCA
GAACAGGGTTTTGCCAGTACGGTGATTCTTGTAAATATTTTCATCCCAAGAACAATCCGCAGAACTCGAGTACGCAACCCATTGCAGGTTTCCCGGAATATAGTAATCAA
GCCTCAAATATTCCTGTGAATCAATTCGTCGAAGGCAGCTCTCTGACAGGCAATTTAGTTAGCAACAGAATGGGAACATCATGGGGCAATTTACCTCCATCATTGATGCC
TCCTCCAGAGGGTGGTTACCCACCTCTTCCCATTGTAGACTGGGGCTAATTGATGAATATCAGTTACCGGTACAAGTCTCCTCGGCTTCTCGTGTTTCCACCCCATGTCT
TATGCAGCGGTTTGCACAAATGTAGTGTCGCTTAGAAGCCAGACAAATGAACTTTTTTTGTTCTTCTCTCGGGTACGAAATGCAACCCCAAAGTGTCATAAACATCGACT
TCTGTTAAGGTTCAAGTAGTATTCAAGTCATGATGCGTCATGTTTATGGTTCTCTAAACTTGTTGAAACCATAATCGGGAGAAATTTAGATAAAGAGTTGATAAAGCATA
GTAAACAGAAGCGTGATTCTTAGCTCTCTTTAATAGCAAACTGAATTTTTCATTCATAAACCAGGCATCAGAGGGTCCATATGTTATTAATCTCAATGCATAGTCGAAAC
AGAGTGTAGAGAAGTGTGAAGAACATATGAATGCATGGTTCAAAATACGTACATTAGCCCGGAAAGCATATTATTATTCCTTTTATTCAAGCATCTTGACAAACAAGCAT
CACCCAATACATCCTCCTATACATTGGTAGTCTAATAACTTGAAGATGCAAGAACCTCTAACAAGGAAGCAATGAATTGCATCTGGGTCTGTTCATCTATAGTTAAGAAG
AGAGGAACATGAACAAATAGCGATTTGATGCCGTTCTCTTCTGCGTAGCGGAGGGAATGATAGTAGACATAATTGCACACGAAACGACCCGCATCATCTGAAGTCATGAC
TTCATATCCCTTTTTTGCCAATGTCTTGGTTATTTCCTCCACTGGAAGGGAAGTCTGCAGGGCAGAGTTGGATGAGAAATCAAATTAAAAGAAAAAAATCATGACTTGCA
TCAAGAATTTAAGACCAGCGAAAGGGTGCTATAATTCCTGCAAATCAAAGGTTATTTGCTTCAAGTGTAGACAACGGAAAATTTGCTACAGATTTTGTTTTTTCCAACCA
AGTTCAATTCTTCTAATGTTTTTGATATTAGCATGAACGCATACAAGATACTACAAAAAAAATACATTTGAATATTGTCATGCAATTTCAATATTTTCATGTGCGTATAA
TGTATTAGTTTATGATATTAGCATATTTGAAGGATGTAATAAAATATGGCATACTCGAAGAAATTAAACAAATTGATTAGATATTTCTATTGATGCAATGCCTCAAATAA
ACAAATGACTTTTTATATTGTGAATTTCATTCTAATAAAATGCAAGTATCAACTCATTTTTTCATATAAAATTAACATTTTAGTTCATTGTACTAAGAATCTCAATATAG
CTCAACTAGTTAAGGGTATATCTCGGATATGTAACTCTAACTAAAGTGTTGGGGTGCTGACCAATTACCACTCACAGTGATGACTTCATGTCCACTGTAGCAGTATTCTC
GCAGTATTCTCGCAGTATTCTCTCAAATTTTAACAGATAATGATATCTAAACATCTATATTTGGTCAAAGAAGAGCATATTCTGGTTCTCAGAACTTAAAAAAGCAACTC
AAGCAGGCATTCCCTAAACTGAAGTTCTAATAGTCAGAAAAGTGGCCTCCAAAAGACATGAATATAGTTTCTTAATGGTAGTATATTTATAAAATGCAGGTATAAAAATT
TAACCCATGGGAAAATTTATCATTCTACCTCTCGTACACGTGAAATTTCACCATCCTCGGGAACGATTGGTAATTTCTGCAGCAGAAGAAACAAATAACTGATCAAACTC
ACAGCAAAAAAGGTGCTCAAACAAACTAAAACTCTCCACATGACTAAGAGATCGATACACATTAAGCAAATTATGTACCTGAGGTTTCCATCCCATTTCATCAGGACAAC
GAAAAGTGGCTTCATTAAAAGCTCGATGTTCAACAGCAAACCTTGAAGCGCCGCTATTAACTCCCAAGTGAAGCTGAAGCAAGAGTAGTAATAATTTGTACTCATTACTC
AGAACATCAATGAAGAGAACCAACATGATGCAAGGCACACATTTGCAAAAGTTCTGCAGTCAAGGTTTTCTGTTTCTCTTATCAACAATCAAGATTTTGTGAAGAAAATG
ATTTAAATGGACAGAATTATGCTTTTCAAACTAAATCGTCTGTATTTACATGTTCTTATATTCAGAATTTATGATACATATATCCGTAATGCTTAAGATGGTGTCAATTT
CATGTACATGATATTATTCAAAGCCAACCTCCCTATCTTTTTTCTGTACATATAGCTCAACTAACTTACTGTAATGTTTGTAAGGTTTTCAGGCATTATAATAGTATTCC
TGTAAAAAATAACTAACTATACCTCAAAATAAAAGTGAATTAAGATGAACAATATAGACTTCAATATTGAAGCAGGCTTTATCAAATGTAAAGGGGCTGTTCTTTGTCAT
TTTATTCTTCATTGACTTTTCATTATAGACACTGAAGACCAACAACTTCCTAAAAAATAACAGGAAAGTTAAACCAGGACAAGTGGGAGCGATGTGATTTCCAAGTAATT
TATGCTGTAGCTTCCATAGTCCAGTCACAAATACTAATGTGTAAATAAGTAATGAGCAGCAGATTATATATCATAATGAAACATAAAATAAGAGTAGACAGATAAAAGGG
GAAGGTAACTTTCCAGAGATGAGTTAAAGAGGAAACAAGTATTAAATAGATAAAGCTTTGAACTAAATGCAGGAATTACTGACCCATATGATTCTTCTTGAATTAGTAGG
TCCAGAATCGTTCCCCTCTATTGCAGATTCCAACGTCTTCCGTAGCGAATCAAGAGATCCATGCCCGGCAGTCTCAAGAATTGAGCAATTCCCAAGAGTTAGACCTTCTG
GCAAACCATTCTTCTCCATATACTTCTTGAGATTATTCACAATTGTCTCAGTTGGATTGTCAGAAACTCCATGGAACTTCTTAAACCCCGTTACATGAATCGTCACTGAT
GGAGGCCCTTCCGACCCCATTTACCAATGATATTTTCTTGATAACAGTGCTACTACAGAGAGAGTGGCAAAAGGAAAATTGCCCCCAACAATCTGCAAGAT
Protein sequenceShow/hide protein sequence
MPLGKYYCDYCEKQFQDTPFARKRHLHSLSHQKAKALWFDSFRDPNQAFSDGFGRGVCNRFVRTGFCQYGDSCKYFHPKNNPQNSSTQPIAGFPEYSNQASNIPVNQFVE
GSSLTGNLVSNRMGTSWGNLPPSLMPPPEGGYPPLPIVDWG