; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr1:2284101..2294036
RNA-Seq ExpressionMoc01g03560
SyntenyMoc01g03560
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV67516.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cephalotus follicularis]5.2e-5342.51Show/hide
Query:  LFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSWEPK
        LFCA+  +EFNR+S+C+SAK++WD+L                              E I +MFTRFT I+N+LK+LGK Y   E VRKILR LP SW PK
Subjt:  LFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSWEPK

Query:  VTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKKTNSQDAKGEKSTRDVI
        VTAI+EAKDL  LPLE+LIGSLMTHE  MK +   + KKKK++ALK++  +  SES+E   + +++ ++ +FK   K +  +K       K E S ++ +
Subjt:  VTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKKTNSQDAKGEKSTRDVI

Query:  ICYECKKAGHVRSECPLLRK-------SSSRRNKKAMKATWDESDEGR-DSESGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEF-----
        ICYECKK  H +S+CP L+K       +   + KK+M +TWD+SDE R D ES  EVAN+ FM   +E++ DEV F   SFDEL DAY     E+     
Subjt:  ICYECKKAGHVRSECPLLRK-------SSSRRNKKAMKATWDESDEGR-DSESGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEF-----

Query:  --EKLASDELEMPFENLNFKDNNSEVV
          + L  D + +  E    K+ NS  +
Subjt:  --EKLASDELEMPFENLNFKDNNSEVV

KAG6639461.1 hypothetical protein CIPAW_10G102300 [Carya illinoinensis]1.4e-5345.75Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML-----ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSL
        MN L+ AL  +EFNR+  C +AKEIWD L     ESI+ M TRFTNI+N+L +LGK Y+  E VRKIL SLPK WE KVTAI +A+DL KL + ELIGSL
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML-----ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSL

Query:  MTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELN--EEELAYLSKRFKKHFKK-RHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLR
        +THE  +K   EE+ K KK+LALK+   +  S+ +EE +  +EE+A +++R ++  KK R   +K+  + +K +    D +ICY+C K GH++ +CPLL+
Subjt:  MTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELN--EEELAYLSKRFKKHFKK-RHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLR

Query:  KSSSRRNKKAMKATWDESDEGRDSE-SGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEFEKLASDELEMPFENLNFKDNNSEVVVNETQQ
        K  + + KKAMKATWD+     DSE S EE ANLC MA  D +  +    ENPS++EL +   E  EEFEKL      +  +N +   N  E++  E+  
Subjt:  KSSSRRNKKAMKATWDESDEGRDSE-SGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEFEKLASDELEMPFENLNFKDNNSEVVVNETQQ

Query:  VPEETL
        + +E L
Subjt:  VPEETL

XP_022143648.1 uncharacterized protein LOC111013509 [Momordica charantia]2.3e-9391.41Show/hide
Query:  LKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF
        +  LGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEI+MKGNMEEDVKKKK+LALKSTSFQ ASESEEELNEEELAYLSK+F
Subjt:  LKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF

Query:  KKHFKKRHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDDNDEVCFEN
        KKHFKKRHF KKTNSQDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD G DSESGEEVANLCFMAFGDEDD++EVC  +
Subjt:  KKHFKKRHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDDNDEVCFEN

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]1.3e-7253.51Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        +NCL+CAL  DEFNR+S C+SA+EIW+ L                              E+IT+MFTRFTNI+NALK LGK YTT+ENVRKILRSLPK+W
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKK--TNSQDAKGEKS
        E KVTAIQEAKDL+KLPLEELIGSLMTHEIIMK ++E++ KKKK++ALK+ S +   E E+ L+E+++AY S+++K   K++ + KK  +  +++KGEKS
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKK--TNSQDAKGEKS

Query:  TRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDD--NDEVCFENPSFDELMDAYNETKEEFEKLAS
         +D +ICYECK++GH+R++CPLL KSS +  KKAMKATWD+S E  +SE  EE+ANL  MA  D+DD  +D+V  E  S DEL + +   + + EKL+S
Subjt:  TRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDD--NDEVCFENPSFDELMDAYNETKEEFEKLAS

XP_038895919.1 uncharacterized protein LOC120084093 [Benincasa hispida]1.9e-8465.11Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        MNCLFC LC +EFN++SACNSAKEIWDML                              E+I+EMFTRFTNIVN LK LGK YTT+ENVRKILRSLPKSW
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKKTNSQDAKGEKSTR
        E KVT IQEAKDLSKLPLEEL+GSLM HEIIMK N+EEDVKKKK L LKST  Q  SE+E ELN+EE AYL+K+FKKHF+KR+F KK N+Q+ KGEKS R
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKKTNSQDAKGEKSTR

Query:  DVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAF-GDEDDNDEVCFENPSFD
        D IICYECKK GHV  + P  RK+ S++++KAMKAT DESDE  + ES E VANLC MAF  D+DD+DEV  EN +FD
Subjt:  DVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAF-GDEDDNDEVCFENPSFD

TrEMBL top hitse value%identityAlignment
A0A2N9HED5 CCHC-type domain-containing protein1.7e-5439.42Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        M+ L+CAL   E+NRVS C+SAKEIWD L                              E+I+EM TRFTNIVN+LK+LGK YT  ENVRKILRSLPK W
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK
        E K+TAI EA+DL  L LEEL GSLMT+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R F KK  +   K
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK

Query:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------
        GE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE    ++ S  EVANLC                    +AF D++         
Subjt:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------

Query:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI
           DEVC  + S  +   +D+         K +F  L   +      N+ F DN+   ++           +E + +P    E LPK W+   +H KELI
Subjt:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI

Query:  IGDVSQGVQWR
        IG++  GV  R
Subjt:  IGDVSQGVQWR

A0A2N9I7S8 CCHC-type domain-containing protein4.4e-5842.97Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        M+ L+CAL  +E+NRV  C+SAKEIWD L                              E+I+EM TRFTNIVN LKSLGK YT  ENVRKILRSLPK W
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK
        E K TAI EA+DL  L L+EL GSLMT+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A ++++FKK  KK     R F KK  +   +
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK

Query:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEFEK
        GE S  +  ICY+CKK GH ++ECP + K + +  KKA+KATWD+SDE    ++ S  E+ANLC + + +E D  E   E  SF  L  A+N+ +   E 
Subjt:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEFEK

Query:  LA----SDELEMPFENLNFKDNNSEVVVNETQQ-----VPEETLPKDWSFSIHHLKELIIGDVSQGVQWR
        L      DEL    + +  K   ++V   + ++        E LPK W+    H KELIIG++  GV  R
Subjt:  LA----SDELEMPFENLNFKDNNSEVVVNETQQ-----VPEETLPKDWSFSIHHLKELIIGDVSQGVQWR

A0A2N9IDJ4 CCHC-type domain-containing protein2.3e-5439.42Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        M+ L+CAL   E+NRVS C+SAKEIWD L                              E+I+EM TRFTNIVN+LK+LGK YT  ENVRKILRSLPK W
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK
        E K+TAI EA+DL  L LEEL GSLMT+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R F KK  +   K
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK

Query:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------
        GE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE    ++ S  EVANLC                    +AF D++         
Subjt:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------

Query:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI
           DEVC  + S  +   +D+         K +F  L   +      N+ F DN+   ++           +E + +P    E LPK W+   +H KELI
Subjt:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI

Query:  IGDVSQGVQWR
        IG++  GV  R
Subjt:  IGDVSQGVQWR

A0A2N9IFL9 CCHC-type domain-containing protein2.3e-5439.42Show/hide
Query:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW
        M+ L+CAL   E+NRVS C+SAKEIWD L                              E+I+EM TRFTNIVN+LK+LGK YT  ENVRKILRSLPK W
Subjt:  MNCLFCALCADEFNRVSACNSAKEIWDML------------------------------ESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSW

Query:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK
        E K+TAI EA+DL  L LEEL GSLMT+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R F KK  +   K
Subjt:  EPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFQKKTNSQDAK

Query:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------
        GE S  +   CY+CKK GH ++ECP + K   +  KKA+K TWD+SDE    ++ S  EVANLC                    +AF D++         
Subjt:  GEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDE--GRDSESGEEVANLCF-------------------MAFGDEDD--------

Query:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI
           DEVC  + S  +   +D+         K +F  L   +      N+ F DN+   ++           +E + +P    E LPK W+   +H KELI
Subjt:  --NDEVCFENPSFDE--LMDA-----YNETKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHLKELI

Query:  IGDVSQGVQWR
        IG++  GV  R
Subjt:  IGDVSQGVQWR

A0A6J1CR79 uncharacterized protein LOC1110135091.1e-9391.41Show/hide
Query:  LKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF
        +  LGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEI+MKGNMEEDVKKKK+LALKSTSFQ ASESEEELNEEELAYLSK+F
Subjt:  LKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF

Query:  KKHFKKRHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDDNDEVCFEN
        KKHFKKRHF KKTNSQDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD G DSESGEEVANLCFMAFGDEDD++EVC  +
Subjt:  KKHFKKRHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGEEVANLCFMAFGDEDDNDEVCFEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGTTTGTTTTGTGCTTTATGTGCTGATGAATTTAATCGAGTATCTGCATGTAACTCTGCTAAAGAAATATGGGATATGCTTGAATCTATTACTGAGATGTTTAC
TAGATTTACGAATATTGTTAATGCTTTGAAAAGTTTAGGAAAGGAATATACTACAGCTGAGAACGTGAGGAAGATTCTAAGATCTCTACCAAAAAGTTGGGAACCCAAAG
TGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAGCTCATTGGATCTCTCATGACGCATGAGATAATCATGAAAGGTAATATGGAAGAGGACGTC
AAAAAGAAGAAAACCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTCTGAAAGTGAGGAAGAACTAAACGAGGAAGAACTTGCCTACCTATCCAAGAGGTTCAAGAA
GCATTTCAAGAAGAGACACTTTCAAAAGAAAACCAACAGTCAAGATGCTAAAGGAGAAAAGAGCACTAGAGACGTCATCATTTGCTACGAGTGCAAAAAGGCTGGACATG
TACGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAGAAGGCTATGAAAGCTACTTGGGATGAAAGTGATGAAGGAAGGGATTCTGAAAGTGGAGAG
GAAGTAGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGACAACGACGAGGTATGTTTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGACTAA
AGAAGAATTTGAAAAATTAGCTAGTGATGAGTTAGAAATGCCTTTTGAGAATTTGAACTTTAAAGATAATAACTCGGAAGTTGTAGTAAATGAGACTCAACAAGTGCCAG
AGGAAACCCTTCCTAAGGATTGGAGTTTCTCAATTCACCATCTGAAGGAGTTAATTATAGGTGATGTATCCCAAGGGGTCCAGTGGAGACGTGCGGAACACCATGATTCT
TATCACGCACCTTTCGGCTACACGCCCCTACCGCCACCAGCGCCCATGCACGCACCTCTTGCTACGCGCCCCATGCCTTGCCACCAGTGCACCCCACATCGTGCCACCTG
TGCGTCCGCCTGTCCTCATGCGCGTGCCATTCCGCCAGTGCCCCTCCCCCCAAGGGCGCCTCAACATTTGATGCTGCCACCCTTGGCGCCAAAACCTTGTTCTTTGTCGC
CCACTCTTGGACCATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTGTTTGTTTTGTGCTTTATGTGCTGATGAATTTAATCGAGTATCTGCATGTAACTCTGCTAAAGAAATATGGGATATGCTTGAATCTATTACTGAGATGTTTAC
TAGATTTACGAATATTGTTAATGCTTTGAAAAGTTTAGGAAAGGAATATACTACAGCTGAGAACGTGAGGAAGATTCTAAGATCTCTACCAAAAAGTTGGGAACCCAAAG
TGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAGCTCATTGGATCTCTCATGACGCATGAGATAATCATGAAAGGTAATATGGAAGAGGACGTC
AAAAAGAAGAAAACCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTCTGAAAGTGAGGAAGAACTAAACGAGGAAGAACTTGCCTACCTATCCAAGAGGTTCAAGAA
GCATTTCAAGAAGAGACACTTTCAAAAGAAAACCAACAGTCAAGATGCTAAAGGAGAAAAGAGCACTAGAGACGTCATCATTTGCTACGAGTGCAAAAAGGCTGGACATG
TACGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAGAAGGCTATGAAAGCTACTTGGGATGAAAGTGATGAAGGAAGGGATTCTGAAAGTGGAGAG
GAAGTAGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGACAACGACGAGGTATGTTTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGACTAA
AGAAGAATTTGAAAAATTAGCTAGTGATGAGTTAGAAATGCCTTTTGAGAATTTGAACTTTAAAGATAATAACTCGGAAGTTGTAGTAAATGAGACTCAACAAGTGCCAG
AGGAAACCCTTCCTAAGGATTGGAGTTTCTCAATTCACCATCTGAAGGAGTTAATTATAGGTGATGTATCCCAAGGGGTCCAGTGGAGACGTGCGGAACACCATGATTCT
TATCACGCACCTTTCGGCTACACGCCCCTACCGCCACCAGCGCCCATGCACGCACCTCTTGCTACGCGCCCCATGCCTTGCCACCAGTGCACCCCACATCGTGCCACCTG
TGCGTCCGCCTGTCCTCATGCGCGTGCCATTCCGCCAGTGCCCCTCCCCCCAAGGGCGCCTCAACATTTGATGCTGCCACCCTTGGCGCCAAAACCTTGTTCTTTGTCGC
CCACTCTTGGACCATTTTAG
Protein sequenceShow/hide protein sequence
MNCLFCALCADEFNRVSACNSAKEIWDMLESITEMFTRFTNIVNALKSLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDV
KKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFQKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGRDSESGE
EVANLCFMAFGDEDDNDEVCFENPSFDELMDAYNETKEEFEKLASDELEMPFENLNFKDNNSEVVVNETQQVPEETLPKDWSFSIHHLKELIIGDVSQGVQWRRAEHHDS
YHAPFGYTPLPPPAPMHAPLATRPMPCHQCTPHRATCASACPHARAIPPVPLPPRAPQHLMLPPLAPKPCSLSPTLGPF