CuGenDBv2

Moc11g16690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g16690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr11:12738309..12743930
RNA-Seq Expression: Moc11g16690
Synteny: Moc11g16690
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily
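
The CCHC-type zinc finger annotation (IPR001878) can be spot-checked against the protein sequence given on this page. The following is a minimal Python sketch, assuming the commonly described zinc-knuckle spacing C-X2-C-X4-H-X4-C. The regular expression and the helper name `find_cchc_motifs` are illustrative assumptions, not the InterPro method, which uses profile-based models rather than a simple pattern. The fragment is the second line of the protein sequence as displayed in this record.

```python
import re

# Assumed zinc-knuckle spacing C-X2-C-X4-H-X4-C (illustrative only;
# InterPro assigns IPR001878 with profile models, not this regex).
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

# Second displayed line of the Moc11g16690 protein sequence.
FRAGMENT = (
    "IMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEK"
    "STRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKA"
)


def find_cchc_motifs(protein: str) -> list[str]:
    """Return every substring that matches the assumed CCHC spacing."""
    return CCHC_PATTERN.findall(protein.upper())


if __name__ == "__main__":
    for motif in find_cchc_motifs(FRAGMENT):
        print(motif)
```

A regex hit of this kind is only a quick sanity check; it says nothing about whether the domain is functional.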


Homology
GenBank top hits (e value, % identity, alignment)
GAV75366.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cephalotus follicularis] (e value: 7.1e-61; % identity: 51.63)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D+LEVT+EGTNQVKESKISMLVH YELF M  NESI++MFTRFT I+N+LK LGK Y   E VRKILR LPK W PKVTAI+EAKDLS LPLE+L+GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFK----KRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLR
        THE  MK +  ++VKKKK++A K++  +  SES+E   ++++  +SK+FKK  K    K+ F KK  SQD   E S ++   CYECKK GH ++ECP L+
Subjt:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFK----KRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLR

Query:  K-----SSSRRNKKAMKATWDESDEGS--DSESGEEVANLCFMAFGDEDDDDEVCFE-NPSFDELMDAYNEIKEEFEK-------LASDELEMPFENLNF
        K         + KKAM ATW  SD  S  + ES EEV NL  MA  ++  +DE   E N +FDEL +AY ++  E+E        L  + + M  E  N 
Subjt:  K-----SSSRRNKKAMKATWDESDEGS--DSESGEEVANLCFMAFGDEDDDDEVCFE-NPSFDELMDAYNEIKEEFEK-------LASDELEMPFENLNF

Query:  KDNNSE
        K  NS+
Subjt:  KDNNSE

GAV81585.1 zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein, partial [Cephalotus follicularis] (e value: 3.2e-61; % identity: 53.33)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D+LEVT+EGTNQVKESKISMLVH YELF M  NE+I++MFTRFT I+N+LK LGK Y+  E VRKILR L KSW PKVTAI EAKDLS LPLE+L+GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS
        THE  MK + + +VKKKKT+AL++     + E  E   + +LA ++ +FKK  K +   K         E S ++   CYECKK  H +S+CP L+K   
Subjt:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS

Query:  RRNK--KAMKATWDESDEG-SDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKL
         ++K  KA  ATWD+SD   S+ ES EEVAN+ FMAF +E+++D+V F   +FDEL +AY  +  E+E +
Subjt:  RRNK--KAMKATWDESDEG-SDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKL

XP_022143648.1 uncharacterized protein LOC111013509 [Momordica charantia] (e value: 3.1e-96; % identity: 93.43)
Query:  LKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF
        + GLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEI+MKGNMEEDVKKKK+LALKSTSFQ ASESEEELNEEELAYLSK+F
Subjt:  LKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF

Query:  KKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDDDDEVCFEN
        KKHFKKRHFPKKTNSQDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD GSDSESGEEVANLCFMAFGDEDDD+EVC  +
Subjt:  KKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDDDDEVCFEN

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] (e value: 1.0e-83; % identity: 63.84)
Query:  LEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTH
        LE+THEGTNQVKESKISM VHNYELFKMDANE+IT+MFTRFTNI+NALKGLGK YTT+ENVRKILRSLPK+WE KVTAIQEAKDL+KLPLEELIGSLMTH
Subjt:  LEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTH

Query:  EIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKK--TNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS
        EIIMK ++E++ KKKK++ALK+ S +   E E+ L+E+++AY S+++K   K++ + KK  +  +++KGEKS +D +ICYECK++GH+R++CPLL KSS 
Subjt:  EIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKK--TNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS

Query:  RRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDAYNEIKEEFEKLAS
        +  KKAMKATWD+S E S+SE  EE+ANL  MA  D+DD  DD+V  E  S DEL + +  ++ + EKL+S
Subjt:  RRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDD--DDEVCFENPSFDELMDAYNEIKEEFEKLAS

XP_038895919.1 uncharacterized protein LOC120084093 [Benincasa hispida] (e value: 7.0e-93; % identity: 75.79)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        DML+VTHEGTNQVKESKISMLVHNY+LFKMDANE+I+EMFTRFTNIVN LKGLGK YTT+ENVRKILRSLPKSWE KVT IQEAKDLSKLPLEEL+GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS
         HEIIMK N+EEDVKKKK L LKST  Q  SE+E ELN+EE AYL+K+FKKHF+KR+F KK N+Q+ KGEKS RD IICYECKK GHV  + P  RK+ S
Subjt:  THEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSS

Query:  RRNKKAMKATWDESDEGSDSESGEEVANLCFMAF-GDEDDDDEVCFENPSFD
        ++++KAMKAT DESDE S+ ES E VANLC MAF  D+DDDDEV  EN +FD
Subjt:  RRNKKAMKATWDESDEGSDSESGEEVANLCFMAF-GDEDDDDEVCFENPSFD

TrEMBL top hits (e value, % identity, alignment)
A0A2N9HKZ7 CCHC-type domain-containing protein (e value: 7.4e-64; % identity: 44.44)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D LEVT+EGTNQVKESK++MLVH YELF M  +E+I+EM TRFTNIVN+LK LGK YT  ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL
        T+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R FPKK  +   KGE S  +   CY+CKK GH ++ECP 
Subjt:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL

Query:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN
        + K   +  KKA+K TWD+SDE SDS+   S  EVANLC                    +AF D++            DEVC  + S  +   +D+    
Subjt:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN

Query:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR
         +  +  K  S  L+    N+ F DN+   ++           +E + +P    E LPK W+   +HPKELIIG+I  G+  R +++
Subjt:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR

A0A2N9I7S8 CCHC-type domain-containing protein (e value: 1.9e-67; % identity: 48.71)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D LEVT+EGTNQVKESK+SMLVH YELF M  +E+I+EM TRFTNIVN LK LGK YT  ENVRKILRSLPK WE K TAI EA+DL  L L+EL GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL
        T+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A ++++FKK  KK     R FPKK  +   +GE S  +  ICY+CKK GH ++ECP 
Subjt:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL

Query:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLA----SDELEMPFENLNFKDNNSE
        + K + +  KKA+KATWD+SDE SDS+   S  E+ANLC + + +E D  E   E  SF  L  A+N+ +   E L      DEL    + +  K   ++
Subjt:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLA----SDELEMPFENLNFKDNNSE

Query:  VVVNETQQ-----VPEETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR
        V   + ++        E LPK W+    HPKELIIG+I  G+  R +++
Subjt:  VVVNETQQ-----VPEETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR

A0A2N9IDJ4 CCHC-type domain-containing protein (e value: 7.4e-64; % identity: 44.44)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D LEVT+EGTNQVKESK++MLVH YELF M  +E+I+EM TRFTNIVN+LK LGK YT  ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL
        T+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R FPKK  +   KGE S  +   CY+CKK GH ++ECP 
Subjt:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL

Query:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN
        + K   +  KKA+K TWD+SDE SDS+   S  EVANLC                    +AF D++            DEVC  + S  +   +D+    
Subjt:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN

Query:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR
         +  +  K  S  L+    N+ F DN+   ++           +E + +P    E LPK W+   +HPKELIIG+I  G+  R +++
Subjt:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR

A0A2N9IFL9 CCHC-type domain-containing protein (e value: 7.4e-64; % identity: 44.44)
Query:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM
        D LEVT+EGTNQVKESK++MLVH YELF M  +E+I+EM TRFTNIVN+LK LGK YT  ENVRKILRSLPK WE K+TAI EA+DL  L LEEL GSLM
Subjt:  DMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLM

Query:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL
        T+E+ M   + EE+VK KK  ALKS+     +  EE   EEE+A +++ FKK  KK     R FPKK  +   KGE S  +   CY+CKK GH ++ECP 
Subjt:  THEIIMKGNM-EEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKK-----RHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPL

Query:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN
        + K   +  KKA+K TWD+SDE SDS+   S  EVANLC                    +AF D++            DEVC  + S  +   +D+    
Subjt:  LRKSSSRRNKKAMKATWDESDEGSDSE---SGEEVANLCF-------------------MAFGDEDD----------DDEVCFENPSFDE--LMDA--YN

Query:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR
         +  +  K  S  L+    N+ F DN+   ++           +E + +P    E LPK W+   +HPKELIIG+I  G+  R +++
Subjt:  EIKEEFEKLASDELEMPFENLNFKDNNSEVVV-----------NETQQVP---EETLPKDWSFSIHHPKELIIGDISQGLLNRPRIR

A0A6J1CR79 uncharacterized protein LOC111013509 (e value: 1.5e-96; % identity: 93.43)
Query:  LKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF
        + GLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEI+MKGNMEEDVKKKK+LALKSTSFQ ASESEEELNEEELAYLSK+F
Subjt:  LKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEIIMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRF

Query:  KKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDDDDEVCFEN
        KKHFKKRHFPKKTNSQDAKGEK+TRD+IICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESD GSDSESGEEVANLCFMAFGDEDDD+EVC  +
Subjt:  KKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKATWDESDEGSDSESGEEVANLCFMAFGDEDDDDEVCFEN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
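
The alignments in this section use the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough illustration of how identity counts come out of that midline, here is a small Python sketch. The function name `midline_stats` is hypothetical, and the example input is only the final short block of the first GenBank hit (GAV75366.1), so its ratio is not expected to reproduce the full-alignment % identity reported for that hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives for one BLAST alignment block."""
    # Trailing spaces in the midline are often lost when text is copied,
    # so pad it back to the length of the query line.
    midline = midline.ljust(len(query))
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
    }


if __name__ == "__main__":
    # Last block of the GAV75366.1 alignment: query line and midline.
    print(midline_stats("KDNNSE", "K  NS+"))
```

To obtain a figure comparable to the reported % identity, the counts would have to be summed over every block of a hit and divided by the total alignment length, gap columns included.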

Sequences
CDS sequence
ATGGATATGCTTGAAGTAACTCATGAAGGAACAAATCAAGTAAAAGAGTCAAAAATTAGCATGCTTGTTCATAATTATGAATTGTTTAAAATGGATGCTAACGAG
TCTATTACTGAGATGTTTACTAGATTTACCAATATTGTTAATGCTTTGAAAGGTTTAGGAAAGGAATATACTACAGCTGAGAACGTGAGGAAGATTCTAAGATCT
CTACCAAAAAGTTGGGAACCCAAAGTGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAGCTCATTGGATCTCTCATGACGCATGAGATA
ATCATGAAAGGTAATATGGAAGAGGACGTCAAAAAGAAGAAAACCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTCTGAAAGTGAGGAAGAACTAAACGAG
GAAGAACTTGCCTACCTATCCAAGAGGTTCAAGAAGCATTTCAAGAAGAGGCACTTTCCAAAGAAAACCAACAGTCAAGATGCTAAAGGAGAAAAGAGCACTAGA
GACGTCATCATCTGCTACGAGTGCAAAAAGGCTGGACATGTAAGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAGAAGGCTATGAAAGCT
ACTTGGGATGAAAGTGATGAAGGAAGTGATTCTGAAAGTGGAGAGGAAGTAGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGACGACGACGAGGTATGT
TTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGATTAAAGAAGAATTTGAAAAATTAGCTAGTGATGAGTTAGAAATGCCTTTTGAGAATTTG
AACTTTAAAGATAATAACTCGGAAGTTGTAGTAAATGAGACTCAACAAGTGCCAGAGGAAACCCTTCCTAAGGATTGGAGTTTCTCAATTCACCATCCGAAGGAG
TTAATTATAGGTGATATATCCCAAGGGCTTCTGAATAGGCCACGTATTCGCCCCTCCGTAGGCGTAGGTCCACTAAGAACCGATTCTCAGTGGGGATGTTGTCAA
GCAGAAGGCGTGTCTGCTCACGACGGGACAAACACCACAAACAGTTACGTCCCTACATACCCACAGTTGGCAGTGCGTGCTATCGGCACGATGGCCGCATCCAAC
GGCCCTAGGCGACATGCTCAGGCAACGCAAGCAGCGGGGCTACTCTGCACTGTCAGGGATGCACCCCTCTCTATCATACCGTACCGCGTGCGACGCCCCTGCCAC
CTCCAACTGTCGTCCTTCTTGGTCGGCCCTTCAAGGCTATAG
mRNA sequence
ATGGATATGCTTGAAGTAACTCATGAAGGAACAAATCAAGTAAAAGAGTCAAAAATTAGCATGCTTGTTCATAATTATGAATTGTTTAAAATGGATGCTAACGAG
TCTATTACTGAGATGTTTACTAGATTTACCAATATTGTTAATGCTTTGAAAGGTTTAGGAAAGGAATATACTACAGCTGAGAACGTGAGGAAGATTCTAAGATCT
CTACCAAAAAGTTGGGAACCCAAAGTGACAGCAATTCAAGAAGCGAAGGATCTCTCAAAACTTCCATTGGAGGAGCTCATTGGATCTCTCATGACGCATGAGATA
ATCATGAAAGGTAATATGGAAGAGGACGTCAAAAAGAAGAAAACCTTGGCATTAAAGTCTACATCCTTCCAAGGGGCTTCTGAAAGTGAGGAAGAACTAAACGAG
GAAGAACTTGCCTACCTATCCAAGAGGTTCAAGAAGCATTTCAAGAAGAGGCACTTTCCAAAGAAAACCAACAGTCAAGATGCTAAAGGAGAAAAGAGCACTAGA
GACGTCATCATCTGCTACGAGTGCAAAAAGGCTGGACATGTAAGATCTGAATGCCCTCTACTACGTAAATCATCTTCAAGGAGAAACAAGAAGGCTATGAAAGCT
ACTTGGGATGAAAGTGATGAAGGAAGTGATTCTGAAAGTGGAGAGGAAGTAGCTAATCTTTGTTTCATGGCTTTTGGAGATGAAGATGACGACGACGAGGTATGT
TTTGAAAATCCTTCTTTTGATGAGCTTATGGATGCTTACAACGAGATTAAAGAAGAATTTGAAAAATTAGCTAGTGATGAGTTAGAAATGCCTTTTGAGAATTTG
AACTTTAAAGATAATAACTCGGAAGTTGTAGTAAATGAGACTCAACAAGTGCCAGAGGAAACCCTTCCTAAGGATTGGAGTTTCTCAATTCACCATCCGAAGGAG
TTAATTATAGGTGATATATCCCAAGGGCTTCTGAATAGGCCACGTATTCGCCCCTCCGTAGGCGTAGGTCCACTAAGAACCGATTCTCAGTGGGGATGTTGTCAA
GCAGAAGGCGTGTCTGCTCACGACGGGACAAACACCACAAACAGTTACGTCCCTACATACCCACAGTTGGCAGTGCGTGCTATCGGCACGATGGCCGCATCCAAC
GGCCCTAGGCGACATGCTCAGGCAACGCAAGCAGCGGGGCTACTCTGCACTGTCAGGGATGCACCCCTCTCTATCATACCGTACCGCGTGCGACGCCCCTGCCAC
CTCCAACTGTCGTCCTTCTTGGTCGGCCCTTCAAGGCTATAG
Protein sequence
MDMLEVTHEGTNQVKESKISMLVHNYELFKMDANESITEMFTRFTNIVNALKGLGKEYTTAENVRKILRSLPKSWEPKVTAIQEAKDLSKLPLEELIGSLMTHEI
IMKGNMEEDVKKKKTLALKSTSFQGASESEEELNEEELAYLSKRFKKHFKKRHFPKKTNSQDAKGEKSTRDVIICYECKKAGHVRSECPLLRKSSSRRNKKAMKA
TWDESDEGSDSESGEEVANLCFMAFGDEDDDDEVCFENPSFDELMDAYNEIKEEFEKLASDELEMPFENLNFKDNNSEVVVNETQQVPEETLPKDWSFSIHHPKE
LIIGDISQGLLNRPRIRPSVGVGPLRTDSQWGCCQAEGVSAHDGTNTTNSYVPTYPQLAVRAIGTMAASNGPRRHAQATQAAGLLCTVRDAPLSIIPYRVRRPCH
LQLSSFLVGPSRL
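
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below is a minimal illustration, assuming the standard nuclear code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are introduced here for illustration, and only the first 33 nucleotides and the last displayed line of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    cds_start = "ATGGATATGCTTGAAGTAACTCATGAAGGAACA"
    cds_end = "CTCCAACTGTCGTCCTTCTTGGTCGGCCCTTCAAGGCTATAG"
    print(translate(cds_start))
    print(translate(cds_end))
```

The two translated fragments can be compared by eye with the beginning and end of the protein sequence. The trailing `*` corresponds to the TAG stop codon, which is not part of the listed protein.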