; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g05370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g05370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRWP-RK domain-containing protein
Genome locationchr8:3949966..3952132
RNA-Seq ExpressionMoc08g05370
SyntenyMoc08g05370
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR044607 - Transcription factor RKD-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585764.1 Protein NLP6, partial [Cucurbita argyrosperma subsp. sororia]2.5e-11782.12Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD+QIPSEVES+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDEDMDQP  EFQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

XP_008445526.1 PREDICTED: protein RKD2-like isoform X1 [Cucumis melo]5.8e-10673.74Show/hide
Query:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN
        ADP    + +  E+ D PY+SSINANILNF NDQNPTL+DLQSTAE            QPA     NMGMVELL  FS+G    DSEAGPSN+ GE   +
Subjt:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN

Query:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC
         +D++ M IPS+VES+LL IWP+TP+PFLCSCCQVLREFLHTNG+NSRKLEIHGRLG+I HAILEHKPIVNVDNI PQYQMFDFC+KS EEIKQFLLQYC
Subjt:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC

Query:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ
        LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDE+M+Q  P+ QNEPP    E+PPRRPSLAAQ
Subjt:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ

XP_022951900.1 uncharacterized protein LOC111454636 [Cucurbita moschata]3.6e-11681.39Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD+QIPSEV+S+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDEDMDQ  P FQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

XP_023002694.1 uncharacterized protein LOC111496477 [Cucurbita maxima]1.5e-11782.12Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD QIPSEVES+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDED+DQP PEFQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

XP_023537902.1 uncharacterized protein LOC111798799 [Cucurbita pepo subsp. pepo]3.0e-11882.48Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD+QIPSEVES+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDEDMDQP PEFQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

TrEMBL top hitse value%identityAlignment
A0A1S3BCG6 protein RKD2-like isoform X12.8e-10673.74Show/hide
Query:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN
        ADP    + +  E+ D PY+SSINANILNF NDQNPTL+DLQSTAE            QPA     NMGMVELL  FS+G    DSEAGPSN+ GE   +
Subjt:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN

Query:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC
         +D++ M IPS+VES+LL IWP+TP+PFLCSCCQVLREFLHTNG+NSRKLEIHGRLG+I HAILEHKPIVNVDNI PQYQMFDFC+KS EEIKQFLLQYC
Subjt:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC

Query:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ
        LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDE+M+Q  P+ QNEPP    E+PPRRPSLAAQ
Subjt:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ

A0A1S3BCY1 uncharacterized protein LOC103488516 isoform X22.0e-10473.38Show/hide
Query:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN
        ADP    + +  E+ D PY+SSINANILNF NDQNPTL+DLQSTAE            QPA     NMGMVELL  FS+G    DSEAGPSN+ GE   +
Subjt:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN

Query:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC
         +D++ M IPS+VES+LL IWP+TP+PFLCSCCQVLREFLHTNG+NSRKLEIHGRLG+I HAILEHKPIVNVDNI PQYQMFDFC+KS EEIKQFLLQYC
Subjt:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC

Query:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ
        LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNS DE+M+Q  P+ QNEPP    E+PPRRPSLAAQ
Subjt:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ

A0A5A7V999 Protein RKD2-like isoform X12.8e-10673.74Show/hide
Query:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN
        ADP    + +  E+ D PY+SSINANILNF NDQNPTL+DLQSTAE            QPA     NMGMVELL  FS+G    DSEAGPSN+ GE   +
Subjt:  ADP--RDIMNPFEEND-PYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG--LDSEAGPSNVHGEGPSN

Query:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC
         +D++ M IPS+VES+LL IWP+TP+PFLCSCCQVLREFLHTNG+NSRKLEIHGRLG+I HAILEHKPIVNVDNI PQYQMFDFC+KS EEIKQFLLQYC
Subjt:  FEDEDDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYC

Query:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ
        LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDE+M+Q  P+ QNEPP    E+PPRRPSLAAQ
Subjt:  LKQILEDYNMIPDPMSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPP----ERPPRRPSLAAQ

A0A6J1GK73 uncharacterized protein LOC1114546361.8e-11681.39Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD+QIPSEV+S+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDEDMDQ  P FQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

A0A6J1KR66 uncharacterized protein LOC1114964777.1e-11882.12Show/hide
Query:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE
        DPR+ M  FE + DPYDSSINANILNFANDQNPTLDDLQSTAE P+ AAR     +   +A +NMGMVELLTHFSFGG  DSEAGPSN  GE  SNFE++
Subjt:  DPRDIMNPFE-ENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGG-LDSEAGPSNVHGEGPSNFEDE

Query:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI
        DD QIPSEVES+LLAIWPVTPVPFLCSCCQVLREFLH+NG+NSRKLEIHGRLG+ICHAILEHKPIVNVDNI PQYQMFDFCDKS EEIKQFLLQYCLKQI
Subjt:  DDMQIPSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQI

Query:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ
        LEDYNMIPDPMSNFYDALCVGIDWFENLN TD FFQ SPDNSEDED+DQP PEFQN+ PE   +P RRPSLAAQ
Subjt:  LEDYNMIPDPMSNFYDALCVGIDWFENLN-TDGFFQLSPDNSEDEDMDQPPPEFQNEPPE---RPPRRPSLAAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16100.1 unknown protein5.8e-1134.04Show/hide
Query:  CSCCQVLREFLH-TNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQILEDYNMIPDPMSNFYDAL
        C+CC++LRE +H   G    KL+I+G +G ICHAIL  + ++  D++  Q  +F     + EE+K+F+  YC +++    +++ D  + FY A+
Subjt:  CSCCQVLREFLH-TNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQILEDYNMIPDPMSNFYDAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATCCGAGGGACATCATGAACCCCTTCGAAGAGAACGATCCATACGACAGTTCCATCAACGCAAATATTCTCAACTTCGCTAACGATCAGAACCCGACG
TTGGACGATTTGCAGTCCACAGCGGAGCCGCCAATGGGGGCGGCGAGGCCCCAACAGTATCAGCAACCGGCCGCGGCGGCGCGTGAGAATATGGGTATGGTGGAG
TTATTGACCCATTTCAGTTTCGGAGGTCTTGATTCGGAAGCGGGGCCCTCTAACGTCCATGGTGAAGGACCGAGCAATTTTGAAGATGAAGATGATATGCAGATT
CCGAGCGAAGTCGAGTCGAGATTGCTCGCGATCTGGCCCGTCACTCCGGTGCCGTTTCTGTGCAGTTGTTGCCAAGTTCTTAGAGAATTTCTCCATACCAATGGC
ATCAACAGCAGAAAACTGGAGATTCATGGTCGACTTGGGATAATCTGCCATGCAATTTTGGAGCATAAACCCATTGTTAATGTGGATAATATAGGTCCTCAGTAC
CAAATGTTCGATTTTTGCGACAAAAGCTCAGAGGAGATTAAGCAGTTTCTTCTACAGTACTGTCTGAAGCAGATTTTGGAGGATTACAATATGATTCCAGATCCA
ATGTCCAACTTCTACGATGCCCTCTGTGTCGGAATTGACTGGTTTGAAAATCTCAACACCGATGGATTCTTTCAACTGTCTCCCGACAATTCTGAAGATGAGGAC
ATGGATCAGCCTCCTCCGGAGTTCCAAAACGAGCCGCCGGAGCGGCCTCCGAGGCGACCCTCTCTTGCCGCACAGGGCGAGAACGGGAAGAATGACGGTGAACGA
TGTGTGGCAATATCTTCATCTGCCCATATCGGAGGCTTCGAAGAAACTCAACGTTTGCAACACTGTGTTGAAGAAGATTTGCCGCCGGAGCGGTCTCAGCCGGTG
GCCTTACAGAAAGATACGGAGTTACGAGAGGCGAATAGCGGCGCTGAGAGCGACGATGAATTCAAGCTATGGAGATACGAGGGCCCGAGCAGAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGATCCGAGGGACATCATGAACCCCTTCGAAGAGAACGATCCATACGACAGTTCCATCAACGCAAATATTCTCAACTTCGCTAACGATCAGAACCCGACG
TTGGACGATTTGCAGTCCACAGCGGAGCCGCCAATGGGGGCGGCGAGGCCCCAACAGTATCAGCAACCGGCCGCGGCGGCGCGTGAGAATATGGGTATGGTGGAG
TTATTGACCCATTTCAGTTTCGGAGGTCTTGATTCGGAAGCGGGGCCCTCTAACGTCCATGGTGAAGGACCGAGCAATTTTGAAGATGAAGATGATATGCAGATT
CCGAGCGAAGTCGAGTCGAGATTGCTCGCGATCTGGCCCGTCACTCCGGTGCCGTTTCTGTGCAGTTGTTGCCAAGTTCTTAGAGAATTTCTCCATACCAATGGC
ATCAACAGCAGAAAACTGGAGATTCATGGTCGACTTGGGATAATCTGCCATGCAATTTTGGAGCATAAACCCATTGTTAATGTGGATAATATAGGTCCTCAGTAC
CAAATGTTCGATTTTTGCGACAAAAGCTCAGAGGAGATTAAGCAGTTTCTTCTACAGTACTGTCTGAAGCAGATTTTGGAGGATTACAATATGATTCCAGATCCA
ATGTCCAACTTCTACGATGCCCTCTGTGTCGGAATTGACTGGTTTGAAAATCTCAACACCGATGGATTCTTTCAACTGTCTCCCGACAATTCTGAAGATGAGGAC
ATGGATCAGCCTCCTCCGGAGTTCCAAAACGAGCCGCCGGAGCGGCCTCCGAGGCGACCCTCTCTTGCCGCACAGGGCGAGAACGGGAAGAATGACGGTGAACGA
TGTGTGGCAATATCTTCATCTGCCCATATCGGAGGCTTCGAAGAAACTCAACGTTTGCAACACTGTGTTGAAGAAGATTTGCCGCCGGAGCGGTCTCAGCCGGTG
GCCTTACAGAAAGATACGGAGTTACGAGAGGCGAATAGCGGCGCTGAGAGCGACGATGAATTCAAGCTATGGAGATACGAGGGCCCGAGCAGAGGCTGA
Protein sequenceShow/hide protein sequence
MADPRDIMNPFEENDPYDSSINANILNFANDQNPTLDDLQSTAEPPMGAARPQQYQQPAAAARENMGMVELLTHFSFGGLDSEAGPSNVHGEGPSNFEDEDDMQI
PSEVESRLLAIWPVTPVPFLCSCCQVLREFLHTNGINSRKLEIHGRLGIICHAILEHKPIVNVDNIGPQYQMFDFCDKSSEEIKQFLLQYCLKQILEDYNMIPDP
MSNFYDALCVGIDWFENLNTDGFFQLSPDNSEDEDMDQPPPEFQNEPPERPPRRPSLAAQGENGKNDGERCVAISSSAHIGGFEETQRLQHCVEEDLPPERSQPV
ALQKDTELREANSGAESDDEFKLWRYEGPSRG