CuGenDBv2

Moc08g33280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g33280
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integral membrane protein hemolysin-III homolog
Genome location: chr8:24172425..24173696
RNA-Seq Expression: Moc08g33280
Synteny: Moc08g33280
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153185.1 uncharacterized protein LOC111020739 [Momordica charantia] (e value: 1.0e-136; % identity: 100)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
        MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG

Query:  EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
        EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
Subjt:  EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
Subjt:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

XP_022945996.1 uncharacterized protein LOC111450215 [Cucurbita moschata] (e value: 1.5e-108; % identity: 82.87)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAPAFSS LAASPM FSPVR+SLPIFTEK GNFL V+GSNLL I P  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

XP_022999755.1 uncharacterized protein LOC111494010 [Cucurbita maxima] (e value: 3.1e-109; % identity: 82.87)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL NGGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAP FSS LAASP+ FSPVR+SLPIFTEK GNFL V+GSNLL IPP  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

XP_023545576.1 uncharacterized protein LOC111804962 [Cucurbita pepo subsp. pepo] (e value: 5.8e-108; % identity: 82.47)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNY E SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GV EHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAPAFSS LAASPM FSPVR+SLPIFTEK GNFL V+GSNLL IPP  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida] (e value: 4.5e-108; % identity: 83.27)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMYQ KQSPIA KKVALRDV NDNR++MYNYPE SCSLGGKLVNGSKLSG+KRSN PT SP S  HQSFKG+GVNEH  YAS
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRA G  TSCAPAFSSLLAASPM  SPVR+SLPIFTEK GNFL V+GS+LL IPP SE+L SV SNGITDE+RTERLFNLQK LKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        +KG IEFLHGLPPSELS  AINLEKRSMNLSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CME9 uncharacterized protein LOC103502625 isoform X1 (e value: 2.2e-100; % identity: 78.17)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGM S ET+L +YQ+KQSPIA KKVALRDV NDNR+++YNYPE SCSLGGKL+NGSKLSG+KRSN PTCSP S  HQSFKG+GVNEH  YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRA G  TSCAPAFSS LA SPM FSPVR+S PIFTEK GNFL V+GSNLL IPP  E+L S  SNGITDE+RTERLFNLQKLLKH D+SD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEF-LHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG IE  LHGLPPSELS  AINLEKRSM+LSVEEGKEIQRMKALNIL N+Q
Subjt:  QKGYIEF-LHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 (e value: 8.8e-102; % identity: 78.49)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGM S ET+L +YQ+KQSPIA KKVALRDV NDNR+++YNYPE SCSLGGKL+NGSKLSG+KRSN PTCSP S  HQSFKG+GVNEH  YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRA G  TSCAPAFSS LA SPM FSPVR+S PIFTEK GNFL V+GSNLL IPP  E+L S  SNGITDE+RTERLFNLQKLLKH D+SD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG IE LHGLPPSELS  AINLEKRSM+LSVEEGKEIQRMKALNIL N+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

A0A6J1DG32 uncharacterized protein LOC111020739 (e value: 5.0e-137; % identity: 100)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
        MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG

Query:  EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
        EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
Subjt:  EVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
Subjt:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

A0A6J1G2G3 uncharacterized protein LOC111450215 (e value: 7.5e-109; % identity: 82.87)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAPAFSS LAASPM FSPVR+SLPIFTEK GNFL V+GSNLL I P  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

A0A6J1KE02 uncharacterized protein LOC111494010 (e value: 1.5e-109; % identity: 82.87)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL NGGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KP KKRALG  TSCAP FSS LAASP+ FSPVR+SLPIFTEK GNFL V+GSNLL IPP  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEEGKEIQRMKALNILGN+Q
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILGNIQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e value: 2.1e-10; % identity: 47.69)
Query:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILG
        ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE +E+QR+ ALN+LG
Subjt:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILG

AT2G45250.2 Integral membrane protein hemolysin-III homolog (e value: 2.6e-05; % identity: 45.1)
Query:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE
Subjt:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e value: 2.7e-10; % identity: 30.15)
Query:  PEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLG
        PEG+     K      +S      PP  SP +    S + V V   +     EVDT                   S  AAS    +P  T  P       
Subjt:  PEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPRKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLG

Query:  NFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILG
                  L IP S     +  S+ +  E   ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE +E+QR+ ALN+LG
Subjt:  NFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEGKEIQRMKALNILG
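
The % identity figures listed for each hit can be checked against the alignment midline. The sketch below assumes the usual BLAST convention (a letter on the middle line marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that the midline is copied with its spacing intact. The function name `percent_identity` is illustrative and not part of CuGenDB.

```python
def percent_identity(midline: str) -> float:
    """Return % identity for one BLAST-style alignment block.

    Assumption: every character of the midline is one alignment column;
    letters are identical residues, '+' and spaces are not.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the AT2G45250.1 alignment, copied from the record above.
MIDLINE = "ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE +E+QR+ ALN+LG"

pct = percent_identity(MIDLINE)
print(f"{pct:.2f}")  # 31 identical columns out of 65
```

For this block the result rounds to 47.69, which agrees with the value reported for AT2G45250.1. For hits whose alignment spans several blocks, the midlines would need to be concatenated first.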


Sequences
CDS sequence
ATGATTGATAGCAAGTTGACCAATGGTGGGATGGGTAGCGGTGAGACTCATTTATCTATGTATCAGAACAAGCAATCACCAATTGCACTGAAAAAGGTTGCTTTAAGGGA
TGTGAACGATAATAGAAATGTCATGTATAACTACCCGGAAGGTTCTTGTTCTTTGGGCGGAAAACTTGTTAATGGAAGTAAGCTCTCAGGGACCAAGAGATCCAACCCCC
CTACATGCTCACCTGGCTCTGTAACCCATCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATATTTTATGCAAGCGGAGAAGTTGACACCAAACCCAGAAAAAAAAGA
GCATTGGGATGTGGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCCTCCCCAATGACATTTTCACCGGTTAGGACTTCACTTCCCATTTTCACCGAAAAGCT
CGGTAATTTTCTGACAGTTTCTGGATCCAATCTTTTGAGTATCCCTCCTAGTTCGGAGGTTCTTCACTCTGTTGCTTCAAATGGGATTACTGATGAGCGGAGAACAGAAC
GTTTATTCAATTTGCAGAAGCTCCTCAAACATTGTGACGAGTCGGATCAAAAAGGTTACATTGAGTTTCTACATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATT
AATCTTGAGAAGAGATCAATGAATCTGTCAGTAGAAGAAGGGAAAGAGATCCAAAGGATGAAGGCTTTGAATATTCTGGGCAACATTCAGTGA
mRNA sequence
ATGATTGATAGCAAGTTGACCAATGGTGGGATGGGTAGCGGTGAGACTCATTTATCTATGTATCAGAACAAGCAATCACCAATTGCACTGAAAAAGGTTGCTTTAAGGGA
TGTGAACGATAATAGAAATGTCATGTATAACTACCCGGAAGGTTCTTGTTCTTTGGGCGGAAAACTTGTTAATGGAAGTAAGCTCTCAGGGACCAAGAGATCCAACCCCC
CTACATGCTCACCTGGCTCTGTAACCCATCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATATTTTATGCAAGCGGAGAAGTTGACACCAAACCCAGAAAAAAAAGA
GCATTGGGATGTGGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCCTCCCCAATGACATTTTCACCGGTTAGGACTTCACTTCCCATTTTCACCGAAAAGCT
CGGTAATTTTCTGACAGTTTCTGGATCCAATCTTTTGAGTATCCCTCCTAGTTCGGAGGTTCTTCACTCTGTTGCTTCAAATGGGATTACTGATGAGCGGAGAACAGAAC
GTTTATTCAATTTGCAGAAGCTCCTCAAACATTGTGACGAGTCGGATCAAAAAGGTTACATTGAGTTTCTACATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATT
AATCTTGAGAAGAGATCAATGAATCTGTCAGTAGAAGAAGGGAAAGAGATCCAAAGGATGAAGGCTTTGAATATTCTGGGCAACATTCAGTGA
Protein sequence
MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPRKKR
ALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPSSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAI
NLEKRSMNLSVEEGKEIQRMKALNILGNIQ
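
The protein sequence should be the translation of the CDS sequence. A minimal Python sketch of that check follows, assuming the standard genetic code (NCBI translation table 1). The helper `translate` and the constant names are illustrative, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: not included in the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nt and last 15 nt of the CDS sequence listed above.
print(translate("ATGATTGATAGCAAGTTGACCAATGGTGGGATGGGTAGCGGTGAGACTCAT"))
print(translate("GGCAACATTCAGTGA"))
```

The two calls print `MIDSKLTNGGMGSGETH` and `GNIQ`, matching the start and end of the protein sequence. Passing the full CDS (line breaks are ignored) would allow the whole protein to be compared in the same way.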