CuGenDBv2

MS023477 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS023477
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: scaffold787:166712..168026
RNA-Seq Expression: MS023477
Synteny: MS023477
Gene Ontology terms: NA
InterPro domains: NA
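
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits it into its parts and derives the length of the genomic span. The helper name `parse_location` is hypothetical, and the length assumes both coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location string copied from the record above.
LOCATION = "scaffold787:166712..168026"


def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location(LOCATION)

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the span is 1315 bp, longer than the 816 nt CDS listed further down in the record.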


Homology
GenBank top hits: e value, % identity, alignment
KAA0038455.1 uncharacterized protein E6C27_scaffold119G00220 [Cucumis melo var. makuwa] (e value: 3.1e-102; % identity: 74.63)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGM S ET+L +YQ+KQSPIA KKVALRDV NDNR+V+YNYPE SCSLGGKL+NGSKLSG+KRSN PTCSP S  HQSFKG+GVNEH  YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRA G  TSCAPAFSS LA SPM FSPVR+S PIFTEK GNFL V+GSNLL IPP  E+L S  SNGITDE+RTERLFNLQKLLKH D+SD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEADRFMFFRERDPKDEGFEYSGQHSVTSETSFVRLVNND
        QKG IE LHGLPPSELS  AINLEKRSM+L            ERDP DEGFEYS Q SV SE SFV L+NND
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEADRFMFFRERDPKDEGFEYSGQHSVTSETSFVRLVNND

XP_022153185.1 uncharacterized protein LOC111020739 [Momordica charantia] (e value: 3.0e-126; % identity: 99.14)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
        MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG

Query:  EVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
        EVDTKP KKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPP+SEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
Subjt:  EVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
Subjt:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

XP_022945996.1 uncharacterized protein LOC111450215 [Cucurbita moschata] (e value: 2.2e-100; % identity: 82.48)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRALG  TSCAPAFSS LAASPM FSPVR+SLPIFTEK GNFL V+GSNLL I P  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

XP_022999755.1 uncharacterized protein LOC111494010 [Cucurbita maxima] (e value: 4.4e-101; % identity: 82.48)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL NGGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRALG  TSCAP FSS LAASP+ FSPVR+SLPIFTEK GNFL V+GSNLL IPP  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

XP_038884507.1 uncharacterized protein LOC120075309 [Benincasa hispida] (e value: 8.3e-100; % identity: 82.91)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMYQ KQSPIA KKVALRDV NDNR++MYNYPE SCSLGGKLVNGSKLSG+KRSN PT SP S  HQSFKG+GVNEH  YAS
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRA G  TSCAPAFSSLLAASPM  SPVR+SLPIFTEK GNFL V+GS+LL IPP SE+L SV SNGITDE+RTERLFNLQK LKHCDESD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        +KG IEFLHGLPPSELS  AINLEKRSMNLSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

TrEMBL top hits: e value, % identity, alignment
A0A1S3CMI7 uncharacterized protein LOC103502625 isoform X2 (e value: 3.3e-94; % identity: 78.21)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGM S ET+L +YQ+KQSPIA KKVALRDV NDNR+++YNYPE SCSLGGKL+NGSKLSG+KRSN PTCSP S  HQSFKG+GVNEH  YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRA G  TSCAPAFSS LA SPM FSPVR+S PIFTEK GNFL V+GSNLL IPP  E+L S  SNGITDE+RTERLFNLQKLLKH D+SD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        QKG IE LHGLPPSELS  AINLEKRSM+LSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

A0A5A7TAR2 Uncharacterized protein (e value: 1.5e-102; % identity: 74.63)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGM S ET+L +YQ+KQSPIA KKVALRDV NDNR+V+YNYPE SCSLGGKL+NGSKLSG+KRSN PTCSP S  HQSFKG+GVNEH  YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRA G  TSCAPAFSS LA SPM FSPVR+S PIFTEK GNFL V+GSNLL IPP  E+L S  SNGITDE+RTERLFNLQKLLKH D+SD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEADRFMFFRERDPKDEGFEYSGQHSVTSETSFVRLVNND
        QKG IE LHGLPPSELS  AINLEKRSM+L            ERDP DEGFEYS Q SV SE SFV L+NND
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEADRFMFFRERDPKDEGFEYSGQHSVTSETSFVRLVNND

A0A6J1DG32 uncharacterized protein LOC111020739 (e value: 1.5e-126; % identity: 99.14)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
        MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASG

Query:  EVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
        EVDTKP KKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPP+SEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ
Subjt:  EVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQ

Query:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
Subjt:  KGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

A0A6J1G2G3 uncharacterized protein LOC111450215 (e value: 1.1e-100; % identity: 82.48)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL +GGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRALG  TSCAPAFSS LAASPM FSPVR+SLPIFTEK GNFL V+GSNLL I P  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

A0A6J1KE02 uncharacterized protein LOC111494010 (e value: 2.1e-101; % identity: 82.48)
Query:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS
        MIDSKL NGGMGS ETHLSMY++KQSPIA+KKVALRDV NDNR+V+YNYPE SCSLGGKLVNGSK+SG+KRSN PTCSP S  HQSFKG+GVNEHI YA+
Subjt:  MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDV-NDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYAS

Query:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD
        GEVD KPGKKRALG  TSCAP FSS LAASP+ FSPVR+SLPIFTEK GNFL V+GSNLL IPP  E+LHSV SNGITDE+RTERLFNLQ LLKHCDESD
Subjt:  GEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESD

Query:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        QKG+IE LHGLPPSELS+LAINLEK+SM+LSVEE
Subjt:  QKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e value: 7.5e-06; % identity: 46.15)
Query:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEA
        ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EEA
Subjt:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEA

AT2G45250.2 Integral membrane protein hemolysin-III homolog (e value: 2.8e-05; % identity: 45.1)
Query:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
        ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE
Subjt:  ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE
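
The % identity reported for a hit can be cross-checked against the match line of its alignment. The sketch below does this for the AT2G45250.2 alignment, assuming the usual BLAST-style convention that a letter on the match line marks an identical residue, `+` marks a conservative substitution, and identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is hypothetical.

```python
# Query and match lines copied from the AT2G45250.2 alignment above.
QUERY = "ERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEE"
MATCH = "ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EE"


def percent_identity(match_line, aligned_length):
    """Identical columns (letters on the match line) as a percentage of the alignment length."""
    identities = sum(ch.isalpha() for ch in match_line)
    return 100 * identities / aligned_length


# This alignment has no gap columns, so the query length equals the alignment length.
identity = percent_identity(MATCH, len(QUERY))
print(round(identity, 1))
```

For this alignment the count is 23 identical columns out of 51, which rounds to the 45.1 listed for the hit.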

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e value: 9.8e-06; % identity: 27.96)
Query:  PEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLG
        PEG+     K      +S      PP  SP +    S + V V   +     EVDT                   S  AAS    +P  T  P       
Subjt:  PEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPGKKRALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLG

Query:  NFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEA
                  L IP +     +  S+ +  E   ER  +LQ LL   ++SD+  +++ L  L  +ELSK A++LEKRS+  S+EEA
Subjt:  NFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAINLEKRSMNLSVEEA


Sequences
CDS sequence
ATGATTGATAGCAAGTTGACCAATGGTGGGATGGGTAGCGGTGAGACTCATTTATCTATGTATCAGAACAAGCAATCACCAATTGCACTGAAAAAGGTTGCTTTAAGGGA
TGTGAACGATAATAGAAATGTCATGTATAACTACCCGGAAGGTTCTTGTTCTTTGGGCGGAAAACTTGTTAATGGAAGTAAGCTCTCAGGGACCAAGAGATCCAACCCCC
CTACATGCTCACCTGGCTCTGTAACCCATCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATATTTTATGCAAGCGGAGAAGTTGACACCAAACCCGGAAAAAAAAGA
GCATTGGGATGTGGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCCTCCCCAATGACATTTTCACCGGTTAGGACTTCACTTCCCATTTTCACCGAAAAGCT
CGGTAATTTTCTGACAGTTTCTGGATCCAATCTTTTGAGTATCCCTCCTAATTCGGAGGTTCTTCACTCTGTTGCTTCAAATGGGATTACTGATGAGCGGAGAACAGAAC
GTTTATTCAATTTGCAGAAGCTCCTCAAACATTGTGACGAGTCGGATCAAAAAGGTTACATTGAGTTTCTACATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATT
AATCTTGAGAAGAGATCAATGAATCTGTCAGTAGAAGAAGCTGACAGGTTTATGTTCTTCAGGGAAAGAGATCCAAAGGATGAAGGCTTTGAATATTCTGGGCAACATTC
AGTGACATCTGAGACAAGTTTTGTTAGATTAGTGAATAACGACCTT
mRNA sequence
ATGATTGATAGCAAGTTGACCAATGGTGGGATGGGTAGCGGTGAGACTCATTTATCTATGTATCAGAACAAGCAATCACCAATTGCACTGAAAAAGGTTGCTTTAAGGGA
TGTGAACGATAATAGAAATGTCATGTATAACTACCCGGAAGGTTCTTGTTCTTTGGGCGGAAAACTTGTTAATGGAAGTAAGCTCTCAGGGACCAAGAGATCCAACCCCC
CTACATGCTCACCTGGCTCTGTAACCCATCAATCCTTCAAGGGGGTTGGTGTAAATGAGCACATATTTTATGCAAGCGGAGAAGTTGACACCAAACCCGGAAAAAAAAGA
GCATTGGGATGTGGCACATCTTGTGCACCTGCATTTTCTTCTCTTCTTGCAGCCTCCCCAATGACATTTTCACCGGTTAGGACTTCACTTCCCATTTTCACCGAAAAGCT
CGGTAATTTTCTGACAGTTTCTGGATCCAATCTTTTGAGTATCCCTCCTAATTCGGAGGTTCTTCACTCTGTTGCTTCAAATGGGATTACTGATGAGCGGAGAACAGAAC
GTTTATTCAATTTGCAGAAGCTCCTCAAACATTGTGACGAGTCGGATCAAAAAGGTTACATTGAGTTTCTACATGGTTTACCTCCATCTGAGCTCAGCAAACTTGCCATT
AATCTTGAGAAGAGATCAATGAATCTGTCAGTAGAAGAAGCTGACAGGTTTATGTTCTTCAGGGAAAGAGATCCAAAGGATGAAGGCTTTGAATATTCTGGGCAACATTC
AGTGACATCTGAGACAAGTTTTGTTAGATTAGTGAATAACGACCTT
Protein sequence
MIDSKLTNGGMGSGETHLSMYQNKQSPIALKKVALRDVNDNRNVMYNYPEGSCSLGGKLVNGSKLSGTKRSNPPTCSPGSVTHQSFKGVGVNEHIFYASGEVDTKPGKKR
ALGCGTSCAPAFSSLLAASPMTFSPVRTSLPIFTEKLGNFLTVSGSNLLSIPPNSEVLHSVASNGITDERRTERLFNLQKLLKHCDESDQKGYIEFLHGLPPSELSKLAI
NLEKRSMNLSVEEADRFMFFRERDPKDEGFEYSGQHSVTSETSFVRLVNNDL
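
The CDS and protein sequences above can be checked against each other by translation. The following is a minimal sketch, assuming the standard genetic code and using only the first ten codons of the CDS for brevity. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence codon by codon ('*' marks a stop codon)."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS and first 10 residues of the protein, copied from the record above.
CDS_PREFIX = "ATGATTGATAGCAAGTTGACCAATGGTGGG"
PROTEIN_PREFIX = "MIDSKLTNGG"

print(translate(CDS_PREFIX) == PROTEIN_PREFIX)
```

The same function can be applied to the full CDS once its lines are joined into a single string.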