CuGenDBv2

Moc04g21410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g21410
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:15580682..15588487
RNA-Seq Expression: Moc04g21410
Synteny: Moc04g21410
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR018710 - Protein of unknown function DUF2232
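
The genome location above can be split into chromosome, start, and end to get the length of the gene region. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr4:15580682..15588487")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr4 15580682 15588487 7806
```

Under that assumption the gene region covers 7806 bp, considerably longer than the 903 nt CDS listed further down the page.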


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145869.1 uncharacterized protein LOC111015219 [Momordica charantia] | e value: 3.3e-158 | % identity: 100
Query:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK
        MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK
Subjt:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL
        TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL
Subjt:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL

Query:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
        CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
Subjt:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM

XP_022929150.1 uncharacterized protein LOC111435817 [Cucurbita moschata] | e value: 8.0e-128 | % identity: 84.64
Query:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN
        MISG +YP  ST  I PP      T  H+HL  P+PLLKISS LRLIS ES+SLSFPT  ASK   KS RFSNSVAKV  Y +EGQN T+ SDLEDLSEN
Subjt:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN

Query:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW
        GVVYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALT+LL+HGLVGLTMGSLWRLGANW
Subjt:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW

Query:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR
        S SIFLCTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TA GVNLIPSM+AIYAIFGTLV+LN GCFMFLLHLLYSIFLTRLGLKTSLTLPR
Subjt:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR

Query:  WLDKAM
        WL+KAM
Subjt:  WLDKAM

XP_022969819.1 uncharacterized protein LOC111468904 [Cucurbita maxima] | e value: 5.2e-127 | % identity: 84.87
Query:  MISGKVYPPYSTPWISPP----TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGV
        MISG +YP  ST  I PP    T  H+HL    PLLKIS+ LRLIS ES+SLSFPT  ASK   KS RFSNSVAKV  Y +EGQN T+ SDLEDLSENGV
Subjt:  MISGKVYPPYSTPWISPP----TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGV

Query:  VYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSI
        VYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALT+LL+HGLVGLTMGSLWRLGANWS 
Subjt:  VYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSI

Query:  SIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWL
        SIFLCTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TA GVNLIPSMSAIYAIFGTLV+LN GCFMFLLHLLYSIFLTRLGLKTSLTLPRWL
Subjt:  SIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWL

Query:  DKAM
        +KAM
Subjt:  DKAM

XP_023549519.1 uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo] | e value: 3.0e-127 | % identity: 84.31
Query:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN
        MISGK+YP  ST  I PP      T +H+HL    PLLKISS LRLIS +S+SLSFPT  ASK   KS RFSNSVAKV  Y +EGQN T+ SDLEDLSEN
Subjt:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN

Query:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW
        GVVYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALT+LL+HGLVGLTMGSLWRLGANW
Subjt:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW

Query:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR
        S SIFLCTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TA GVNLIPSM+AIYAIFGTLV+LN GCFMFLLHLLYSIFLTRLGLKTSLTLPR
Subjt:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR

Query:  WLDKAM
        WL+KAM
Subjt:  WLDKAM

XP_038885169.1 uncharacterized protein LOC120075651 [Benincasa hispida] | e value: 4.0e-127 | % identity: 84.33
Query:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK
        MISGK+YP YS   I PP Q +LHL    PLL+ISS LRLIS +S+SLSFP+  ASK  AKS RFSNSV KV  Y YEGQN    SDLEDLSE+G VYKK
Subjt:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL
        TLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALTYLL+HGLVG TMGSLWRLGANWS SIFL
Subjt:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL

Query:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
        CTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TAWGVNLIPSM+AIYAIFG LV LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWL+KAM
Subjt:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K8H0 Uncharacterized protein | e value: 1.2e-121 | % identity: 80.58
Query:  MISGKVYPPYS---------TPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDL
        MISGK+Y  YS         TP  +P   LHL        LKISS LRLIS +S SLSFP+   SK  AKS RFS+S+ +V  Y YEGQNS T SDL+DL
Subjt:  MISGKVYPPYS---------TPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDL

Query:  SENGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLG
        SENGVVYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLL+LSGPVKALTYLL+HGLVG TMGSLWRLG
Subjt:  SENGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLG

Query:  ANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLT
        ANWS SIFLCTIVRA GAVGYVL SSFLIRENILALITINIHASLTLI TAWGVNLIPSM+AIYAIFGTLV LN GCFMFLLHLLYSIFLTRLGLKTSLT
Subjt:  ANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLT

Query:  LPRWLDKAM
        LPRWL+KAM
Subjt:  LPRWLDKAM

A0A1S4DWP5 uncharacterized protein LOC103488678 isoform X6 | e value: 2.4e-122 | % identity: 80.26
Query:  MISGKVY---------PPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDL
        MISGK+Y         PP  TP  +P   LHL        LKISS LRLIS +S+SLS P+  ASK  AKS RFSNS+ +V  Y YEGQNS T SDL+DL
Subjt:  MISGKVY---------PPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDL

Query:  SENGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLG
        SENGVVYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLL+LSGPVKALTYLL+HGLVG TMGSLWRLG
Subjt:  SENGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLG

Query:  ANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLT
        ANWS SIFLCTIVRA GAVGYVL SSFLIRENIL+LITINIHASLTLI TAWGVNLIPSM+AIYAIFGTLV LN GCFMFLLHLLYS+FLTRLGLKTSLT
Subjt:  ANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLT

Query:  LPRWLDKAM
        LPRWL+KAM
Subjt:  LPRWLDKAM

A0A6J1CVR0 uncharacterized protein LOC111015219 | e value: 1.6e-158 | % identity: 100
Query:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK
        MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK
Subjt:  MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL
        TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL
Subjt:  TLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFL

Query:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
        CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
Subjt:  CTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM

A0A6J1ETG3 uncharacterized protein LOC111435817 | e value: 3.9e-128 | % identity: 84.64
Query:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN
        MISG +YP  ST  I PP      T  H+HL  P+PLLKISS LRLIS ES+SLSFPT  ASK   KS RFSNSVAKV  Y +EGQN T+ SDLEDLSEN
Subjt:  MISGKVYPPYSTPWISPP------TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSEN

Query:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW
        GVVYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALT+LL+HGLVGLTMGSLWRLGANW
Subjt:  GVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANW

Query:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR
        S SIFLCTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TA GVNLIPSM+AIYAIFGTLV+LN GCFMFLLHLLYSIFLTRLGLKTSLTLPR
Subjt:  SISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPR

Query:  WLDKAM
        WL+KAM
Subjt:  WLDKAM

A0A6J1I3S1 uncharacterized protein LOC111468904 | e value: 2.5e-127 | % identity: 84.87
Query:  MISGKVYPPYSTPWISPP----TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGV
        MISG +YP  ST  I PP    T  H+HL    PLLKIS+ LRLIS ES+SLSFPT  ASK   KS RFSNSVAKV  Y +EGQN T+ SDLEDLSENGV
Subjt:  MISGKVYPPYSTPWISPP----TQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGV

Query:  VYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSI
        VYKKTLAMVECSMFAAL+GLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLL+LSGPVKALT+LL+HGLVGLTMGSLWRLGANWS 
Subjt:  VYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSI

Query:  SIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWL
        SIFLCTIVRALGAVGYVL SSFLIRENILALITINIHASLTLI TA GVNLIPSMSAIYAIFGTLV+LN GCFMFLLHLLYSIFLTRLGLKTSLTLPRWL
Subjt:  SIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWL

Query:  DKAM
        +KAM
Subjt:  DKAM

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G26180.1 unknown protein | e value: 5.8e-84 | % identity: 69.49
Query:  SNSVAKVYRYEYEGQNSTTFSDLEDLSE-NGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLIL
        S S A +Y  + +G+   +  +     E + VVY+KTL +VEC+MFAA++GLVYFLSNSLA+ENYFGCFF LPIVISS+RW IA GRKTMVAT +LL IL
Subjt:  SNSVAKVYRYEYEGQNSTTFSDLEDLSE-NGVVYKKTLAMVECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLIL

Query:  SGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLL
        SGPVKALTY L HGLVGL +GSLW +GA+W +SIFLCT+VRALG +GYVLTSSFLIRENILA+ITINIHASL+ + TA G+N++PSMS IY IFGT++LL
Subjt:  SGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFLCTIVRALGAVGYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLL

Query:  NSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
        NSG F+ LLHLLYSIFLTRLG+K+SL LP WLDKA+
Subjt:  NSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
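
The unlabeled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a positively scoring substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives can be counted for a single block as follows. The function name `midline_counts` is hypothetical, and the example uses the short final block `WLDKAM` / `WL+KAM` that appears in several of the Cucurbita hits above.

```python
def midline_counts(query, midline):
    """Return (aligned columns, identities, positives) for one alignment block."""
    # Pad the midline in case trailing spaces were stripped from the text.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return len(query), identities, positives

aligned, identities, positives = midline_counts("WLDKAM", "WL+KAM")
print(aligned, identities, positives)  # 6 5 6
```

The % identity values listed for each hit are computed over the whole alignment, so the counts from every block of a hit would need to be summed before dividing by the total number of aligned columns.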


Sequences
CDS sequence
ATGATTTCTGGGAAGGTTTATCCACCTTACTCCACACCATGGATTTCCCCGCCAACACAGCTTCATCTTCATCTTCGTTCCCCAGTTCCTCTTCTCAAAATCTCC
AGTGGACTCAGATTAATTAGCTCTGAATCCATCTCCCTCTCTTTTCCGACCATTTCTGCTTCTAAACCCTGCGCCAAGTCCGTTAGATTTTCGAATTCAGTGGCA
AAAGTTTATCGCTATGAGTATGAGGGCCAAAACTCGACTACTTTTTCGGACTTGGAAGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTG
GAGTGCTCCATGTTCGCTGCACTTAGTGGCTTGGTCTACTTCTTGAGTAATTCACTTGCTCTTGAGAATTACTTCGGCTGTTTCTTCTGTCTACCAATTGTAATC
TCTTCAATGAGATGGGGCATAGCAGCTGGGAGAAAAACAATGGTGGCGACATTCTTGCTGCTGCTTATTTTGTCTGGTCCAGTGAAAGCTTTAACCTATCTGCTT
AAGCATGGTTTAGTGGGGTTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTATTTCGATCTTCCTGTGCACAATCGTTCGGGCACTCGGGGCAGTG
GGGTATGTCTTAACATCTTCATTCTTGATAAGAGAAAACATACTAGCTCTGATCACTATTAATATTCATGCTTCCCTCACCCTTATCTTGACTGCCTGGGGTGTA
AACTTGATTCCATCAATGAGTGCAATATATGCTATCTTTGGGACGCTGGTATTGCTGAACTCGGGATGCTTCATGTTTTTGCTCCACCTTTTGTACTCCATATTC
CTTACCAGACTTGGCCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGATAAGGCGATGTAA
mRNA sequence
ATGATTTCTGGGAAGGTTTATCCACCTTACTCCACACCATGGATTTCCCCGCCAACACAGCTTCATCTTCATCTTCGTTCCCCAGTTCCTCTTCTCAAAATCTCC
AGTGGACTCAGATTAATTAGCTCTGAATCCATCTCCCTCTCTTTTCCGACCATTTCTGCTTCTAAACCCTGCGCCAAGTCCGTTAGATTTTCGAATTCAGTGGCA
AAAGTTTATCGCTATGAGTATGAGGGCCAAAACTCGACTACTTTTTCGGACTTGGAAGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTG
GAGTGCTCCATGTTCGCTGCACTTAGTGGCTTGGTCTACTTCTTGAGTAATTCACTTGCTCTTGAGAATTACTTCGGCTGTTTCTTCTGTCTACCAATTGTAATC
TCTTCAATGAGATGGGGCATAGCAGCTGGGAGAAAAACAATGGTGGCGACATTCTTGCTGCTGCTTATTTTGTCTGGTCCAGTGAAAGCTTTAACCTATCTGCTT
AAGCATGGTTTAGTGGGGTTGACAATGGGCTCCTTGTGGAGGCTTGGAGCAAATTGGAGTATTTCGATCTTCCTGTGCACAATCGTTCGGGCACTCGGGGCAGTG
GGGTATGTCTTAACATCTTCATTCTTGATAAGAGAAAACATACTAGCTCTGATCACTATTAATATTCATGCTTCCCTCACCCTTATCTTGACTGCCTGGGGTGTA
AACTTGATTCCATCAATGAGTGCAATATATGCTATCTTTGGGACGCTGGTATTGCTGAACTCGGGATGCTTCATGTTTTTGCTCCACCTTTTGTACTCCATATTC
CTTACCAGACTTGGCCTGAAGACTTCATTGACATTGCCAAGGTGGCTGGATAAGGCGATGTAA
Protein sequence
MISGKVYPPYSTPWISPPTQLHLHLRSPVPLLKISSGLRLISSESISLSFPTISASKPCAKSVRFSNSVAKVYRYEYEGQNSTTFSDLEDLSENGVVYKKTLAMV
ECSMFAALSGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLILSGPVKALTYLLKHGLVGLTMGSLWRLGANWSISIFLCTIVRALGAV
GYVLTSSFLIRENILALITINIHASLTLILTAWGVNLIPSMSAIYAIFGTLVLLNSGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLDKAM
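
As a quick consistency check, the CDS can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code and, to stay short, translates only the first 30 and last 15 nucleotides of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGATTTCTGGGAAGGTTTATCCACCTTAC"))  # MISGKVYPPY
print(translate("GATAAGGCGATGTAA"))                 # DKAM*
```

Both fragments agree with the start (`MISGKVYPPY`) and end (`DKAM`) of the protein sequence, with the final `TAA` read as the stop codon.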