; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g00340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g00340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionFilamentous hemagglutinin transporter
Genome locationchr4:238793..240086
RNA-Seq ExpressionMoc04g00340
SyntenyMoc04g00340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus]3.2e-10687.6Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYKD+DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSK  Q QMINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPS--SSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL S  SST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSS  AAAGKEIQEEEAAE+
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPS--SSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV

Query:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo]2.7e-10586.45Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYK++DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSK  Q QMINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL   PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA   KEIQEEEAAE
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE

Query:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        +RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata]1.1e-9378.49Show/hide
Query:  MAAEVSSLVRVLA-----GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ
        MAA+VS+L+RVLA     GY DEDNRT LGNGQ + T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   HQ
Subjt:  MAAEVSSLVRVLA-----GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ

Query:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK
        M NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    K
Subjt:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK

Query:  EIQ-EEEAAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EIQ EEEA E RN       P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EIQ-EEEAAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima]3.3e-9581.08Show/hide
Query:  MAAEVSSLVRVL--AGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSL+RVL  AGY DEDNRT LGNGQ + T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK  QHQM N
Subjt:  MAAEVSSLVRVL--AGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ
        QTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    KEIQ
Subjt:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ

Query:  -EEEAAEVRN-WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
         EEEA E RN    P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  -EEEAAEVRN-WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_023553772.1 uncharacterized protein DDB_G0280205 [Cucurbita pepo subsp. pepo]1.1e-9379.47Show/hide
Query:  MAAEVSSLVRVLA------GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQH
        MAA+VS+L+RVLA      GY DEDNRT LGNGQ + T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   H
Subjt:  MAAEVSSLVRVLA------GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQH

Query:  QMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG
        QM NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWKS+SS S SYSSSSS+AA    
Subjt:  QMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG

Query:  KEIQ-EEEAAEVRNWT-TPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        KEIQ EEEA E RN T  P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  KEIQ-EEEAAEVRNWT-TPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

TrEMBL top hitse value%identityAlignment
A0A0A0K3J9 Uncharacterized protein1.5e-10687.6Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYKD+DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSK  Q QMINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPS--SSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL S  SST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSS  AAAGKEIQEEEAAE+
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPS--SSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV

Query:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC1034896401.3e-10586.45Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYK++DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSK  Q QMINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL   PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA   KEIQEEEAAE
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE

Query:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        +RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC1114370429.7e-9377.82Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAA+VSSL+RVLAGYKD+DNR  L    ++STAL TRDLLGQSS LA++QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSKQR  QM NQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRN
        ESK QDLNFPPSPSKRTLNLF+ETSLDL L +SST+YASVCTLDKVKSALERADKEL+KKRS+LWKS SS S             A KEIQEEEAAE R 
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRN

Query:  WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
           P+AVGCPGCLSYVLV KNNPRCPRCNSVVPLPS+KKPRIDLN+SI
Subjt:  WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC1114548765.1e-9478.49Show/hide
Query:  MAAEVSSLVRVLA-----GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ
        MAA+VS+L+RVLA     GY DEDNRT LGNGQ + T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   HQ
Subjt:  MAAEVSSLVRVLA-----GYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ

Query:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK
        M NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    K
Subjt:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK

Query:  EIQ-EEEAAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EIQ EEEA E RN       P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EIQ-EEEAAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC1114684501.6e-9581.08Show/hide
Query:  MAAEVSSLVRVL--AGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSL+RVL  AGY DEDNRT LGNGQ + T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK  QHQM N
Subjt:  MAAEVSSLVRVL--AGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ
        QTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    KEIQ
Subjt:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ

Query:  -EEEAAEVRN-WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
         EEEA E RN    P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  -EEEAAEVRN-WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16500.1 unknown protein6.1e-4745.04Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLA-------ETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQ
        MAA+VSSLVR+L+ +KD+        G   + AL+TRDLLG    +        ++ ELDLD+QVP GWEKRLDLKSGKVY+Q+     S  SS      
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLA-------ETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQ

Query:  H--QMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKL-----------------PSSSTSY-ASVCTLDKVKSALERADKELVKKRSSLWK
        H     NQT  +FQDLN PP     P+K  L+LF    +TSL+LKL                 P+ S SY +SVCTLDKVK ALERA+K+  K++     
Subjt:  H--QMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKL-----------------PSSSTSY-ASVCTLDKVKSALERADKELVKKRSSLWK

Query:  SSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
         S      Y  ++S+  AA                 + +A GCPGCLSYV V KNNP+CPRC+S VPLP+MKKP+IDLN+S+
Subjt:  SSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

AT1G79160.1 unknown protein4.7e-4748.86Show/hide
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLL--GQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSLVR+L+GYKD+        G   S AL+TRDLL  G+      + ELDLDLQVPTG+EKRLDLKSGKVY+QR  +  S   +++ Q      N
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLL--GQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRT--LNLFSETSLDLK-LPSSSTS------YASVCTLDKVKSALERADKE--LVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG
        QT   FQDLNFPP     +  LNLF +T+ +LK LPSS +S        SVCTLDKVKSALERA+++  + KKR       S     Y    + A A   
Subjt:  QTESKFQDLNFPPSPSKRT--LNLFSETSLDLK-LPSSSTS------YASVCTLDKVKSALERADKE--LVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG

Query:  KEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS---MKKPRIDLNMSI
                       +P+  GCPGCLSYVLVM NNP+CPRC+++VPLP+    KKP+IDLN+SI
Subjt:  KEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS---MKKPRIDLNMSI

AT3G11600.1 unknown protein1.7e-0433.82Show/hide
Query:  SPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKK
        SP+ S+++S ++  + +  QEE        T+ + VGCP CL YV++  ++P+CP+C S V L  +++
Subjt:  SPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCGAAGTCAGTAGTCTAGTACGGGTACTCGCCGGCTACAAGGACGAAGATAATCGGACGACGCTGGGGAATGGCCAGGATCAATCAACGGCTCTCGTTACCCG
CGACTTGCTCGGACAATCTTCCAAGCTCGCCGAGACTCAGGAATTGGACCTCGATTTGCAGGTTCCCACCGGCTGGGAAAAGAGACTCGACTTGAAGTCGGGGAAAGTTT
ACATACAGAGAAGTAAAACTCCGGATTCTCCGGTGAGTTCGGATTCGAAGCAACGGCAACACCAAATGATTAATCAGACAGAATCGAAGTTTCAGGATTTGAACTTCCCA
CCGTCTCCTTCGAAACGAACGTTAAATCTGTTCAGCGAAACGAGCTTGGATTTGAAATTGCCGTCGTCGTCCACCAGTTACGCAAGCGTGTGCACTCTGGATAAGGTGAA
ATCGGCGCTGGAGAGGGCGGACAAGGAGCTGGTAAAGAAACGCTCTTCGCTCTGGAAGTCGTCGTCCTCGTCGTCGCCGTCGTACTCTTCGTCGTCTTCGTCGGCGGCGG
CGGCGGCGGGGAAAGAAATTCAGGAAGAAGAAGCGGCGGAAGTCAGGAACTGGACGACGCCGATTGCGGTGGGTTGCCCGGGATGTTTATCTTATGTATTGGTAATGAAA
AACAATCCGAGGTGTCCTCGTTGCAACTCTGTTGTTCCATTGCCCAGTATGAAGAAACCTCGGATTGATCTAAACATGTCCATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCGAAGTCAGTAGTCTAGTACGGGTACTCGCCGGCTACAAGGACGAAGATAATCGGACGACGCTGGGGAATGGCCAGGATCAATCAACGGCTCTCGTTACCCG
CGACTTGCTCGGACAATCTTCCAAGCTCGCCGAGACTCAGGAATTGGACCTCGATTTGCAGGTTCCCACCGGCTGGGAAAAGAGACTCGACTTGAAGTCGGGGAAAGTTT
ACATACAGAGAAGTAAAACTCCGGATTCTCCGGTGAGTTCGGATTCGAAGCAACGGCAACACCAAATGATTAATCAGACAGAATCGAAGTTTCAGGATTTGAACTTCCCA
CCGTCTCCTTCGAAACGAACGTTAAATCTGTTCAGCGAAACGAGCTTGGATTTGAAATTGCCGTCGTCGTCCACCAGTTACGCAAGCGTGTGCACTCTGGATAAGGTGAA
ATCGGCGCTGGAGAGGGCGGACAAGGAGCTGGTAAAGAAACGCTCTTCGCTCTGGAAGTCGTCGTCCTCGTCGTCGCCGTCGTACTCTTCGTCGTCTTCGTCGGCGGCGG
CGGCGGCGGGGAAAGAAATTCAGGAAGAAGAAGCGGCGGAAGTCAGGAACTGGACGACGCCGATTGCGGTGGGTTGCCCGGGATGTTTATCTTATGTATTGGTAATGAAA
AACAATCCGAGGTGTCCTCGTTGCAACTCTGTTGTTCCATTGCCCAGTATGAAGAAACCTCGGATTGATCTAAACATGTCCATATAA
Protein sequenceShow/hide protein sequence
MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQTESKFQDLNFP
PSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMK
NNPRCPRCNSVVPLPSMKKPRIDLNMSI