CuGenDBv2

MC04g0030 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0030
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Filamentous hemagglutinin transporter
Genome location: MC04:239428..241323
RNA-Seq Expression: MC04g0030
Synteny: MC04g0030
Gene Ontology terms: NA
InterPro domains: NA
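
The genome location in this record is written as `MC04:239428..241323`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and using hypothetical names (`GenomeLocation`, `parse_location`) that are not part of CuGenDB, the string could be split into a chromosome and an interval like this:

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A chromosome name plus a start/end interval."""
    chromosome: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings of the form "<chromosome>:<start>..<end>".
_LOCATION = re.compile(r"^(?P<chrom>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string into a GenomeLocation."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["chrom"], start, end)


loc = parse_location("MC04:239428..241323")
print(loc.chromosome, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 1896 bases on MC04.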


Homology
GenBank top hits | e value | % identity | Alignment
XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus] | 2.41e-138 | 87.6
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYKD+DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ Q  MINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSS--STSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL SS  ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA  GKEIQEEEAAE+
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSS--STSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV

Query:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo] | 5.65e-137 | 86.45
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYK++DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ Q  MINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL   PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA   KEIQEEEAAE
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE

Query:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        +RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata] | 1.50e-121 | 78.49
Query:  MAAEVSSLVRVLAG-----YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ
        MAA+VS+L+RVLAG     Y DEDNRT LGNGQ+  T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   HQ
Subjt:  MAAEVSSLVRVLAG-----YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ

Query:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK
        M NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    K
Subjt:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK

Query:  EIQEEE-AAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EIQEEE A E RN       P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EIQEEE-AAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima] | 1.28e-123 | 81.08
Query:  MAAEVSSLVRVLAG--YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSL+RVLAG  Y DEDNRT LGNGQ+  T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ  HQM N
Subjt:  MAAEVSSLVRVLAG--YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ
        QTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    KEIQ
Subjt:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ

Query:  EEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EEE A E RN    P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

XP_023553772.1 uncharacterized protein DDB_G0280205 [Cucurbita pepo subsp. pepo] | 2.06e-121 | 79.47
Query:  MAAEVSSLVRVLAG------YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQH
        MAA+VS+L+RVLAG      Y DEDNRT LGNGQ+  T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   H
Subjt:  MAAEVSSLVRVLAG------YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQH

Query:  QMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG
        QM NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWKS+SS S SYSSSSS+AA    
Subjt:  QMINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG

Query:  KEIQEEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        KEIQEEE A E RN T  P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  KEIQEEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K3J9 Uncharacterized protein | 1.16e-138 | 87.6
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYKD+DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ Q  MINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSS--STSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL SS  ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA  GKEIQEEEAAE+
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSS--STSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEV

Query:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  RNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC103489640 | 2.74e-137 | 86.45
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAAEVSSL+RVLAGYK++DNRT LGNGQD STALVTRDLLGQSS L ++QELDLDLQVPTGWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ Q  MINQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE
        ESKFQDLNFPPSPSKRTLNLF+ETSLDLKL   PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSSSAAA   KEIQEEEAAE
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKL---PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAE

Query:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        +RN   P+AVGCPGCLSYVLVMKNNPRCPRCNSVVPLP++KKPRIDLNMSI
Subjt:  VRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC111437042 | 2.59e-120 | 78.23
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT
        MAA+VSSL+RVLAGYKD+DNR  L    ++STAL TRDLLGQSS LA++QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSKQRQ  M NQT
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQT

Query:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRN
        ESK QDLNFPPSPSKRTLNLF+ETSLDL L +SST+YASVCTLDKVKSALERADKEL+KKRS+LWK  S SSPS           A KEIQEEEAAE R 
Subjt:  ESKFQDLNFPPSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRN

Query:  WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
           P+AVGCPGCLSYVLV KNNPRCPRCNSVVPLPS+KKPRIDLN+SI
Subjt:  WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC111454876 | 7.27e-122 | 78.49
Query:  MAAEVSSLVRVLAG-----YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ
        MAA+VS+L+RVLAG     Y DEDNRT LGNGQ+  T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSK   HQ
Subjt:  MAAEVSSLVRVLAG-----YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQ

Query:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK
        M NQTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    K
Subjt:  MINQTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGK

Query:  EIQEEE-AAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EIQEEE A E RN       P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EIQEEE-AAEVRN----WTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC111468450 | 6.19e-124 | 81.08
Query:  MAAEVSSLVRVLAG--YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSL+RVLAG  Y DEDNRT LGNGQ+  T LVTRDLLGQSS LA +QELDLDLQVP+GWEKRLDLKSGKVYIQRS+TPDSP++SDSKQ  HQM N
Subjt:  MAAEVSSLVRVLAG--YKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ
        QTESKFQDLNFPPSPSKRTLNLF+ETSLDLKL       PS ST+YASVCTLDKVKSALERADKELVKKRSSLWK  S+SSPSYSSSSS+AA    KEIQ
Subjt:  QTESKFQDLNFPPSPSKRTLNLFSETSLDLKL-------PSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQ

Query:  EEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
        EEE A E RN    P+AVGC GCLSYVLV KNNPRCPRCNSVVPL SMKKPRIDLNMSI
Subjt:  EEE-AAEVRNWTT-PIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G16500.1 unknown protein | 6.1e-47 | 45.04
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLA-------ETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQ
        MAA+VSSLVR+L+ +KD+        G   + AL+TRDLLG    +        ++ ELDLD+QVP GWEKRLDLKSGKVY+Q+     S  SS      
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLA-------ETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQ

Query:  H--QMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKL-----------------PSSSTSY-ASVCTLDKVKSALERADKELVKKRSSLWK
        H     NQT  +FQDLN PP     P+K  L+LF    +TSL+LKL                 P+ S SY +SVCTLDKVK ALERA+K+  K++     
Subjt:  H--QMINQTESKFQDLNFPP----SPSKRTLNLF---SETSLDLKL-----------------PSSSTSY-ASVCTLDKVKSALERADKELVKKRSSLWK

Query:  SSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI
         S      Y  ++S+  AA                 + +A GCPGCLSYV V KNNP+CPRC+S VPLP+MKKP+IDLN+S+
Subjt:  SSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKKPRIDLNMSI

AT1G79160.1 unknown protein | 4.7e-47 | 48.86
Query:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLL--GQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN
        MAA+VSSLVR+L+GYKD+        G   S AL+TRDLL  G+      + ELDLDLQVPTG+EKRLDLKSGKVY+QR  +  S   +++ Q      N
Subjt:  MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLL--GQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMIN

Query:  QTESKFQDLNFPPSPSKRT--LNLFSETSLDLK-LPSSSTS------YASVCTLDKVKSALERADKE--LVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG
        QT   FQDLNFPP     +  LNLF +T+ +LK LPSS +S        SVCTLDKVKSALERA+++  + KKR       S     Y    + A A   
Subjt:  QTESKFQDLNFPPSPSKRT--LNLFSETSLDLK-LPSSSTS------YASVCTLDKVKSALERADKE--LVKKRSSLWKSSSSSSPSYSSSSSSAAAAAG

Query:  KEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS---MKKPRIDLNMSI
                       +P+  GCPGCLSYVLVM NNP+CPRC+++VPLP+    KKP+IDLN+SI
Subjt:  KEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPS---MKKPRIDLNMSI

AT3G11600.1 unknown protein | 1.7e-04 | 33.82
Query:  SPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKK
        SP+ S+++S ++  + +  QEE        T+ + VGCP CL YV++  ++P+CP+C S V L  +++
Subjt:  SPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPSMKK


Sequences
CDS sequence
ATGGCTGCCGAAGTCAGTAGTCTAGTACGGGTACTCGCCGGCTACAAGGACGAAGATAATCGGACGACGCTGGGGAATGGCCAGGATCAATCAACGGCTCTCGTTACCCG
CGACTTGCTCGGACAATCTTCCAAGCTCGCCGAGACTCAGGAATTGGACCTCGATTTGCAGGTTCCCACCGGCTGGGAAAAGAGACTCGACTTGAAGTCGGGGAAAGTTT
ACATACAGAGAAGTAAAACTCCGGATTCTCCGGTGAGTTCGGATTCGAAGCAACGGCAACACCAAATGATTAATCAGACAGAATCGAAGTTTCAGGATTTGAACTTCCCA
CCGTCTCCTTCGAAACGAACGTTAAATCTGTTCAGCGAAACGAGCTTGGATTTGAAATTGCCGTCGTCGTCCACCAGTTACGCAAGCGTGTGCACTCTGGATAAGGTGAA
ATCGGCGCTGGAGAGGGCGGACAAGGAGCTGGTAAAGAAACGCTCTTCGCTCTGGAAGTCGTCGTCCTCGTCGTCGCCGTCGTACTCTTCGTCGTCTTCGTCGGCGGCGG
CGGCGGCGGGGAAAGAAATTCAGGAAGAAGAAGCGGCGGAAGTCAGGAACTGGACGACGCCGATTGCGGTGGGTTGCCCGGGATGTTTATCTTATGTATTGGTAATGAAA
AACAATCCGAGGTGTCCTCGTTGCAACTCTGTTGTTCCATTGCCCAGTATGAAGAAACCTCGGATTGATCTAAACATGTCCATATAA
mRNA sequence
GAATATATGAGAATATAAGTGGCATCAAAATAAATAACAAAAAATACTTAAAGATAGAGAGAAAAAACAAACAAACAAAGGGGGGCATTAATTGACGCAAATAAGAGTAG
TAGTAGTAATTGTCATTGGCATTAAATGCCCCAATTAATTCTCTTCCACCACTCTTCCACTTTCCCATCATTAATCAAATTTGAACAATTAATTTCCATTTCACTCCCAC
CATTAATATTTTCAAAAATTAAAATTGTTTTTTTTTTCTTTTTAACTTAATTAATTAATTTAGTTCATAACGGCCTCGGCTTGCATTATTTACTAGATTACTGTCTTATA
TTTCATTGTGGTCTCCATAGAGTTACAAAGAGAGACAGCCCACGCCCACCCCTCCTTCTCCTTCTCTCTCTCCCCTTCATTTTCTTCTTTTTTATTTCTTCAGAATCAGA
AATCAGAAAACCTTCAAAAAGGGGGTTTCTCTATAAAGCTCCAAAAATGGCTGCCGAAGTCAGTAGTCTAGTACGGGTACTCGCCGGCTACAAGGACGAAGATAATCGGA
CGACGCTGGGGAATGGCCAGGATCAATCAACGGCTCTCGTTACCCGCGACTTGCTCGGACAATCTTCCAAGCTCGCCGAGACTCAGGAATTGGACCTCGATTTGCAGGTT
CCCACCGGCTGGGAAAAGAGACTCGACTTGAAGTCGGGGAAAGTTTACATACAGAGAAGTAAAACTCCGGATTCTCCGGTGAGTTCGGATTCGAAGCAACGGCAACACCA
AATGATTAATCAGACAGAATCGAAGTTTCAGGATTTGAACTTCCCACCGTCTCCTTCGAAACGAACGTTAAATCTGTTCAGCGAAACGAGCTTGGATTTGAAATTGCCGT
CGTCGTCCACCAGTTACGCAAGCGTGTGCACTCTGGATAAGGTGAAATCGGCGCTGGAGAGGGCGGACAAGGAGCTGGTAAAGAAACGCTCTTCGCTCTGGAAGTCGTCG
TCCTCGTCGTCGCCGTCGTACTCTTCGTCGTCTTCGTCGGCGGCGGCGGCGGCGGGGAAAGAAATTCAGGAAGAAGAAGCGGCGGAAGTCAGGAACTGGACGACGCCGAT
TGCGGTGGGTTGCCCGGGATGTTTATCTTATGTATTGGTAATGAAAAACAATCCGAGGTGTCCTCGTTGCAACTCTGTTGTTCCATTGCCCAGTATGAAGAAACCTCGGA
TTGATCTAAACATGTCCATATAAAAAAAAAAAAAAAATTATGTTGGGATTTGGAATCAAATCATATCATCATTATCATATATAGGTTGTCCAAACCAAAATTAAGAACGT
TGCCTCTTTTCATCCATCTTCAGTTTAAT
Protein sequence
MAAEVSSLVRVLAGYKDEDNRTTLGNGQDQSTALVTRDLLGQSSKLAETQELDLDLQVPTGWEKRLDLKSGKVYIQRSKTPDSPVSSDSKQRQHQMINQTESKFQDLNFP
PSPSKRTLNLFSETSLDLKLPSSSTSYASVCTLDKVKSALERADKELVKKRSSLWKSSSSSSPSYSSSSSSAAAAAGKEIQEEEAAEVRNWTTPIAVGCPGCLSYVLVMK
NNPRCPRCNSVVPLPSMKKPRIDLNMSI
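
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate_cds` and `CODON_TABLE` are hypothetical names for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Codons containing characters other
    than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate_cds("ATGGCTGCCGAAGTC"))  # MAAEV
```

Passing the full CDS text to `translate_cds` and comparing the result with the listed protein sequence (after removing line breaks) is one way to confirm that the two records agree.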