; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g17710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g17710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:14038373..14042835
RNA-Seq ExpressionMoc09g17710
SyntenyMoc09g17710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142768.1 uncharacterized protein LOC111012806 [Momordica charantia]4.4e-13283.28Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATF EVLQKAKKVIDGQELLRTKTGR EKQIDQKKL+QEKRKADSKSKDKGSSSS SRTEYRR E G +RSRPYE YT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT
        PTTIPISEILMNIEE  MEKLLKRP++L+GDPEK +K+KYCRFHRN GHN T+CWELKRQIE LIQDG+FKNGGQ GN RKELARE RREVCIIREQKPT
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT

Query:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR
        CSITFGD  LE VHLPHND LVIAP IDHVL+RRVLVD   SANILSLPTYLALGWTRSQLK+S TPLVGFS E VS +GCIDLPVTIGQD T  T +A 
Subjt:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR

Query:  SVPVE
         V V+
Subjt:  SVPVE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.3e-11570.48Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKLREEAPATFAEVLQK KKVIDGQELLRTKTGR EK IDQ +  ++K KADSKS+DKG SSSSSR +YRRS   HN+SRPYE YT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN--GGQHG---------------------
        PTTIPI EIL NIEE+ MEKLLKRPE+LRGDPEK + +KYCRFHR+ GHNT+N WELKRQIE LIQDGYFK   G                         
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN--GGQHG---------------------

Query:  ----NKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSR
            NK+KELAREARREVCIIREQ+PT SI F   DLE VHLPHND LVIAPLID VL+RR+LVDGG SANILSL TYLALGWTRSQLK+S TPLVGFS 
Subjt:  ----NKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSR

Query:  EPVSLDGCIDLPVTIGQ-DTYETDLARSVPVE
        E +SL+GCIDLPV+I Q DT  T +A  V ++
Subjt:  EPVSLDGCIDLPVTIGQ-DTYETDLARSVPVE

XP_022153957.1 uncharacterized protein LOC111021344 [Momordica charantia]1.1e-13082.62Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATFAEVL+KAKKVIDGQELLRTKTGR E+QIDQKKL+QEKRKADSKSKDKGSSSS SRTEYRRSE G +RSRPYERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT
         TTIPISEIL NIEES MEKLLKRPE+LRGD EK +K+KYCRFHR+ GHNTT+CWELKRQIE LIQD YFKNGGQ GNKRK+LAREARREVCIIREQKPT
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT

Query:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR
        C I F D DLE VHLPHND LVIAPLIDHV +RRVLVDG  SANILSLPTYLALGWTR QLK+S TP VGFS E VS +GCIDLPVTIGQD T  T +A 
Subjt:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR

Query:  SVPVE
         V ++
Subjt:  SVPVE

XP_022156024.1 uncharacterized protein LOC111022993 [Momordica charantia]2.2e-12369.55Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATF EVLQKAKKVI GQELLRTKT RLE+QIDQKK +QE+RK DSKSKDKGSSSS  RTEYRRS+IG +RSR YERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN----------------------------
        PTTIPISEIL NIEE+ ME LLKR E+L+GDP+K HK+KYC FHR+ GHNT+NCWELK QIE LIQDGYFK                             
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN----------------------------

Query:  --------GGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRS
                GGQ GNKRKELAREA+REVCIIR+QKPTCSITF D DLE VHLPHND LVIAPLIDHVL+RRVLVDGG SANILSLPTYLALGWTRSQLK+S
Subjt:  --------GGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRS

Query:  STPLVGFSREPVSLDGCIDLPVTIGQDT----YETDLARSVPVEILDSPS---ILEPD
         TPLVGFS E VS +GC+DLPVTIGQD+    Y + L RS    + +  +   +LEP+
Subjt:  STPLVGFSREPVSLDGCIDLPVTIGQDT----YETDLARSVPVEILDSPS---ILEPD

XP_022157474.1 uncharacterized protein LOC111024166 [Momordica charantia]2.9e-12371.59Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLA+ET  VKL EEAPATFAEVLQKAKKVIDGQELLRTKT R EK IDQKKL+QE ++ + KSKDKG SSSSSR EYRRS+ G NRSRPYERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFK-----------------------------
        PTTIPIS IL NIEE+ MEKLLKRP +LRGD EK +K+KYCRFHR+ GHNT++CWELKRQIE LIQD YFK                             
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFK-----------------------------

Query:  -----------NGGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQ
                   +GGQ GNKRKELAREARREVCIIREQKPTCSITFGD DLE VHLPHND LVIAPLIDHVL+RRVLVDGG SANILSLPTYLALGWTRSQ
Subjt:  -----------NGGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQ

Query:  LKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLARSVPVE
        LK+S TPLVGFSRE VS +GCIDLP+TIGQD T  T +A  V ++
Subjt:  LKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLARSVPVE

TrEMBL top hitse value%identityAlignment
A0A6J1CP40 uncharacterized protein LOC1110128062.1e-13283.28Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATF EVLQKAKKVIDGQELLRTKTGR EKQIDQKKL+QEKRKADSKSKDKGSSSS SRTEYRR E G +RSRPYE YT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT
        PTTIPISEILMNIEE  MEKLLKRP++L+GDPEK +K+KYCRFHRN GHN T+CWELKRQIE LIQDG+FKNGGQ GN RKELARE RREVCIIREQKPT
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT

Query:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR
        CSITFGD  LE VHLPHND LVIAP IDHVL+RRVLVD   SANILSLPTYLALGWTRSQLK+S TPLVGFS E VS +GCIDLPVTIGQD T  T +A 
Subjt:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR

Query:  SVPVE
         V V+
Subjt:  SVPVE

A0A6J1DHB3 uncharacterized protein LOC1110204796.3e-11670.48Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKLREEAPATFAEVLQK KKVIDGQELLRTKTGR EK IDQ +  ++K KADSKS+DKG SSSSSR +YRRS   HN+SRPYE YT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN--GGQHG---------------------
        PTTIPI EIL NIEE+ MEKLLKRPE+LRGDPEK + +KYCRFHR+ GHNT+N WELKRQIE LIQDGYFK   G                         
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN--GGQHG---------------------

Query:  ----NKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSR
            NK+KELAREARREVCIIREQ+PT SI F   DLE VHLPHND LVIAPLID VL+RR+LVDGG SANILSL TYLALGWTRSQLK+S TPLVGFS 
Subjt:  ----NKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSR

Query:  EPVSLDGCIDLPVTIGQ-DTYETDLARSVPVE
        E +SL+GCIDLPV+I Q DT  T +A  V ++
Subjt:  EPVSLDGCIDLPVTIGQ-DTYETDLARSVPVE

A0A6J1DKD3 uncharacterized protein LOC1110213445.3e-13182.62Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATFAEVL+KAKKVIDGQELLRTKTGR E+QIDQKKL+QEKRKADSKSKDKGSSSS SRTEYRRSE G +RSRPYERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT
         TTIPISEIL NIEES MEKLLKRPE+LRGD EK +K+KYCRFHR+ GHNTT+CWELKRQIE LIQD YFKNGGQ GNKRK+LAREARREVCIIREQKPT
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPT

Query:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR
        C I F D DLE VHLPHND LVIAPLIDHV +RRVLVDG  SANILSLPTYLALGWTR QLK+S TP VGFS E VS +GCIDLPVTIGQD T  T +A 
Subjt:  CSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLAR

Query:  SVPVE
         V ++
Subjt:  SVPVE

A0A6J1DS14 uncharacterized protein LOC1110229931.1e-12369.55Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLADET TVKL EEAPATF EVLQKAKKVI GQELLRTKT RLE+QIDQKK +QE+RK DSKSKDKGSSSS  RTEYRRS+IG +RSR YERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN----------------------------
        PTTIPISEIL NIEE+ ME LLKR E+L+GDP+K HK+KYC FHR+ GHNT+NCWELK QIE LIQDGYFK                             
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKN----------------------------

Query:  --------GGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRS
                GGQ GNKRKELAREA+REVCIIR+QKPTCSITF D DLE VHLPHND LVIAPLIDHVL+RRVLVDGG SANILSLPTYLALGWTRSQLK+S
Subjt:  --------GGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRS

Query:  STPLVGFSREPVSLDGCIDLPVTIGQDT----YETDLARSVPVEILDSPS---ILEPD
         TPLVGFS E VS +GC+DLPVTIGQD+    Y + L RS    + +  +   +LEP+
Subjt:  STPLVGFSREPVSLDGCIDLPVTIGQDT----YETDLARSVPVEILDSPS---ILEPD

A0A6J1DWK7 uncharacterized protein LOC1110241661.4e-12371.59Show/hide
Query:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT
        MCYFLTGLA+ET  VKL EEAPATFAEVLQKAKKVIDGQELLRTKT R EK IDQKKL+QE ++ + KSKDKG SSSSSR EYRRS+ G NRSRPYERYT
Subjt:  MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYT

Query:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFK-----------------------------
        PTTIPIS IL NIEE+ MEKLLKRP +LRGD EK +K+KYCRFHR+ GHNT++CWELKRQIE LIQD YFK                             
Subjt:  PTTIPISEILMNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFK-----------------------------

Query:  -----------NGGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQ
                   +GGQ GNKRKELAREARREVCIIREQKPTCSITFGD DLE VHLPHND LVIAPLIDHVL+RRVLVDGG SANILSLPTYLALGWTRSQ
Subjt:  -----------NGGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDTLVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQ

Query:  LKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLARSVPVE
        LK+S TPLVGFSRE VS +GCIDLP+TIGQD T  T +A  V ++
Subjt:  LKRSSTPLVGFSREPVSLDGCIDLPVTIGQD-TYETDLARSVPVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTACTTCCTCACCGGCCTAGCCGACGAGACCTTCACTGTGAAGCTCAGAGAGGAGGCTCCAGCAACCTTCGCTGAAGTTCTGCAAAAGGCGAAGAAAGTCATTGA
TGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACTTGAAAAGCAAATCGACCAGAAGAAGCTAAGCCAAGAGAAGAGGAAGGCCGATTCTAAGTCTAAGGACAAGGGAT
CGTCCTCCTCTAGCAGCAGAACCGAGTATCGTCGGTCGGAGATCGGCCACAATCGAAGCCGACCTTACGAGCGTTACACTCCGACCACCATCCCTATTTCTGAAATACTT
ATGAACATCGAGGAAAGCAGGATGGAAAAACTCCTCAAGCGACCTGAGCAGCTCCGAGGAGACCCAGAAAAGCTTCACAAGGAAAAGTATTGCCGTTTTCATCGCAATCA
AGGCCATAATACAACGAATTGCTGGGAGCTGAAGCGCCAAATTGAAGGTCTCATTCAAGATGGCTATTTCAAAAATGGGGGCCAGCATGGAAATAAAAGGAAAGAACTAG
CTCGCGAGGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAGCCTACTTGCTCCATTACCTTCGGCGATATCGACCTGGAGGAAGTTCATTTGCCTCATAATGATACA
CTTGTGATCGCTCCTCTCATTGATCACGTCTTGATCCGAAGAGTACTGGTAGATGGAGGCGTATCTGCCAACATCCTATCCCTCCCAACATATCTAGCATTGGGGTGGAC
CAGGTCACAGTTGAAGAGAAGTTCAACACCCTTAGTTGGATTCTCTAGAGAACCGGTCTCCCTTGATGGGTGTATCGACTTGCCGGTTACGATTGGGCAAGATACGTATG
AGACCGATCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGCCCTTCAATCTTGGAGCCAGATATGATGAAGGTTGACACTTCATCACCCTCTTGGATGGACCCAATC
GTGGAGTTCATCAAAGGAAATCCACCGCAAGATCCGAAGGAGCAAAGGAAGATGGCACGGAGAGCAGCTCGATTCACACTCCGAGAAGGAGCGTTGTACCGACGTGGCTT
CTCCCTGCCTTTGCTTAAGAAGATTCAGAGTCATGTGGGCACCCTTGACCCAAGTTGGGAGGGACCGTTTGAGGTTAAAGGCATAGTCCGGCCCGAAACGTATATGCTGG
CCGACCTGGAAGGACAAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCTACTTCCTCACCGGCCTAGCCGACGAGACCTTCACTGTGAAGCTCAGAGAGGAGGCTCCAGCAACCTTCGCTGAAGTTCTGCAAAAGGCGAAGAAAGTCATTGA
TGGGCAAGAGCTCCTCCGAACCAAAACTGGCCGACTTGAAAAGCAAATCGACCAGAAGAAGCTAAGCCAAGAGAAGAGGAAGGCCGATTCTAAGTCTAAGGACAAGGGAT
CGTCCTCCTCTAGCAGCAGAACCGAGTATCGTCGGTCGGAGATCGGCCACAATCGAAGCCGACCTTACGAGCGTTACACTCCGACCACCATCCCTATTTCTGAAATACTT
ATGAACATCGAGGAAAGCAGGATGGAAAAACTCCTCAAGCGACCTGAGCAGCTCCGAGGAGACCCAGAAAAGCTTCACAAGGAAAAGTATTGCCGTTTTCATCGCAATCA
AGGCCATAATACAACGAATTGCTGGGAGCTGAAGCGCCAAATTGAAGGTCTCATTCAAGATGGCTATTTCAAAAATGGGGGCCAGCATGGAAATAAAAGGAAAGAACTAG
CTCGCGAGGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAGCCTACTTGCTCCATTACCTTCGGCGATATCGACCTGGAGGAAGTTCATTTGCCTCATAATGATACA
CTTGTGATCGCTCCTCTCATTGATCACGTCTTGATCCGAAGAGTACTGGTAGATGGAGGCGTATCTGCCAACATCCTATCCCTCCCAACATATCTAGCATTGGGGTGGAC
CAGGTCACAGTTGAAGAGAAGTTCAACACCCTTAGTTGGATTCTCTAGAGAACCGGTCTCCCTTGATGGGTGTATCGACTTGCCGGTTACGATTGGGCAAGATACGTATG
AGACCGATCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACAGCCCTTCAATCTTGGAGCCAGATATGATGAAGGTTGACACTTCATCACCCTCTTGGATGGACCCAATC
GTGGAGTTCATCAAAGGAAATCCACCGCAAGATCCGAAGGAGCAAAGGAAGATGGCACGGAGAGCAGCTCGATTCACACTCCGAGAAGGAGCGTTGTACCGACGTGGCTT
CTCCCTGCCTTTGCTTAAGAAGATTCAGAGTCATGTGGGCACCCTTGACCCAAGTTGGGAGGGACCGTTTGAGGTTAAAGGCATAGTCCGGCCCGAAACGTATATGCTGG
CCGACCTGGAAGGACAAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGCGCTATTACCCTTGA
Protein sequenceShow/hide protein sequence
MCYFLTGLADETFTVKLREEAPATFAEVLQKAKKVIDGQELLRTKTGRLEKQIDQKKLSQEKRKADSKSKDKGSSSSSSRTEYRRSEIGHNRSRPYERYTPTTIPISEIL
MNIEESRMEKLLKRPEQLRGDPEKLHKEKYCRFHRNQGHNTTNCWELKRQIEGLIQDGYFKNGGQHGNKRKELAREARREVCIIREQKPTCSITFGDIDLEEVHLPHNDT
LVIAPLIDHVLIRRVLVDGGVSANILSLPTYLALGWTRSQLKRSSTPLVGFSREPVSLDGCIDLPVTIGQDTYETDLARSVPVEILDSPSILEPDMMKVDTSSPSWMDPI
VEFIKGNPPQDPKEQRKMARRAARFTLREGALYRRGFSLPLLKKIQSHVGTLDPSWEGPFEVKGIVRPETYMLADLEGQVLAHPWNAEHLKRYYP