; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g02190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g02190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:1440274..1455354
RNA-Seq ExpressionMoc01g02190
SyntenyMoc01g02190
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.5e-14696Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTS
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LA+VC F S
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALPLGEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAAD  PLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALPLGEEA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]3.8e-10698.96Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.1e-10496.39Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAI
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL +
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAI

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.7e-16787.82Show/hide
Query:  MSSSISSNLGSDLARRLESGLEEIENFRISDDGEDSDASTSG-----------------------------RASRGGERVDNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES LEEIEN RISDDGEDSDASTSG                             R    GER DNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESGLEEIENFRISDDGEDSDASTSG-----------------------------RASRGGERVDNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFHASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWF+ASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFHASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTSGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELA+VCGF SGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTSGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.2e-12954.24Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAIVCGFTSGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALP
         VR IE+SRPNSELA+VCGFT  VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAIVCGFTSGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALP

Query:  LGEEAREEAPPKRRRKKKKAISPSEVGA---------------------------------------CRVSRISAASLDRCLRRASKFVRPWVRSAEDHR
        L  E R E+P +RRRKKKK  S SE GA                                        +VSRISA  LDR LRRASKFV     S     
Subjt:  LGEEAREEAPPKRRRKKKKAISPSEVGA---------------------------------------CRVSRISAASLDRCLRRASKFVRPWVRSAEDHR

Query:  LRRR-GKTSAPFL----LDFAQQAELDWREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------
        L+R     +  F+    L    +AELD RE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                
Subjt:  LRRR-GKTSAPFL----LDFAQQAELDWREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------

Query:  -------------ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPGGT
                      LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMK IA+DMP LQIDL+GLK++Y+EKWASGP GT
Subjt:  -------------ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPGGT

Query:  PGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP
        P PQ+LV++YVR+LDSDYSD EE+        +VG+TQE  P
Subjt:  PGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138263.1e-14696Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTS
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LA+VC F S
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALPLGEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAAD  PLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALPLGEEA

A0A6J1DWD2 uncharacterized protein LOC1110246801.9e-10698.96Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251081.0e-10496.39Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAI
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL +
Subjt:  KWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAI

A0A6J1DXS5 uncharacterized protein LOC1110255023.2e-16787.82Show/hide
Query:  MSSSISSNLGSDLARRLESGLEEIENFRISDDGEDSDASTSG-----------------------------RASRGGERVDNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES LEEIEN RISDDGEDSDASTSG                             R    GER DNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESGLEEIENFRISDDGEDSDASTSG-----------------------------RASRGGERVDNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFHASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWF+ASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFHASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTSGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELA+VCGF SGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAIVCGFTSGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.6e-12954.24Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAIVCGFTSGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALP
         VR IE+SRPNSELA+VCGFT  VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAIVCGFTSGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALP

Query:  LGEEAREEAPPKRRRKKKKAISPSEVGA---------------------------------------CRVSRISAASLDRCLRRASKFVRPWVRSAEDHR
        L  E R E+P +RRRKKKK  S SE GA                                        +VSRISA  LDR LRRASKFV     S     
Subjt:  LGEEAREEAPPKRRRKKKKAISPSEVGA---------------------------------------CRVSRISAASLDRCLRRASKFVRPWVRSAEDHR

Query:  LRRR-GKTSAPFL----LDFAQQAELDWREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------
        L+R     +  F+    L    +AELD RE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                
Subjt:  LRRR-GKTSAPFL----LDFAQQAELDWREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------

Query:  -------------ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPGGT
                      LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMK IA+DMP LQIDL+GLK++Y+EKWASGP GT
Subjt:  -------------ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPGGT

Query:  PGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP
        P PQ+LV++YVR+LDSDYSD EE+        +VG+TQE  P
Subjt:  PGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTTCTCTGCCAGGTCGAGATCGGACCAGACAGAGAGGGGGTCAATGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCCATCGGTGCTCCAAGA
CGACTCATGAATCATGCCCAGCTCGGAAGGCCGAGATGGCCGCCACCAACGCCGACAAATGCCGAGTGTGAGGGCCGAGGTGAGCTCGGCTCGGGTCCGACCCAC
CGGGGAGCTCGATATGGGCCGAGATGCGCAACTGTTCGCCCAAGCTTTCAGATCGGTCCGGAGGCCGGGTTCGAGCTGCAACCAGGAACACACTGTAGTGCAAAC
CTTTGCATAAACAAGATCTACATGAGTCGATTTGGAAGCAGGCGCGGGGACAGCTTACTGAGAGACAAAGACAAACCGATGCGAAACAGTGCTTGGGCATCAATC
AAGGTTCGTTCGATTTCTTTATCGAAAGCTTTTGGAAATTTGATAAAGTTTCATGAAGAACTTGATGATGGGACTTCTTGGAATATGTTTAGAGGCTATGCTCGA
ACTCGGCCTCCGGACCGGCCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCAGACCTGGGCCAGGTCCACCTCG
GCCCTCATACTTAGCATTTGTCGGCGTTGGTGGCGGTCATCTCGGCTATCCGAGCTGGGCATGACTCATGAGCCATCTTGGAGCACCAATAGGGGTCCTCCACGT
GTCCAGGGCTATAAATGCCCCCAATCCTTCAGATCATACCTTACGTTCCTTGAGTTCTTGGAGTTCGATCTGGAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCT
TCCCTCTCTCTTTCGAACGTAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGGGCTCGAGGAGATAGAAAAC
TTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCGGGCTTCCAGAGGAGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTC
TACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCGAATGGG
TGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCGTGCTTCGAGGCG
AAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGG
TTCCACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAG
CTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGG
CTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATAGTTTGTGGATTTACAAGCGGCGTGAAGCGCAAGTCTAAG
GGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCT
TCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGCCCTGCCTTTGGGCGAGGAGGCGAGGGAG
GAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGC
CTAAGGAGGGCGTCCAAATTTGTGAGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGTAAGACTAGTGCCCCATTTTTGCTTGATTTCGCC
CAACAGGCCGAGCTGGATTGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTG
AAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGCGCTCGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTC
AGCAATGGAGTCCTACTAGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGACATT
GCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTG
GTGGAACAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCGGACTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTTCTCTGCCAGGTCGAGATCGGACCAGACAGAGAGGGGGTCAATGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCCATCGGTGCTCCAAGA
CGACTCATGAATCATGCCCAGCTCGGAAGGCCGAGATGGCCGCCACCAACGCCGACAAATGCCGAGTGTGAGGGCCGAGGTGAGCTCGGCTCGGGTCCGACCCAC
CGGGGAGCTCGATATGGGCCGAGATGCGCAACTGTTCGCCCAAGCTTTCAGATCGGTCCGGAGGCCGGGTTCGAGCTGCAACCAGGAACACACTGTAGTGCAAAC
CTTTGCATAAACAAGATCTACATGAGTCGATTTGGAAGCAGGCGCGGGGACAGCTTACTGAGAGACAAAGACAAACCGATGCGAAACAGTGCTTGGGCATCAATC
AAGGTTCGTTCGATTTCTTTATCGAAAGCTTTTGGAAATTTGATAAAGTTTCATGAAGAACTTGATGATGGGACTTCTTGGAATATGTTTAGAGGCTATGCTCGA
ACTCGGCCTCCGGACCGGCCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCAGACCTGGGCCAGGTCCACCTCG
GCCCTCATACTTAGCATTTGTCGGCGTTGGTGGCGGTCATCTCGGCTATCCGAGCTGGGCATGACTCATGAGCCATCTTGGAGCACCAATAGGGGTCCTCCACGT
GTCCAGGGCTATAAATGCCCCCAATCCTTCAGATCATACCTTACGTTCCTTGAGTTCTTGGAGTTCGATCTGGAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCT
TCCCTCTCTCTTTCGAACGTAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGGGCTCGAGGAGATAGAAAAC
TTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCGGGCTTCCAGAGGAGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTC
TACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCGAATGGG
TGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCGTGCTTCGAGGCG
AAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGG
TTCCACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAG
CTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGG
CTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATAGTTTGTGGATTTACAAGCGGCGTGAAGCGCAAGTCTAAG
GGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCT
TCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGCCCTGCCTTTGGGCGAGGAGGCGAGGGAG
GAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGC
CTAAGGAGGGCGTCCAAATTTGTGAGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGTAAGACTAGTGCCCCATTTTTGCTTGATTTCGCC
CAACAGGCCGAGCTGGATTGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTG
AAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGCGCTCGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTC
AGCAATGGAGTCCTACTAGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGACATT
GCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTG
GTGGAACAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCGGACTCTTAG
Protein sequenceShow/hide protein sequence
MNFSARSRSDQTERGSMFGEGNTLDTWRTPIGAPRRLMNHAQLGRPRWPPPTPTNAECEGRGELGSGPTHRGARYGPRCATVRPSFQIGPEAGFELQPGTHCSAN
LCINKIYMSRFGSRRGDSLLRDKDKPMRNSAWASIKVRSISLSKAFGNLIKFHEELDDGTSWNMFRGYARTRPPDRPEHLGGPAQKGEHSDDQVSIGQTWARSTS
ALILSICRRWWRSSRLSELGMTHEPSWSTNRGPPRVQGYKCPQSFRSYLTFLEFLEFDLEAARTLGRSVSSLSLSNVIAMSSSISSNLGSDLARRLESGLEEIEN
FRISDDGEDSDASTSGRASRGGERVDNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEA
KRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFHASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESG
LLDYNPAVRPIESSRPNSELAIVCGFTSGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADALPLGEEARE
EAPPKRRRKKKKAISPSEVGACRVSRISAASLDRCLRRASKFVRPWVRSAEDHRLRRRGKTSAPFLLDFAQQAELDWREVLAAREKEEFSAALEAASSTMKDELL
KAHSEVETLKAEALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQAL
VEQYVRDLDSDYSDPEEDQVGSTQEGAPQADS