; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g16360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g16360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:12908557..12911396
RNA-Seq ExpressionMoc06g16360
SyntenyMoc06g16360
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]7.8e-8152.03Show/hide
Query:  RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEH
        RG+KIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V+  +P TPAV + A QD+A PSS  PTPVIELDS GE 
Subjt:  RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEH

Query:  SREKRPRNESEALDVSPLKEVRGESPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQKG
        SREKR R+ESEALDVSPL+EVR                                                                              
Subjt:  SREKRPRNESEALDVSPLKEVRGESPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQKG

Query:  VQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKD
                                                                    EA AELLK+EDERH AHL+AAHAITKGLEKEKFQLLKEKD
Subjt:  VQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKD

Query:  DLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL
        D+ Q LE+KDA++GRL AELK   ERLTNGALLE  FRQHPDFDGFAKDFSDAGFKFLMK IAAD+PHL
Subjt:  DLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.5e-7674.77Show/hide
Query:  MRFRMEPSSSGVKDQVSRISAACLD-------------------LPQKGVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARS
        MRFRME SSSGVKDQVSRISA CLD                         +AFIASIHS VM+KAELDGREAL AKE+EN S  LEAATT+KGELLKA+ 
Subjt:  MRFRMEPSSSGVKDQVSRISAACLD-------------------LPQKGVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARS

Query:  EVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFS
        EVDILRAEV+A  +LLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQVLEKKDAS+GRLT ELK++ ERLT+GALLEE+FRQHP+FDGFAKDFS
Subjt:  EVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFS

Query:  DAGFKFLMKDIAADMPHL
        DAGFKFLMK IAADMPHL
Subjt:  DAGFKFLMKDIAADMPHL

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.5e-8176.11Show/hide
Query:  MGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQK-------------------GVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMK
        MGGT DV+ RFRMEPSSSGVKDQVSRISA CLD   K                     +AF+ASIHS +M+KAELDGREALAAKE+ENSSAALEAATT+K
Subjt:  MGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQK-------------------GVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMK

Query:  GELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDF
        GELLKA+ EV ILRAEV+A AELLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQVLE KD S+GRLTAELK++ ERLTNG+LLEE+FRQH DF
Subjt:  GELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDF

Query:  DGFAKDFSDAGFKFLMKDIAADMPHL
        DGFAKDFSDAGFKFLMK IAADMPHL
Subjt:  DGFAKDFSDAGFKFLMKDIAADMPHL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.4e-7650.14Show/hide
Query:  SDSGEDLALRLESELEEIENFRFSDDGEDSDTSTL-----------------------------------------------------------------
        S+   DLA RLES+LEEIEN R SDDGEDSD ST                                                                  
Subjt:  SDSGEDLALRLESELEEIENFRFSDDGEDSDTSTL-----------------------------------------------------------------

Query:  ------------------GLGVIFALAILFWLRARDEDEAELLNVEQLLECFQAKRIAKKRGRYYMCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKN
                          G GVIFALAILFWLRARD +EAEL +V+QLL CF+AKRIAKK GR+YMCARK AGGIVKGPTSIKGWV KWF+ASGEWLAK+
Subjt:  ------------------GLGVIFALAILFWLRARDEDEAELLNVEQLLECFQAKRIAKKRGRYYMCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKN

Query:  ESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL
        ES R FFDVP                               RG+K+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL
Subjt:  ESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ++ +  +P TPAV  PA        SE P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  KSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]5.3e-17072.63Show/hide
Query:  MCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKNESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNP
        MCARK  GGIVKGPTSIKGWVGKWFFASGEWLAK+ES R FFDVP                               R +KI TLVTDKLLLESGLLDYNP
Subjt:  MCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKNESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLKEVRGE
        LVR IEASRPNSELAMVCGFTGSVKRKSKGRAHALK+V G EP TP V R   Q  + PSS VPTPVIELD +G  S EKR R ESEALDVSPL EVRGE
Subjt:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLKEVRGE

Query:  SPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLD---------------LPQKGV----QAFI
        SPL+R RKK KT+ SSE G RGTLPTSHA+LVDD E RM GTS+V+MRF MEPSSSGVKDQVSRISA CLD               + Q+ +    +AFI
Subjt:  SPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLD---------------LPQKGV----QAFI

Query:  ASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQV
        ASIH  VM+KAELDGREALAAKE+ENS AALEAATT+KGELLKA+ EVDILRAEV+A  +LLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQV

Query:  LEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL
        LE+KDAS+GRLT ELK++ ERLTNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMK IAADMPHL
Subjt:  LEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124673.8e-8152.03Show/hide
Query:  RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEH
        RG+KIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V+  +P TPAV + A QD+A PSS  PTPVIELDS GE 
Subjt:  RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEH

Query:  SREKRPRNESEALDVSPLKEVRGESPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQKG
        SREKR R+ESEALDVSPL+EVR                                                                              
Subjt:  SREKRPRNESEALDVSPLKEVRGESPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQKG

Query:  VQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKD
                                                                    EA AELLK+EDERH AHL+AAHAITKGLEKEKFQLLKEKD
Subjt:  VQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKD

Query:  DLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL
        D+ Q LE+KDA++GRL AELK   ERLTNGALLE  FRQHPDFDGFAKDFSDAGFKFLMK IAAD+PHL
Subjt:  DLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL

A0A6J1D1N9 uncharacterized protein LOC1110161937.4e-7774.77Show/hide
Query:  MRFRMEPSSSGVKDQVSRISAACLD-------------------LPQKGVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARS
        MRFRME SSSGVKDQVSRISA CLD                         +AFIASIHS VM+KAELDGREAL AKE+EN S  LEAATT+KGELLKA+ 
Subjt:  MRFRMEPSSSGVKDQVSRISAACLD-------------------LPQKGVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARS

Query:  EVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFS
        EVDILRAEV+A  +LLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQVLEKKDAS+GRLT ELK++ ERLT+GALLEE+FRQHP+FDGFAKDFS
Subjt:  EVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFS

Query:  DAGFKFLMKDIAADMPHL
        DAGFKFLMK IAADMPHL
Subjt:  DAGFKFLMKDIAADMPHL

A0A6J1DF31 uncharacterized protein LOC1110199091.7e-8176.11Show/hide
Query:  MGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQK-------------------GVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMK
        MGGT DV+ RFRMEPSSSGVKDQVSRISA CLD   K                     +AF+ASIHS +M+KAELDGREALAAKE+ENSSAALEAATT+K
Subjt:  MGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQK-------------------GVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMK

Query:  GELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDF
        GELLKA+ EV ILRAEV+A AELLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQVLE KD S+GRLTAELK++ ERLTNG+LLEE+FRQH DF
Subjt:  GELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDF

Query:  DGFAKDFSDAGFKFLMKDIAADMPHL
        DGFAKDFSDAGFKFLMK IAADMPHL
Subjt:  DGFAKDFSDAGFKFLMKDIAADMPHL

A0A6J1DXS5 uncharacterized protein LOC1110255022.1e-7650.14Show/hide
Query:  SDSGEDLALRLESELEEIENFRFSDDGEDSDTSTL-----------------------------------------------------------------
        S+   DLA RLES+LEEIEN R SDDGEDSD ST                                                                  
Subjt:  SDSGEDLALRLESELEEIENFRFSDDGEDSDTSTL-----------------------------------------------------------------

Query:  ------------------GLGVIFALAILFWLRARDEDEAELLNVEQLLECFQAKRIAKKRGRYYMCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKN
                          G GVIFALAILFWLRARD +EAEL +V+QLL CF+AKRIAKK GR+YMCARK AGGIVKGPTSIKGWV KWF+ASGEWLAK+
Subjt:  ------------------GLGVIFALAILFWLRARDEDEAELLNVEQLLECFQAKRIAKKRGRYYMCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKN

Query:  ESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL
        ES R FFDVP                               RG+K+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL
Subjt:  ESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALD
        ++ +  +P TPAV  PA        SE P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  KSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256652.6e-17072.63Show/hide
Query:  MCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKNESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNP
        MCARK  GGIVKGPTSIKGWVGKWFFASGEWLAK+ES R FFDVP                               R +KI TLVTDKLLLESGLLDYNP
Subjt:  MCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKNESRRPFFDVPV------------------------------RGKKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLKEVRGE
        LVR IEASRPNSELAMVCGFTGSVKRKSKGRAHALK+V G EP TP V R   Q  + PSS VPTPVIELD +G  S EKR R ESEALDVSPL EVRGE
Subjt:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLKEVRGE

Query:  SPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLD---------------LPQKGV----QAFI
        SPL+R RKK KT+ SSE G RGTLPTSHA+LVDD E RM GTS+V+MRF MEPSSSGVKDQVSRISA CLD               + Q+ +    +AFI
Subjt:  SPLKRIRKKTKTTFSSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLD---------------LPQKGV----QAFI

Query:  ASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQV
        ASIH  VM+KAELDGREALAAKE+ENS AALEAATT+KGELLKA+ EVDILRAEV+A  +LLKKE E+H AHL+AAHAITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMKGELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQV

Query:  LEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL
        LE+KDAS+GRLT ELK++ ERLTNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMK IAADMPHL
Subjt:  LEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAKDFSDAGFKFLMKDIAADMPHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGGGTTCGTGAGCGACCTGCTGCAGGCGTCGTCGCCACCGTGTCGAGGAATGCGGCCACCGTGGGAATTGAACGTCGTCGCTCGTCGGAGGAGTGCCGT
CGCCGTGCAGGTCAGACCATAAGCAGTTCGACCCCCAAACCAAGTGACTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTGGAAGAGATAGAAAATTTT
AGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTTGGGCCTGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGAGACGAGGACGAG
GCCGAGCTGCTAAATGTTGAGCAGCTTCTTGAGTGCTTCCAAGCCAAGAGAATAGCTAAGAAGCGAGGTCGGTATTATATGTGCGCAAGGAAAGACGCGGGTGGT
ATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTCGGGAAATGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAAAAACGAATCACGTCGTCCCTTCTTTGACGTG
CCTGTTAGGGGCAAGAAGATCGGAACCTTGGTGACTGACAAACTGCTTCTGGAATCTGGGTTGTTGGACTACAATCCCTTGGTGCGTCCGATTGAAGCTTCAAGG
CCAAACTCCGAGCTTGCAATGGTGTGCGGGTTCACTGGCAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTTAAGTCTGTTGAGGGTATAGAGCCAACA
ACCCCTGCTGTGGCTCGACCTGCGGTTCAGGACAGGGCTGAACCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCCAGAGAA
AAGCGCCCAAGGAATGAGTCTGAGGCGTTGGACGTATCTCCCTTGAAGGAGGTGAGGGGAGAGTCTCCTTTGAAGAGGATAAGGAAGAAGACGAAGACCACATTC
TCCTCGGAGGTAGGACCTCGTGGGACCCTGCCCACGAGCCATGCTAACTTGGTGGACGACACTGAAGATCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTC
AGAATGGAACCGTCGAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATTTCGGCTGCGTGCTTGGACCTGCCTCAGAAGGGCGTCCAAGCGTTCATTGCTTCCATT
CATTCGACAGTTATGATAAAGGCTGAATTGGATGGAAGGGAGGCTTTGGCAGCAAAGGAGAAGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACAATGAAG
GGCGAGTTACTGAAGGCTCGCTCCGAAGTGGATATCTTGAGGGCCGAGGTGGAAGCCAATGCCGAGTTGTTGAAGAAGGAGGATGAGAGGCATAATGCCCACCTC
CAAGCTGCCCATGCCATCACTAAAGGGCTGGAGAAGGAGAAGTTCCAACTCCTAAAGGAGAAGGATGACCTTGCTCAAGTCCTTGAGAAGAAGGATGCTTCGCTA
GGGCGCCTTACCGCCGAGCTGAAGGAGGTGAATGAACGCCTCACCAACGGGGCTCTCTTGGAGGAAACTTTCAGGCAGCACCCTGACTTTGACGGGTTTGCCAAG
GACTTCAGCGATGCAGGCTTCAAATTTCTGATGAAAGACATTGCTGCTGACATGCCCCACCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCGGGTTCGTGAGCGACCTGCTGCAGGCGTCGTCGCCACCGTGTCGAGGAATGCGGCCACCGTGGGAATTGAACGTCGTCGCTCGTCGGAGGAGTGCCGT
CGCCGTGCAGGTCAGACCATAAGCAGTTCGACCCCCAAACCAAGTGACTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTGGAAGAGATAGAAAATTTT
AGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTTGGGCCTGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGAGACGAGGACGAG
GCCGAGCTGCTAAATGTTGAGCAGCTTCTTGAGTGCTTCCAAGCCAAGAGAATAGCTAAGAAGCGAGGTCGGTATTATATGTGCGCAAGGAAAGACGCGGGTGGT
ATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTCGGGAAATGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAAAAACGAATCACGTCGTCCCTTCTTTGACGTG
CCTGTTAGGGGCAAGAAGATCGGAACCTTGGTGACTGACAAACTGCTTCTGGAATCTGGGTTGTTGGACTACAATCCCTTGGTGCGTCCGATTGAAGCTTCAAGG
CCAAACTCCGAGCTTGCAATGGTGTGCGGGTTCACTGGCAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTTAAGTCTGTTGAGGGTATAGAGCCAACA
ACCCCTGCTGTGGCTCGACCTGCGGTTCAGGACAGGGCTGAACCGTCTTCTGAAGTTCCAACTCCGGTGATCGAGTTGGACTCTGCTGGGGAGCACTCCAGAGAA
AAGCGCCCAAGGAATGAGTCTGAGGCGTTGGACGTATCTCCCTTGAAGGAGGTGAGGGGAGAGTCTCCTTTGAAGAGGATAAGGAAGAAGACGAAGACCACATTC
TCCTCGGAGGTAGGACCTCGTGGGACCCTGCCCACGAGCCATGCTAACTTGGTGGACGACACTGAAGATCGGATGGGGGGGACGTCCGACGTGAAGATGCGGTTC
AGAATGGAACCGTCGAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATTTCGGCTGCGTGCTTGGACCTGCCTCAGAAGGGCGTCCAAGCGTTCATTGCTTCCATT
CATTCGACAGTTATGATAAAGGCTGAATTGGATGGAAGGGAGGCTTTGGCAGCAAAGGAGAAGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACAATGAAG
GGCGAGTTACTGAAGGCTCGCTCCGAAGTGGATATCTTGAGGGCCGAGGTGGAAGCCAATGCCGAGTTGTTGAAGAAGGAGGATGAGAGGCATAATGCCCACCTC
CAAGCTGCCCATGCCATCACTAAAGGGCTGGAGAAGGAGAAGTTCCAACTCCTAAAGGAGAAGGATGACCTTGCTCAAGTCCTTGAGAAGAAGGATGCTTCGCTA
GGGCGCCTTACCGCCGAGCTGAAGGAGGTGAATGAACGCCTCACCAACGGGGCTCTCTTGGAGGAAACTTTCAGGCAGCACCCTGACTTTGACGGGTTTGCCAAG
GACTTCAGCGATGCAGGCTTCAAATTTCTGATGAAAGACATTGCTGCTGACATGCCCCACCTCTAG
Protein sequenceShow/hide protein sequence
MDRVRERPAAGVVATVSRNAATVGIERRRSSEECRRRAGQTISSSTPKPSDSGEDLALRLESELEEIENFRFSDDGEDSDTSTLGLGVIFALAILFWLRARDEDE
AELLNVEQLLECFQAKRIAKKRGRYYMCARKDAGGIVKGPTSIKGWVGKWFFASGEWLAKNESRRPFFDVPVRGKKIGTLVTDKLLLESGLLDYNPLVRPIEASR
PNSELAMVCGFTGSVKRKSKGRAHALKSVEGIEPTTPAVARPAVQDRAEPSSEVPTPVIELDSAGEHSREKRPRNESEALDVSPLKEVRGESPLKRIRKKTKTTF
SSEVGPRGTLPTSHANLVDDTEDRMGGTSDVKMRFRMEPSSSGVKDQVSRISAACLDLPQKGVQAFIASIHSTVMIKAELDGREALAAKEKENSSAALEAATTMK
GELLKARSEVDILRAEVEANAELLKKEDERHNAHLQAAHAITKGLEKEKFQLLKEKDDLAQVLEKKDASLGRLTAELKEVNERLTNGALLEETFRQHPDFDGFAK
DFSDAGFKFLMKDIAADMPHL