; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g18930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g18930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:14740599..14746073
RNA-Seq ExpressionMoc06g18930
SyntenyMoc06g18930
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]4.5e-8973.12Show/hide
Query:  LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVA NGWGVIFALA+LFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVC FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKP T AVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]5.5e-9583.54Show/hide
Query:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKD
        G   + A+ R+EPSSS VRDQVSRISAASLDRCL+RASKFVS PGSVLQRTI YA EAFVASIQSAL VKAELDGRE LAA EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDR +AQLRAAHAIT+GLE EKF LLKEKDDMLQALEAK++EL+HAT ELET KERL+N VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFD

Query:  GFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKRSA
        GFAK+FSD GFKFLMKGIASD+PDLQIDL GLK+R A
Subjt:  GFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKRSA

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.5e-11267.32Show/hide
Query:  MLSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        M SS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MLSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLAPAQVA NGWGVIFALA+LFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPAT AVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]4.3e-9267.95Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVREEVPLKRRRKKKKTTSPLEVRARG
        MVCGFAS+VKRKSKGRAHA EAAQSSKPAT AV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PL EEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  ALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAG
         LP SFADRVDDPEARMGGTSDVTARFRV+PSS+ VRDQVSRISAASLDRCL+RASKFVSDPGSVLQRTI YA EAFVASIQSAL VKAELDGRE LAA 
Subjt:  ALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAG

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKER
        EKEEFS                                                                        ALEAK++EL+HAT ELET KER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKER

Query:  LNNEVLLEESFR
        L+N VLLEESFR
Subjt:  LNNEVLLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.4e-13661.97Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK--------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR             + + F  ++   D  P  RK                    
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK--------------------

Query:  -----PPSTP-----MVCGFASNVKRKSKGRAHALEAAQSSKPATSAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVRE
               S P     MVCGF  +VKRKSKGRAHAL+    ++P T  V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL+ EVR 
Subjt:  -----PPSTP-----MVCGFASNVKRKSKGRAHALEAAQSSKPATSAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAF
        E PL+RRRKKKKT+S  E  ARG LPTS AD VDDPEARM GTS+V  RF +EPSSS V+DQVSRISA  LDR L+RASKFVSDPGSVLQRTI    EAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAF

Query:  VASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDML
        +ASI  A+ VKAELDGREALAA E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++HKA LRAAHAITKGLE EKF LLKEKDD+ 
Subjt:  VASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDML

Query:  QALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFDGFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKR
        Q LE K+  +   T EL+ +KERL N  LLEESFRQHPDFDGFAK+FSD GFKFLMKGIA+D+P LQIDL GLKK+
Subjt:  QALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFDGFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKR

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138262.2e-8973.12Show/hide
Query:  LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVA NGWGVIFALA+LFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVC FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKP T AVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1D971 uncharacterized protein LOC1110185382.7e-9583.54Show/hide
Query:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKD
        G   + A+ R+EPSSS VRDQVSRISAASLDRCL+RASKFVS PGSVLQRTI YA EAFVASIQSAL VKAELDGRE LAA EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDR +AQLRAAHAIT+GLE EKF LLKEKDDMLQALEAK++EL+HAT ELET KERL+N VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFD

Query:  GFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKRSA
        GFAK+FSD GFKFLMKGIASD+PDLQIDL GLK+R A
Subjt:  GFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKRSA

A0A6J1DXS5 uncharacterized protein LOC1110255023.1e-11267.32Show/hide
Query:  MLSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        M SS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MLSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLAPAQVA NGWGVIFALA+LFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTR             + + F  +++  ++ P  RK                   P   P           MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK-------------------PPSTP-----------MVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPAT AVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256062.1e-9267.95Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVREEVPLKRRRKKKKTTSPLEVRARG
        MVCGFAS+VKRKSKGRAHA EAAQSSKPAT AV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PL EEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  ALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAG
         LP SFADRVDDPEARMGGTSDVTARFRV+PSS+ VRDQVSRISAASLDRCL+RASKFVSDPGSVLQRTI YA EAFVASIQSAL VKAELDGRE LAA 
Subjt:  ALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAG

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKER
        EKEEFS                                                                        ALEAK++EL+HAT ELET KER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKER

Query:  LNNEVLLEESFR
        L+N VLLEESFR
Subjt:  LNNEVLLEESFR

A0A6J1DZB3 uncharacterized protein LOC1110256654.1e-13661.97Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK--------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR             + + F  ++   D  P  RK                    
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTR-------------SDSDFVFVQFQSDQSPSSRK--------------------

Query:  -----PPSTP-----MVCGFASNVKRKSKGRAHALEAAQSSKPATSAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVRE
               S P     MVCGF  +VKRKSKGRAHAL+    ++P T  V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL+ EVR 
Subjt:  -----PPSTP-----MVCGFASNVKRKSKGRAHALEAAQSSKPATSAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAF
        E PL+RRRKKKKT+S  E  ARG LPTS AD VDDPEARM GTS+V  RF +EPSSS V+DQVSRISA  LDR L+RASKFVSDPGSVLQRTI    EAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAF

Query:  VASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDML
        +ASI  A+ VKAELDGREALAA E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++HKA LRAAHAITKGLE EKF LLKEKDD+ 
Subjt:  VASIQSALPVKAELDGREALAAGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDML

Query:  QALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFDGFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKR
        Q LE K+  +   T EL+ +KERL N  LLEESFRQHPDFDGFAK+FSD GFKFLMKGIA+D+P LQIDL GLKK+
Subjt:  QALEAKEEELKHATVELETVKERLNNEVLLEESFRQHPDFDGFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTGCTGCAAAGCGTCATGCCGTTGTTAACGAGGCAGCTCGAACCCTTGGTAGGTCAGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTTGTCCTC
TTTTAGCAGCGACTTAGGGTCTGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCT
CCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGA
ACTGGGTTGGCTCCGGCTCAAGTGGCCCTCAATGGGTGGGGTGTCATTTTCGCTTTGGCCGTCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGA
CGTAGACCAGCTCCTTGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCT
CCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTCTGACTCGGATTTT
GTCTTTGTGCAGTTTCAATCTGACCAGTCCCCGAGCTCACGCAAGCCTCCTTCGACACCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGC
CCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTTCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTT
CGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGACGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTT
CAGAGTCGAGCCGTCAAGTTCTCGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAAAAGAGCGTCCAAATTTGTAAGTGACCCGGGGT
CCGTCCTGCAGAGGACCATCCACTACGCCACTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGCCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGGGG
GAGAAAGAGGAGTTCTCTGCTGCTTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGGCCAA
GGCCGAGCTGCTGAAGAAAGAAGAGGACAGACACAAAGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAATGAGAAGTTCCACCTCCTCAAGGAGAAGG
ACGACATGCTCCAGGCGCTTGAAGCGAAAGAGGAGGAGCTGAAGCATGCGACTGTCGAGCTGGAGACGGTGAAGGAGCGTCTCAACAATGAAGTCCTATTGGAGGAATCG
TTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAAACTTCTCTGACGTGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATTCCTGACCTTCAGATCGATCT
CGGTGGTCTGAAGAAGAGGTCGGCACCACTCAAAAGGGCGCTCCTCAAGCAGGCTCTTAGGCGATCACCTTTCATGAGGCCTTTCTCTGTCTTCCTCTCTCTTTTTAAGT
GTTTGAATTTTAAGTTCGTCAGTGGTTTTGGCATCGCACCTCGTACCCTTAGATCCATTGAAAACCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTT
CCACGGGTGCGCGAGGACGTCTCCTTTCAAATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCCCTCCCAGGTCGGGTCAAGGGCACCCA
CATGGGTTTAGACCCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACTGCTGCAAAGCGTCATGCCGTTGTTAACGAGGCAGCTCGAACCCTTGGTAGGTCAGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTTGTCCTC
TTTTAGCAGCGACTTAGGGTCTGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCT
CCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGA
ACTGGGTTGGCTCCGGCTCAAGTGGCCCTCAATGGGTGGGGTGTCATTTTCGCTTTGGCCGTCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGA
CGTAGACCAGCTCCTTGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCT
CCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTCTGACTCGGATTTT
GTCTTTGTGCAGTTTCAATCTGACCAGTCCCCGAGCTCACGCAAGCCTCCTTCGACACCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGC
CCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTTCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTT
CGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGACGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTT
CAGAGTCGAGCCGTCAAGTTCTCGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAAAAGAGCGTCCAAATTTGTAAGTGACCCGGGGT
CCGTCCTGCAGAGGACCATCCACTACGCCACTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGCCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGGGG
GAGAAAGAGGAGTTCTCTGCTGCTTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGGCCAA
GGCCGAGCTGCTGAAGAAAGAAGAGGACAGACACAAAGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAATGAGAAGTTCCACCTCCTCAAGGAGAAGG
ACGACATGCTCCAGGCGCTTGAAGCGAAAGAGGAGGAGCTGAAGCATGCGACTGTCGAGCTGGAGACGGTGAAGGAGCGTCTCAACAATGAAGTCCTATTGGAGGAATCG
TTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAAACTTCTCTGACGTGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATTCCTGACCTTCAGATCGATCT
CGGTGGTCTGAAGAAGAGGTCGGCACCACTCAAAAGGGCGCTCCTCAAGCAGGCTCTTAGGCGATCACCTTTCATGAGGCCTTTCTCTGTCTTCCTCTCTCTTTTTAAGT
GTTTGAATTTTAAGTTCGTCAGTGGTTTTGGCATCGCACCTCGTACCCTTAGATCCATTGAAAACCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTT
CCACGGGTGCGCGAGGACGTCTCCTTTCAAATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCCCTCCCAGGTCGGGTCAAGGGCACCCA
CATGGGTTTAGACCCTTCTTAA
Protein sequenceShow/hide protein sequence
MALLQSVMPLLTRQLEPLVGQSLPSLSLSNVVAMLSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGLPLHPFVQEFLFR
TGLAPAQVALNGWGVIFALAVLFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSDSDF
VFVQFQSDQSPSSRKPPSTPMVCGFASNVKRKSKGRAHALEAAQSSKPATSAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLDEEVREEVPLKRRRKKKKT
TSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIHYATEAFVASIQSALPVKAELDGREALAAG
EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRHKAQLRAAHAITKGLENEKFHLLKEKDDMLQALEAKEEELKHATVELETVKERLNNEVLLEES
FRQHPDFDGFAKNFSDVGFKFLMKGIASDIPDLQIDLGGLKKRSAPLKRALLKQALRRSPFMRPFSVFLSLFKCLNFKFVSGFGIAPRTLRSIENPFGISRIITLQVLRV
PRVREDVSFQIGQYVRPRSDYSLDLKRPLPGRVKGTHMGLDPS