CuGenDBv2

Moc06g09310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g09310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:6799751..6802018
RNA-Seq Expression: Moc06g09310
Synteny: Moc06g09310
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA
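
The genome location field above packs the chromosome, start, and end into one string. A minimal sketch of how it could be split and its span computed is given here. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Region` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates ASSUMED to be 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Region(chrom, start, end)


region = parse_location("chr6:6799751..6802018")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene spans 2268 bp on chr6.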


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 2.2e-103; % identity: 81.5)
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVDVSPLGEEVREEVPLKRRR
        AVR IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPA                        KR RDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LP SFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 5.1e-113; % identity: 82.71)
Query:  LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG
        LP HPFVQEFLFRTGLAPAQVAPNGWGVIFALAI F LRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASG
Subjt:  LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATPA------------------------KRLRDQTEAV-------DVSPLGE
        GRAHALEAAQSSKP TPA                        KR RDQTEAV       DV PLGE
Subjt:  GRAHALEAAQSSKPATPA------------------------KRLRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 9.7e-120; % identity: 84.56)
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDMLQALEAKEEELKHGTAELETAK---------EESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEK+DMLQALEAK++EL+H TAELETAK         EE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDMLQALEAKEEELKHGTAELETAK---------EESFRQHPDFD

Query:  GFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLQEDQVGTTQEGAPQAGS
        GF KDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD +EDQVG+TQEGA   GS
Subjt:  GFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLQEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 3.0e-137; % identity: 76.06)
Query:  MPSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKG------------------------------------
        M SS SS+L  + DLARRLES+LEEIEN R SDDGED+DASTSGQGLEYPSRIPEHYLGSLR+G                                    
Subjt:  MPSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKG------------------------------------

Query:  --LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
          LP HPFVQEFLFRTGLAPAQVAPNGWGVIFALAI F LRARDSEEAEL DV+QLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  --LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVD
        SKGRAHALEAAQSSKPATPA                        KR RDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 1.3e-185; % identity: 68.47)
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA--------------------------------KRLRDQTEAVDVSPLGEEVRE
         VRLIE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP                                 KR R+++EA+DVSPL  EVR 
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA--------------------------------KRLRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEK+D+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDML

Query:  QALEAKEEELKHGTAELETAK---------EESFRQHPDFDGFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+  K         EESFRQHPDFDGF KDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHGTAELETAK---------EESFRQHPDFDGFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLQED--------QVGTTQEGAP--QAGS
        +LDSDYSD++E+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLQED--------QVGTTQEGAP--QAGS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 1.0e-103; % identity: 81.5)
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVDVSPLGEEVREEVPLKRRR
        AVR IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPA                        KR RDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LP SFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 2.5e-113; % identity: 82.71)
Query:  LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG
        LP HPFVQEFLFRTGLAPAQVAPNGWGVIFALAI F LRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYASG
Subjt:  LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATPA------------------------KRLRDQTEAV-------DVSPLGE
        GRAHALEAAQSSKP TPA                        KR RDQTEAV       DV PLGE
Subjt:  GRAHALEAAQSSKPATPA------------------------KRLRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 4.7e-120; % identity: 84.56)
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDMLQALEAKEEELKHGTAELETAK---------EESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEK+DMLQALEAK++EL+H TAELETAK         EE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDMLQALEAKEEELKHGTAELETAK---------EESFRQHPDFD

Query:  GFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLQEDQVGTTQEGAPQAGS
        GF KDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD +EDQVG+TQEGA   GS
Subjt:  GFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLQEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 1.5e-137; % identity: 76.06)
Query:  MPSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKG------------------------------------
        M SS SS+L  + DLARRLES+LEEIEN R SDDGED+DASTSGQGLEYPSRIPEHYLGSLR+G                                    
Subjt:  MPSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKG------------------------------------

Query:  --LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
          LP HPFVQEFLFRTGLAPAQVAPNGWGVIFALAI F LRARDSEEAEL DV+QLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  --LPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVD
        SKGRAHALEAAQSSKPATPA                        KR RDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPA------------------------KRLRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 6.4e-186; % identity: 68.47)
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA--------------------------------KRLRDQTEAVDVSPLGEEVRE
         VRLIE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP                                 KR R+++EA+DVSPL  EVR 
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPA--------------------------------KRLRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEK+D+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDML

Query:  QALEAKEEELKHGTAELETAK---------EESFRQHPDFDGFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+  K         EESFRQHPDFDGF KDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHGTAELETAK---------EESFRQHPDFDGFVKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLQED--------QVGTTQEGAP--QAGS
        +LDSDYSD++E+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLQED--------QVGTTQEGAP--QAGS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAAGGGACTTCCCTTTCACCCTTTCGTCCAAGAGT
TTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCTTTTTTTTGTTACGAGCTCGGGATAGTGAAGAGGCC
GAGCTGCTGGACGTAAACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGATTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAGTTAA
GGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGATGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTG
GGAACCTAGTATCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCTTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTG
ACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAG
CAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTAAGCGCCTCAGGGATCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTACG
AGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCGGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTG
TTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCC
ACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCA
GCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTTCTCAAGGAGAAGAATGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGA
AGCATGGGACTGCCGAGCTGGAGACGGCGAAGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGTCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATG
AAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTTGGTGGTCTGAAGAAAAGGTATGCCGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGC
GTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCCAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequence
ATGCCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAAGGGACTTCCCTTTCACCCTTTCGTCCAAGAGT
TTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCTTTTTTTTGTTACGAGCTCGGGATAGTGAAGAGGCC
GAGCTGCTGGACGTAAACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGATTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAGTTAA
GGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGATGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTG
GGAACCTAGTATCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCTTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTG
ACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAG
CAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTAAGCGCCTCAGGGATCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTACG
AGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCGGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTG
TTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCC
ACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCA
GCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTTCTCAAGGAGAAGAATGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGA
AGCATGGGACTGCCGAGCTGGAGACGGCGAAGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGTCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATG
AAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTTGGTGGTCTGAAGAAAAGGTATGCCGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGC
GTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCCAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequence
MPSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGLPFHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFLLRARDSEEA
ELLDVNQLLACFEAKRIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLV
TDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAKRLRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPT
SFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASS
TMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKNDMLQALEAKEEELKHGTAELETAKEESFRQHPDFDGFVKDFSDAGFKFLM
KGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLQEDQVGTTQEGAPQAGS