; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g36740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g36740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:28333945..28336241
RNA-Seq ExpressionMoc09g36740
SyntenyMoc09g36740
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]9.9e-11590.36Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQA FDTLKYYKEHFPRGRKVGTLVTD+LLLESGLLDYNPAVRPI
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPI

Query:  ESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEALLKRRRKKKKT
        ESSRPNSELAMVC FASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREE  LKRRRKKKKT
Subjt:  ESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEALLKRRRKKKKT

Query:  TSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        TSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  TSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]9.9e-11591.29Show/hide
Query:  WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENL---
        WGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRF NL   
Subjt:  WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENL---

Query:  --VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASE
          VPELTQA FDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS VKRKSKGRAHALEAAQSSKP TPAVVGPASE
Subjt:  --VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASE

Query:  DPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
        DPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  DPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.5e-12385.96Show/hide
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLD CL RASKFVS PGSVLQRTIDYAAE FVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+H TAELET KERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVRYLDSDYSDLEEDQVDTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLK+RYAE+W SG GGTPGPQALVD+YVR LDSDYSD EEDQV +TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVRYLDSDYSDLEEDQVDTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]8.6e-15985.07Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDYGEDSDASTSGQGLEYPSRIPEHYLGSLSRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFE--
        MSSS SSNL  + DLARRLES+LEEIEN R SD GEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFE  
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDYGEDSDASTSGQGLEYPSRIPEHYLGSLSRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFE--

Query:  ---------------------------WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
                                   WGVIFALAILFWLRARDSEEAEL +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------------WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFENL-----VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRK
        SGEWLAKDESGRSFFDVPTRF NL     VPELTQA FDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFENL-----VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.6e-19270.34Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLV-----PELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLV     PEL QA FDTLK+YK+HFPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLV-----PELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVC F  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EALLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETF
        E+ L+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +EPSSSGV+DQVSRISA  LD  L RASKFVSDPGSVLQRTID  AE F
Subjt:  EALLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV++L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVR
        Q LE K+  +  +T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+W SG  GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVR

Query:  YLDSDYSDLEED--------QVDTTQEGAP--QAGS
         LDSDYSD+EE+        +V TTQE  P  Q GS
Subjt:  YLDSDYSDLEED--------QVDTTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092984.8e-11590.36Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQA FDTLKYYKEHFPRGRKVGTLVTD+LLLESGLLDYNPAVRPI
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPI

Query:  ESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEALLKRRRKKKKT
        ESSRPNSELAMVC FASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREE  LKRRRKKKKT
Subjt:  ESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEALLKRRRKKKKT

Query:  TSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        TSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  TSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138264.8e-11591.29Show/hide
Query:  WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENL---
        WGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRF NL   
Subjt:  WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENL---

Query:  --VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASE
          VPELTQA FDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS VKRKSKGRAHALEAAQSSKP TPAVVGPASE
Subjt:  --VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASE

Query:  DPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
        DPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  DPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC1110185387.4e-12485.96Show/hide
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLD CL RASKFVS PGSVLQRTIDYAAE FVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+H TAELET KERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVRYLDSDYSDLEEDQVDTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLK+RYAE+W SG GGTPGPQALVD+YVR LDSDYSD EEDQV +TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVRYLDSDYSDLEEDQVDTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255024.2e-15985.07Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDYGEDSDASTSGQGLEYPSRIPEHYLGSLSRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFE--
        MSSS SSNL  + DLARRLES+LEEIEN R SD GEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFE  
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDYGEDSDASTSGQGLEYPSRIPEHYLGSLSRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFE--

Query:  ---------------------------WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
                                   WGVIFALAILFWLRARDSEEAEL +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------------WGVIFALAILFWLRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFENL-----VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRK
        SGEWLAKDESGRSFFDVPTRF NL     VPELTQA FDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFENL-----VPELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-19270.34Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLV-----PELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLV     PEL QA FDTLK+YK+HFPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLV-----PELTQAFFDTLKYYKEHFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVC F  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EALLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETF
        E+ L+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +EPSSSGV+DQVSRISA  LD  L RASKFVSDPGSVLQRTID  AE F
Subjt:  EALLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV++L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVR
        Q LE K+  +  +T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+W SG  GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHVTAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVR

Query:  YLDSDYSDLEED--------QVDTTQEGAP--QAGS
         LDSDYSD+EE+        +V TTQE  P  Q GS
Subjt:  YLDSDYSDLEED--------QVDTTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATTACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTAGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTTACTCTCTACTTCAAAATGTTTGAGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGG
CTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGCTGGAAGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGC
AAGGAAAGGCGCAGGCGGGATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTT
CCTTCTTTGACGTTCCCACTAGGTTTGAGAACCTAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTC
GGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTG
CAGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCTTTGGGCGAGGAGGTGAGG
GAGGAGGCCCTCCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCC
TGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAAGTGTCCCGCATCTCGGCTGCAAGTTTGGACT
GCTGCCTAATGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCC
GTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGC
TCACTCTGAGGTGGAAGTTTTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCA
AGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACG
GTGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCAT
GAAGGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGACGTCTGGGTCTGGCGGCACCCCTGGCCCCCAAG
CGTTGGTGGATAAGTATGTCAGATATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGACACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATTACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTAGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTTACTCTCTACTTCAAAATGTTTGAGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGG
CTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGCTGGAAGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGC
AAGGAAAGGCGCAGGCGGGATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTT
CCTTCTTTGACGTTCCCACTAGGTTTGAGAACCTAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTC
GGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTG
CAGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAG
ATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCTTTGGGCGAGGAGGTGAGG
GAGGAGGCCCTCCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCC
TGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAAGTGTCCCGCATCTCGGCTGCAAGTTTGGACT
GCTGCCTAATGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCC
GTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGC
TCACTCTGAGGTGGAAGTTTTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCA
AGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACG
GTGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCAT
GAAGGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGACGTCTGGGTCTGGCGGCACCCCTGGCCCCCAAG
CGTTGGTGGATAAGTATGTCAGATATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGACACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDYGEDSDASTSGQGLEYPSRIPEHYLGSLSRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEWGVIFALAILFW
LRARDSEEAELLEVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFENLVPELTQAFFDTLKYYKEHFPRGRKV
GTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVR
EEALLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDCCLMRASKFVSDPGSVLQRTIDYAAETFVASIQSALA
VKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELET
VKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWTSGSGGTPGPQALVDKYVRYLDSDYSDLEEDQVDTTQEGAPQAGS