; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g33190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g33190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:24975837..24979976
RNA-Seq ExpressionMoc04g33190
SyntenyMoc04g33190
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]7.6e-10883.86Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+I+PVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE GLL YNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP

Query:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR
        AVRPIESSRPN EL    GFASNVKRKSKG+AH LEAA+SS+P TPAV GP SED APVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRR
Subjt:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGV+DQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.8e-12387.59Show/hide
Query:  LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFELG----FASNVKRKSK
        EWLAKDESGRSFFDVP RFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLL YNPAVRPIE SRPN  L     FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFELG----FASNVKRKSK

Query:  GRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPR-------GQTEAADVSSLGE
        GRAH LEAA+SS+P TPAV GP SED APVIELESSGGPSREKRPR        QTEAADV  LGE
Subjt:  GRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPR-------GQTEAADVSSLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.6e-12187.55Show/hide
Query:  GTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGV+DQVSRISAASLD CLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFV
        ELLKAHSEVE LKAEVE++AEL+KKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNGVLLEE+FRQHPDF 
Subjt:  ELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFV

Query:  GFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVRDLDSDYSDLEEDQV
        GFAKDFSDAGFKFLMKGI SDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQV
Subjt:  GFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVRDLDSDYSDLEEDQV

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.2e-14679.72Show/hide
Query:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        MSSS +S+L  + DLARRLES+LEEIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFEL----GFASNVKRK
        SGEWLAKDESGRSFFDVP RFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLL YNPAVRPIESSRPN EL    GFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFEL----GFASNVKRK

Query:  SKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAAD
        SKGRAH LEAA+SS+PATPAV GP SED A VIELESSGGPSREKRPR QTEA D
Subjt:  SKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.2e-19170.04Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVP RFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLE GLL YNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP

Query:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAV--------AGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVRE
         VR IE+SRPN EL    GF  +VKRKSKGRAH L+    +EP TP V        +GP S    PVIEL+ SGG S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAV--------AGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGVKDQVSRISA  LD  LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +L+KKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFVGFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDF GFAKDFSDAGFKFLMKGI +DMP LQIDL GLKK+Y+E+WASGPNGTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFVGFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVR

Query:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG
        +LDSDYSD+EE+     +   V   +E+V  Q G
Subjt:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092983.7e-10883.86Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+I+PVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE GLL YNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP

Query:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR
        AVRPIESSRPN EL    GFASNVKRKSKG+AH LEAA+SS+P TPAV GP SED APVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRR
Subjt:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGV+DQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.8e-12387.59Show/hide
Query:  LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFELG----FASNVKRKSK
        EWLAKDESGRSFFDVP RFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLL YNPAVRPIE SRPN  L     FAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFELG----FASNVKRKSK

Query:  GRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPR-------GQTEAADVSSLGE
        GRAH LEAA+SS+P TPAV GP SED APVIELESSGGPSREKRPR        QTEAADV  LGE
Subjt:  GRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPR-------GQTEAADVSSLGE

A0A6J1D971 uncharacterized protein LOC1110185382.2e-12187.55Show/hide
Query:  GTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGV+DQVSRISAASLD CLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFV
        ELLKAHSEVE LKAEVE++AEL+KKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNGVLLEE+FRQHPDF 
Subjt:  ELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFV

Query:  GFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVRDLDSDYSDLEEDQV
        GFAKDFSDAGFKFLMKGI SDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQV
Subjt:  GFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVRDLDSDYSDLEEDQV

A0A6J1DXS5 uncharacterized protein LOC1110255024.4e-14679.72Show/hide
Query:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------
        MSSS +S+L  + DLARRLES+LEEIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRG                                    
Subjt:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRG------------------------------------

Query:  --LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
          LPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --LPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFEL----GFASNVKRK
        SGEWLAKDESGRSFFDVP RFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLL YNPAVRPIESSRPN EL    GFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNPAVRPIESSRPNFEL----GFASNVKRK

Query:  SKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAAD
        SKGRAH LEAA+SS+PATPAV GP SED A VIELESSGGPSREKRPR QTEA D
Subjt:  SKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAAD

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-19170.04Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVP RFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLE GLL YNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLYYNP

Query:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAV--------AGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVRE
         VR IE+SRPN EL    GF  +VKRKSKGRAH L+    +EP TP V        +GP S    PVIEL+ SGG S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNFEL----GFASNVKRKSKGRAHVLEAAKSSEPATPAV--------AGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGVKDQVSRISA  LD  LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +L+KKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFVGFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDF GFAKDFSDAGFKFLMKGI +DMP LQIDL GLKK+Y+E+WASGPNGTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLSNGVLLEESFRQHPDFVGFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVR

Query:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG
        +LDSDYSD+EE+     +   V   +E+V  Q G
Subjt:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAACAGTGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACAGGGAAGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGCTTCCCCTTCACCCTTTTGTCCAAGAAT
TTCTCTTCCGAACTGGGTTGGCTCTGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCG
GAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAA
GGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCATTAGGTTTG
GGAACCTAGTTTCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTG
ACCGACAAGCTGCTGCTTGAGTTCGGGCTGCTATATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTTCGAACTTGGATTTGCAAGCAACGTGAAACG
CAAGTCCAAGGGCCGAGCTCATGTTCTTGAGGCCGCCAAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGTCTCGGAAGATCTAGCCCCAGTGATCGAGCTGG
AGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACTGAGGCGGCGGACGTCTCGTCCTTGGGCGAGGAGGTGAGAGAGGAGGCCCCTTTGAAGCGAAGG
AGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTC
CGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCACTGCCTCAGAAGAGCGTCCAAAT
TTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGG
GAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAAGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAA
GGCTGAGGTGGAGGCCAAGGCCGAGCTGGTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCATGCTATCACTAAGGGCTTGGAGAAGGAGAAGTTCC
AACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGA
GTCCTATTGGAGGAATCGTTCAGGCAACATCCCGACTTCGTTGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGGTTCCGACATGCC
TGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAACGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAG
ATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCCTCCGCGTTCCACGGGTGCGCGAGGACGTTTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCAGAC
TATTCCCTTGACCTCAAACGACCCCTCCCAGGTCGGGTCAAGGGCACTCACATGGGCCCTCTTCAGGGGTTAGGCATTTCAATAGAGGCAGGGAAAAGCCGCGTCGGTAC
AACGCTCCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACCTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAACAGTGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACAGGGAAGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGCTTCCCCTTCACCCTTTTGTCCAAGAAT
TTCTCTTCCGAACTGGGTTGGCTCTGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCG
GAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAA
GGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCATTAGGTTTG
GGAACCTAGTTTCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTG
ACCGACAAGCTGCTGCTTGAGTTCGGGCTGCTATATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTTCGAACTTGGATTTGCAAGCAACGTGAAACG
CAAGTCCAAGGGCCGAGCTCATGTTCTTGAGGCCGCCAAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGTCTCGGAAGATCTAGCCCCAGTGATCGAGCTGG
AGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACTGAGGCGGCGGACGTCTCGTCCTTGGGCGAGGAGGTGAGAGAGGAGGCCCCTTTGAAGCGAAGG
AGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTC
CGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCACTGCCTCAGAAGAGCGTCCAAAT
TTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGG
GAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAAGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAA
GGCTGAGGTGGAGGCCAAGGCCGAGCTGGTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCATGCTATCACTAAGGGCTTGGAGAAGGAGAAGTTCC
AACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGA
GTCCTATTGGAGGAATCGTTCAGGCAACATCCCGACTTCGTTGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGGTTCCGACATGCC
TGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAACGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAG
ATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCCTCCGCGTTCCACGGGTGCGCGAGGACGTTTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCAGAC
TATTCCCTTGACCTCAAACGACCCCTCCCAGGTCGGGTCAAGGGCACTCACATGGGCCCTCTTCAGGGGTTAGGCATTTCAATAGAGGCAGGGAAAAGCCGCGTCGGTAC
AACGCTCCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACCTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCTTAA
Protein sequenceShow/hide protein sequence
MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEA
ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPIRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLV
TDKLLLEFGLLYYNPAVRPIESSRPNFELGFASNVKRKSKGRAHVLEAAKSSEPATPAVAGPVSEDLAPVIELESSGGPSREKRPRGQTEAADVSSLGEEVREEAPLKRR
RKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVKDQVSRISAASLDHCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGR
EALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELVKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNG
VLLEESFRQHPDFVGFAKDFSDAGFKFLMKGIGSDMPDLQIDLGGLKKRYAEQWASGPNGTPGPQALVDKYVRDLDSDYSDLEEDQVLRVPRVREDVSFQIGQYVRPRSD
YSLDLKRPLPGRVKGTHMGPLQGLGISIEAGKSRVGTTLHLGPRTELLALPTFCAPWGLVVNCP