CuGenDBv2

Moc07g01760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g01760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:1374313..1377016
RNA-Seq Expression: Moc07g01760
Synteny: Moc07g01760
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA
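
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the hypothetical helper `parse_location` below assumes the coordinates are 1-based and inclusive; the page itself does not state the convention, so the computed span length is only valid under that assumption.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption (not stated on the page): coordinates are 1-based and
    inclusive, so the span is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr7:1374313..1377016")
print(chrom, start, end, span)
```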


Homology
GenBank top hits | e value | % identity | Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 1.7e-117 | % identity: 91.18
Query:  EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDES
        EFLFRT LAP QVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFY+CARKGAGGIVKG TSIKGWVRKWFYASGEWLAKDES
Subjt:  EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDES

Query:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEA
        GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEA
Subjt:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEA

Query:  AQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ
        AQSS+P  PAV GPASEDPAPVIELESSGG    + P+
Subjt:  AQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 2.8e-123 | % identity: 85.26
Query:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVS ISAASLD CLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL VKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEE+RR+AQLRAAHAIT+GL++EKFQLLKEKDD+LQALEAK++EL+HATAELE  KERLSNG L EE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEENQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EE+QVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEENQVGTTQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] | e value: 9.9e-105 | % identity: 74.23
Query:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVS ISA  LD CL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPD
        K ELLKA  EV IL+AEV+AKAELLKKE  + KA LRAAHAITKGL+KEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+L EESFRQH D
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEE--------NQVGTTQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE        N++GTTQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEE--------NQVGTTQEGAP

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 3.9e-125 | % identity: 72.41
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTS---------------------------------------------------------
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGED+DASTS                                                         
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTS---------------------------------------------------------

Query:  ----------EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYA
                  EFLFRT LAP QVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  ----------EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ
        SKGRAHALEAAQSS+PA PAV GPASEDPA VIELESSGG    + P+
Subjt:  SKGRAHALEAAQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 1.9e-177 | % identity: 66.54
Query:  ICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPAPPAV--------AGPASEDPAPVIELESSGG--------------------SFEGE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP  P V        +GP+S  P PVIEL+ SGG                       GE
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPAPPAV--------AGPASEDPAPVIELESSGG--------------------SFEGE

Query:  AP-----------QGSDRGG-GRLVLGRGDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P             S+ G  G L     D VDDPEARM GTS+V  RF +EPSSSGV+DQVS ISA  LD  LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  AP-----------QGSDRGG-GRLVLGRGDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQ
        ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE  + KA LRAAHAITKGL+KEKFQLLKEKDD+ Q
Subjt:  ASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQ

Query:  ALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRD
         LE K+  +   T EL+ +KERL+NG L EESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+
Subjt:  ALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRD

Query:  LDSDYSDLEEN--------QVGTTQEGAP--QAGS
        LDSDYSD+EE         +VGTTQE  P  Q GS
Subjt:  LDSDYSDLEEN--------QVGTTQEGAP--QAGS

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 8.4e-118 | % identity: 91.18
Query:  EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDES
        EFLFRT LAP QVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFY+CARKGAGGIVKG TSIKGWVRKWFYASGEWLAKDES
Subjt:  EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDES

Query:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEA
        GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEA
Subjt:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEA

Query:  AQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ
        AQSS+P  PAV GPASEDPAPVIELESSGG    + P+
Subjt:  AQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 1.3e-123 | % identity: 85.26
Query:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVS ISAASLD CLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL VKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEE+RR+AQLRAAHAIT+GL++EKFQLLKEKDD+LQALEAK++EL+HATAELE  KERLSNG L EE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEENQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EE+QVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEENQVGTTQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC111019909 | e value: 4.8e-105 | % identity: 74.23
Query:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVS ISA  LD CL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPD
        K ELLKA  EV IL+AEV+AKAELLKKE  + KA LRAAHAITKGL+KEKFQLLKEKDD+ Q LE K+  +   TAEL+ +KERL+NG+L EESFRQH D
Subjt:  KDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEE--------NQVGTTQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL  LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE        N++GTTQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEE--------NQVGTTQEGAP

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 1.9e-125 | % identity: 72.41
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTS---------------------------------------------------------
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGED+DASTS                                                         
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTS---------------------------------------------------------

Query:  ----------EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYA
                  EFLFRT LAP QVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  ----------EFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ
        SKGRAHALEAAQSS+PA PAV GPASEDPA VIELESSGG    + P+
Subjt:  SKGRAHALEAAQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQ

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 9.4e-178 | % identity: 66.54
Query:  ICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPAPPAV--------AGPASEDPAPVIELESSGG--------------------SFEGE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP  P V        +GP+S  P PVIEL+ SGG                       GE
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPAPPAV--------AGPASEDPAPVIELESSGG--------------------SFEGE

Query:  AP-----------QGSDRGG-GRLVLGRGDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P             S+ G  G L     D VDDPEARM GTS+V  RF +EPSSSGV+DQVS ISA  LD  LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  AP-----------QGSDRGG-GRLVLGRGDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQ
        ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE  + KA LRAAHAITKGL+KEKFQLLKEKDD+ Q
Subjt:  ASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQ

Query:  ALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRD
         LE K+  +   T EL+ +KERL+NG L EESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+
Subjt:  ALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRD

Query:  LDSDYSDLEEN--------QVGTTQEGAP--QAGS
        LDSDYSD+EE         +VGTTQE  P  Q GS
Subjt:  LDSDYSDLEEN--------QVGTTQEGAP--QAGS

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGGTCACCTTGGAGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCGGAG
AAGTTCATTCGATTCGCTTCGGACACGTGGCGACTTCCCATTCGTGGGAAAACATCACTGTTGCGGTGCATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGATTTTCGGGAGGATCCCAGCCGCTCGTTGATTACACGTGTACGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGACATG
TCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAG
GATAATGATGCCTCCACCTCGGAATTTCTCTTCCGAACTAGGTTGGCTCCGACTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTT
TGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTAT
ATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGAC
GAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATAT
TACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATT
GAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGT
TCGGAACCTGCCCCTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGGTCTTTCGAGGGAGAAGCGCCCCAG
GGGTCAGACCGAGGCGGTGGACGTCTCGTCCTTGGGCGAGGAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGA
GTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCACATCTCGGCTGCAAGTTTGGACTGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGG
TCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGCCCGTGAAAGCCGAGCTGGATGGGAGGGAAGCTCTAGCA
GCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAG
GTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGAACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGAAGAAGGAGAAGTTCCAA
CTCCTCAAGGAGAAGGACGACATACTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAAT
GGAGCCCTATTCGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCC
GACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGAT
AAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGAATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequence
ATGGGTCACCTTGGAGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCGGAG
AAGTTCATTCGATTCGCTTCGGACACGTGGCGACTTCCCATTCGTGGGAAAACATCACTGTTGCGGTGCATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGATTTTCGGGAGGATCCCAGCCGCTCGTTGATTACACGTGTACGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGACATG
TCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAG
GATAATGATGCCTCCACCTCGGAATTTCTCTTCCGAACTAGGTTGGCTCCGACTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTT
TGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTAT
ATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGAC
GAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATAT
TACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATT
GAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGT
TCGGAACCTGCCCCTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGGTCTTTCGAGGGAGAAGCGCCCCAG
GGGTCAGACCGAGGCGGTGGACGTCTCGTCCTTGGGCGAGGAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGA
GTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCACATCTCGGCTGCAAGTTTGGACTGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGG
TCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGCCCGTGAAAGCCGAGCTGGATGGGAGGGAAGCTCTAGCA
GCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAG
GTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGAACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGAAGAAGGAGAAGTTCCAA
CTCCTCAAGGAGAAGGACGACATACTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAAT
GGAGCCCTATTCGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCC
GACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGAT
AAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGAATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequence
MGHLGAPMAVLHVSRVFFSPNIGPLSVWSDLDLAEKFIRFASDTWRLPIRGKTSLLRCIYRRNIQIFRRFGFSGGSQPLVDYTCTLEPLVGRSLPSLSLSNVVDM
SSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDNDASTSEFLFRTRLAPTQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFY
ICARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPI
ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPAPPAVAGPASEDPAPVIELESSGGSFEGEAPQGSDRGGGRLVLGRGDRVDDPEARMGGTSDVTARFR
VEPSSSGVRDQVSHISAASLDCCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALPVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE
VEAKAELLKKEENRRKAQLRAAHAITKGLKKEKFQLLKEKDDILQALEAKEEELKHATAELEMVKERLSNGALFEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS
DMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEENQVGTTQEGAPQAGS
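
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is only an illustration: it assumes the standard nuclear genetic code (the page does not say which translation table was used), and `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
# Assumption: the standard nuclear table applies to this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string to protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGGTCACCTTGGAGCACCAATGGCT"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, joined into one string, is one way to confirm the two entries agree.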