; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g06470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g06470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:5331632..5334830
RNA-Seq ExpressionMoc07g06470
SyntenyMoc07g06470
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.9e-12088.19Show/hide
Query:  EYPFRIPEHYLGSLRREFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV
        EY  R+P H      +EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV
Subjt:  EYPFRIPEHYLGSLRREFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV

Query:  RKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA
        RKWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FA
Subjt:  RKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA

Query:  SSVKCKSKGRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ
        S VK KSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELES GG    + P+
Subjt:  SSVKCKSKGRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.8e-13392.28Show/hide
Query:  GTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRD VSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGATQEGAPQAGS
        GFAKDFSDAGFKF MKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGATQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.8e-10473.54Show/hide
Query:  MGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV TRFR+EPSSSGV+D VSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGATQEGAP
        FDGFAKDFSDAGFKF MKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD+EE+        ++G TQE  P
Subjt:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGATQEGAP

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.5e-14379.48Show/hide
Query:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPFRIPEHYLGSLRR---------------------------------------
        MSSS SSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYP RIPEHYLGSLRR                                       
Subjt:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPFRIPEHYLGSLRR---------------------------------------

Query:  --------EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASE
                EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL  VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  --------EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASE

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKCKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VK KSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKCKSK

Query:  GRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ
        GRAHALEAAQSS+P TPAV GPASEDPA VIELES GG    + P+
Subjt:  GRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.6e-17465.05Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKCKSKGRAHALEAAQSSRPTTPAV--------AGPASEDPAPVIELESFGG--------------------SLEGE
         VR IE+SRPNSELAMVCGF  SVK KSKGRAHAL+    + P TP V        +GP+S  P PVIEL+  GG                     + GE
Subjt:  AVRPIESSRPNSELAMVCGFASSVKCKSKGRAHALEAAQSSRPTTPAV--------AGPASEDPAPVIELESFGG--------------------SLEGE

Query:  AP----------QGSHRGGGRPAF--GQGDRVDDPAAKMGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P            S   G R        D VDDP A+M GTS+V  RF +EPSSSGV+D VSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  AP----------QGSHRGGGRPAF--GQGDRVDDPAAKMGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKF MKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDLEED--------QVGATQEGAP--QAGS
        LDSDYSD+EE+        +VG TQE  P  Q GS
Subjt:  LDSDYSDLEED--------QVGATQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138261.9e-12088.19Show/hide
Query:  EYPFRIPEHYLGSLRREFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV
        EY  R+P H      +EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV
Subjt:  EYPFRIPEHYLGSLRREFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWV

Query:  RKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA
        RKWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FA
Subjt:  RKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA

Query:  SSVKCKSKGRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ
        S VK KSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELES GG    + P+
Subjt:  SSVKCKSKGRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ

A0A6J1D971 uncharacterized protein LOC1110185388.7e-13492.28Show/hide
Query:  GTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRD VSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGATQEGAPQAGS
        GFAKDFSDAGFKF MKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGATQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC1110199098.5e-10573.54Show/hide
Query:  MGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV TRFR+EPSSSGV+D VSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGATQEGAP
        FDGFAKDFSDAGFKF MKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD+EE+        ++G TQE  P
Subjt:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGATQEGAP

A0A6J1DXS5 uncharacterized protein LOC1110255024.6e-14379.48Show/hide
Query:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPFRIPEHYLGSLRR---------------------------------------
        MSSS SSNL SDLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYP RIPEHYLGSLRR                                       
Subjt:  MSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPFRIPEHYLGSLRR---------------------------------------

Query:  --------EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASE
                EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL  VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  --------EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASE

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKCKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VK KSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKCKSK

Query:  GRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ
        GRAHALEAAQSS+P TPAV GPASEDPA VIELES GG    + P+
Subjt:  GRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQ

A0A6J1DZB3 uncharacterized protein LOC1110256651.7e-17465.05Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKCKSKGRAHALEAAQSSRPTTPAV--------AGPASEDPAPVIELESFGG--------------------SLEGE
         VR IE+SRPNSELAMVCGF  SVK KSKGRAHAL+    + P TP V        +GP+S  P PVIEL+  GG                     + GE
Subjt:  AVRPIESSRPNSELAMVCGFASSVKCKSKGRAHALEAAQSSRPTTPAV--------AGPASEDPAPVIELESFGG--------------------SLEGE

Query:  AP----------QGSHRGGGRPAF--GQGDRVDDPAAKMGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P            S   G R        D VDDP A+M GTS+V  RF +EPSSSGV+D VSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  AP----------QGSHRGGGRPAF--GQGDRVDDPAAKMGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKF MKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDLEED--------QVGATQEGAP--QAGS
        LDSDYSD+EE+        +VG TQE  P  Q GS
Subjt:  LDSDYSDLEED--------QVGATQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACCGAAATTTTGAACATTGCTTTTACCTATGTATTTGTTTATCTACTTTTTGAAAATATATTCTGTTTATGCAAGGATATGCACAACAGTGTGTTCCAG
ATTGCAGCTCGAACTTGGCCTCCGGACCGACCTGAACACTTGGGCGAACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGT
TTAGTTCGAGGCCAGAAATCATCGTACCTGATCGGGGAGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCG
GTCTCTTCCCTCTCTCTTTCGAACATAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATA
GAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTTTAGGATACCTGAGCACTACCTCGGATCCCTT
CGTAGGGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGG
GATAGTGAGGAGGCCGAGCTGTTGGGCGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAA
GGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGAGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCC
TTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGTGCTTT
CCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAGCTACTCCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCG
AACTCTGAACTTGCCATGGTTTGCGGGTTTGCAAGCAGCGTGAAGTGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAGACCTACCACC
CCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTTTGGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCACACCGAGGC
GGTGGACGCCCTGCCTTTGGGCAAGGAGATCGGGTGGACGATCCTGCTGCCAAGATGGGCGGGACGTCCGACGTGACGACGCGGTTCAGAGTTGAGCCGTCAAGT
TCCGGGGTGAGGGACCATGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGG
ACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAG
GAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCC
GAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAG
GACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAG
GAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTGCTTCCGACATGCCCGACCTT
CAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGAT
CTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCGCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGACCGAAATTTTGAACATTGCTTTTACCTATGTATTTGTTTATCTACTTTTTGAAAATATATTCTGTTTATGCAAGGATATGCACAACAGTGTGTTCCAG
ATTGCAGCTCGAACTTGGCCTCCGGACCGACCTGAACACTTGGGCGAACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGT
TTAGTTCGAGGCCAGAAATCATCGTACCTGATCGGGGAGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCG
GTCTCTTCCCTCTCTCTTTCGAACATAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATA
GAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTTTAGGATACCTGAGCACTACCTCGGATCCCTT
CGTAGGGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGG
GATAGTGAGGAGGCCGAGCTGTTGGGCGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAA
GGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGAGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCC
TTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGTGCTTT
CCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAGCTACTCCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCG
AACTCTGAACTTGCCATGGTTTGCGGGTTTGCAAGCAGCGTGAAGTGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAGACCTACCACC
CCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTTTGGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCACACCGAGGC
GGTGGACGCCCTGCCTTTGGGCAAGGAGATCGGGTGGACGATCCTGCTGCCAAGATGGGCGGGACGTCCGACGTGACGACGCGGTTCAGAGTTGAGCCGTCAAGT
TCCGGGGTGAGGGACCATGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGG
ACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAG
GAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCC
GAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAG
GACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAG
GAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTGCTTCCGACATGCCCGACCTT
CAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGAT
CTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCGCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSTEILNIAFTYVFVYLLFENIFCLCKDMHNSVFQIAARTWPPDRPEHLGEPAQKGEHSDDQVSIGRIPSLVRGQKSSYLIGESYLTFPEFLEFDLKAARTLGRS
VSSLSLSNIVAMSSSFSSNLGSDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPFRIPEHYLGSLRREFLFRTGLAPAQVAPNGWGVIFALAILFWLRAR
DSEEAELLGVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKECF
PRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKCKSKGRAHALEAAQSSRPTTPAVAGPASEDPAPVIELESFGGSLEGEAPQGSHRG
GGRPAFGQGDRVDDPAAKMGGTSDVTTRFRVEPSSSGVRDHVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE
EFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLE
ESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGATQEGAPQAGS