; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr3:10061335..10070810
RNA-Seq ExpressionMoc03g14960
SyntenyMoc03g14960
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.2e-13993.33Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFAS
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIE SR NS LAMVC FAS
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFAS

Query:  GVKRKSKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPP
        GVKRKSK RAHALE AQSSKP TPAVVGP S+DPAPVIELESSGGPSREKRPRDQTEAVDAQTEAAD PP
Subjt:  GVKRKSKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPP

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia]6.1e-8194.23Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEY LR PLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRG
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRG
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.1e-10195.31Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSEL
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSEL
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]4.8e-10294.33Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGAD IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAM
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV PIESSR NSEL M
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.2e-16484.23Show/hide
Query:  SDASDLREDLSRSLITRAARTPGRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        S +S+L  DL+R L ++        +    +S+    S + +S  G +   RIPEHYLGSLR+GFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  SDASDLREDLSRSLITRAARTPGRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYA
        LRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAEL DVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYA

Query:  SGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFASGVKRK
        SGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSELAMVCGFASGVKRK
Subjt:  SGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFASGVKRK

Query:  SKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVD
        SK RAHALE AQSSKPATPAVVGP S+DPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVD

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138261.1e-13993.33Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFAS
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIE SR NS LAMVC FAS
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFAS

Query:  GVKRKSKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPP
        GVKRKSK RAHALE AQSSKP TPAVVGP S+DPAPVIELESSGGPSREKRPRDQTEAVDAQTEAAD PP
Subjt:  GVKRKSKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPP

A0A6J1DPM7 uncharacterized protein LOC1110223713.0e-8194.23Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEY LR PLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRG
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRG
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRG

A0A6J1DWD2 uncharacterized protein LOC1110246805.2e-10295.31Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSEL
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSEL
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251082.3e-10294.33Show/hide
Query:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPG+FYMCARKGAD IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVR

Query:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAM
        KWFYASGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV PIESSR NSEL M
Subjt:  KWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255025.7e-16584.23Show/hide
Query:  SDASDLREDLSRSLITRAARTPGRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        S +S+L  DL+R L ++        +    +S+    S + +S  G +   RIPEHYLGSLR+GFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  SDASDLREDLSRSLITRAARTPGRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYA
        LRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAEL DVDQLLACFEAKRIAKKPG+FYMCARKGA  IVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYA

Query:  SGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFASGVKRK
        SGEW+AKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSELAMVCGFASGVKRK
Subjt:  SGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVCGFASGVKRK

Query:  SKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVD
        SK RAHALE AQSSKPATPAVVGP S+DPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVD

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic5.2e-0627.64Show/hide
Query:  EHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLD
        E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E    + 
Subjt:  EHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLD

Query:  VDQLLACFEAKRIAK-KPGQFYMCARKGADDIVKGPTSIKGWVRKWFY-ASGEWIAKDESGRSFFDVPTRFG----NLVSIRPVPELTHASFDTLKYYK
        +  L    E +R+ K +  ++Y+   KG   I   P+  + +   +F+ A  + I +D  G     V TR+G     L  + P+P+   ++F  L   K
Subjt:  VDQLLACFEAKRIAK-KPGQFYMCARKGADDIVKGPTSIKGWVRKWFY-ASGEWIAKDESGRSFFDVPTRFG----NLVSIRPVPELTHASFDTLKYYK

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related8.7e-0932.09Show/hide
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGQFYMCARKGADDIVKGPTS-IKGWVRKWFY
        + PG +Y  A K    IV G  S I GW R++F+
Subjt:  KKPGQFYMCARKGADDIVKGPTS-IKGWVRKWFY

AT3G42060.1 myosin heavy chain-related7.7e-0522.96Show/hide
Query:  GRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHTFVQEFLFRTGLAPA
        GR   F   +   A S S S  +G   A R    + G        PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +
Subjt:  GRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHTFVQEFLFRTGLAPA

Query:  QVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYASGEWIAKDESGRSFFDV
        Q++         L IL       +EE  ++D+D            K        +R+G        + ++ W + +F+A    ++ D++  S  ++
Subjt:  QVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGADDIVKGPTSIKGWVRKWFYASGEWIAKDESGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCACCTGCTTCGTAAAGCCTACTATTTCAAGGACAAAGCCCTAGACATTTGCGTTAAGAACGTTAGGCTAGATCTGCGAGCTGAGTGGCTCGGGAGGAAGAAGGC
AAGGCTTGAGGCAAAGAATACAAAGTTGTTCGCCGTAACGACAAACTGGTTGCTCAGAAAGTGGAGCTTATTGCTCGAGTGGGTCAATGCCGAATCGATGTCAAAGGGGG
TGGCTATTACGAAAGAAGCTTCGAATGGGTGTGGGTCCATGCCAAAACATGTCGGAATGCTACTATTTCGTCGGATTGCAGGCTGTACGTCGAAATGTGTCTGCAACGGC
GTCATCACTGTTGTTACTGTTCCTGTGGGTCGTGGGTCAAAAGATTTTAAAGTTGTGTCGTCGAAAATAACCTTCAGCTGTTGTGGAAGATCGTCCAAAATTGTTCTGGA
TGGGAAGACGCCGTTCGTGGAGGAATCACGTCGGAGAAGGCCGCATGCCCCTGCCGAACTGGTTGGGAATGCGCCACCGCTACAGGTGTTGTTGTCGTCAGAGAAACCAA
GATGTGTTGCCGCTGGATTGCACGAGGGCGCTGCAGATCTATCAATGCTGGATCGCTGTCGCTACATCGAGAAAACGCTCGTGGAATACGCATCGTGTCGTGGGTTTCGT
TTTAAGCTACAACTGCCACCGTTGCTACCCGAGGTATTACGCTACCACGAAAACGTCAAAAATAGAGCTTCGCCGAACTGCTGTGTTGTTATCGGAGTGGGCGTCACTGG
AGAGGTGGCTGAAAGCGAGGGAAGGGTGGGCGGCTGCTACAACGAGGAAAAAGGGAGCAAGAAGAAGGAGATACTTGGCAGTGTGTTCGTGATTGTAGCTCGAACTCGGC
CTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCT
GGTCCGATCTCAACCTGGCAGAGAAGTTCATCCGACTTGCTTTGGACACGTGGCGACTTCCTATTTGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATT
CAAATATTCTGACGCTTCGGATCTAAGAGAGGATCTTAGCCGCTCGTTGATTACACGTGCAGCTCGAACCCCTGGTAGGTCGGTCACTTTCCTCTCTCTTTCGAACGTAA
TTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGGTCCGATATAGCTCGTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAAGGGGTTCGCTATCCCTGAGAACATC
CTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACACTTT
TGTCCAAGAGTTTCTCTTTCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCTACGACCTCGGGATA
GTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCCGGTCAGTTCTATATGTGCGCAAGGAAAGGCGCAGAC
GATATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGATCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCC
CACTAGGTTTGGGAACCTAGTTTCAATCCGGCCAGTCCCCGAGCTTACGCATGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCG
GAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCATCCCATTGAATCCTCAAGGCTGAACTCTGAACTTGCCATGGTTTGC
GGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGAGCCGAGCCCATGCTCTTGAGGTCGCCCAGAGTTCGAAACCTGCCACACCTGCTGTGGTAGGGCCTACCTCGAAAGA
TCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAACGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGATGCCCCGC
CTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTCACCTGCTTCGTAAAGCCTACTATTTCAAGGACAAAGCCCTAGACATTTGCGTTAAGAACGTTAGGCTAGATCTGCGAGCTGAGTGGCTCGGGAGGAAGAAGGC
AAGGCTTGAGGCAAAGAATACAAAGTTGTTCGCCGTAACGACAAACTGGTTGCTCAGAAAGTGGAGCTTATTGCTCGAGTGGGTCAATGCCGAATCGATGTCAAAGGGGG
TGGCTATTACGAAAGAAGCTTCGAATGGGTGTGGGTCCATGCCAAAACATGTCGGAATGCTACTATTTCGTCGGATTGCAGGCTGTACGTCGAAATGTGTCTGCAACGGC
GTCATCACTGTTGTTACTGTTCCTGTGGGTCGTGGGTCAAAAGATTTTAAAGTTGTGTCGTCGAAAATAACCTTCAGCTGTTGTGGAAGATCGTCCAAAATTGTTCTGGA
TGGGAAGACGCCGTTCGTGGAGGAATCACGTCGGAGAAGGCCGCATGCCCCTGCCGAACTGGTTGGGAATGCGCCACCGCTACAGGTGTTGTTGTCGTCAGAGAAACCAA
GATGTGTTGCCGCTGGATTGCACGAGGGCGCTGCAGATCTATCAATGCTGGATCGCTGTCGCTACATCGAGAAAACGCTCGTGGAATACGCATCGTGTCGTGGGTTTCGT
TTTAAGCTACAACTGCCACCGTTGCTACCCGAGGTATTACGCTACCACGAAAACGTCAAAAATAGAGCTTCGCCGAACTGCTGTGTTGTTATCGGAGTGGGCGTCACTGG
AGAGGTGGCTGAAAGCGAGGGAAGGGTGGGCGGCTGCTACAACGAGGAAAAAGGGAGCAAGAAGAAGGAGATACTTGGCAGTGTGTTCGTGATTGTAGCTCGAACTCGGC
CTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCT
GGTCCGATCTCAACCTGGCAGAGAAGTTCATCCGACTTGCTTTGGACACGTGGCGACTTCCTATTTGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATT
CAAATATTCTGACGCTTCGGATCTAAGAGAGGATCTTAGCCGCTCGTTGATTACACGTGCAGCTCGAACCCCTGGTAGGTCGGTCACTTTCCTCTCTCTTTCGAACGTAA
TTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGGTCCGATATAGCTCGTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAAGGGGTTCGCTATCCCTGAGAACATC
CTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACACTTT
TGTCCAAGAGTTTCTCTTTCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTCTTTTGGCTACGACCTCGGGATA
GTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCCGGTCAGTTCTATATGTGCGCAAGGAAAGGCGCAGAC
GATATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGATCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCC
CACTAGGTTTGGGAACCTAGTTTCAATCCGGCCAGTCCCCGAGCTTACGCATGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCG
GAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCATCCCATTGAATCCTCAAGGCTGAACTCTGAACTTGCCATGGTTTGC
GGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGAGCCGAGCCCATGCTCTTGAGGTCGCCCAGAGTTCGAAACCTGCCACACCTGCTGTGGTAGGGCCTACCTCGAAAGA
TCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAACGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGATGCCCCGC
CTTAA
Protein sequenceShow/hide protein sequence
MRHLLRKAYYFKDKALDICVKNVRLDLRAEWLGRKKARLEAKNTKLFAVTTNWLLRKWSLLLEWVNAESMSKGVAITKEASNGCGSMPKHVGMLLFRRIAGCTSKCVCNG
VITVVTVPVGRGSKDFKVVSSKITFSCCGRSSKIVLDGKTPFVEESRRRRPHAPAELVGNAPPLQVLLSSEKPRCVAAGLHEGAADLSMLDRCRYIEKTLVEYASCRGFR
FKLQLPPLLPEVLRYHENVKNRASPNCCVVIGVGVTGEVAESEGRVGGCYNEEKGSKKKEILGSVFVIVARTRPPDRPEHLGGPAQKGEHSDDQVSIGYSLPQTLAPSLS
GPISTWQRSSSDLLWTRGDFLFVGKYNRRGRFIVGIFKYSDASDLREDLSRSLITRAARTPGRSVTFLSLSNVIAMSSSISSNLGSDIARRIPEHYLGSLRKGFAIPENI
LLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHTFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRPRDSEEAELLDVDQLLACFEAKRIAKKPGQFYMCARKGAD
DIVKGPTSIKGWVRKWFYASGEWIAKDESGRSFFDVPTRFGNLVSIRPVPELTHASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRLNSELAMVC
GFASGVKRKSKSRAHALEVAQSSKPATPAVVGPTSKDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPP