; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g13410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g13410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr4:10408259..10411546
RNA-Seq ExpressionMoc04g13410
SyntenyMoc04g13410
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]9.7e-13457.53Show/hide
Query:  VSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLS
        +SIKPIPEL QATFDTLK+YKDNFP+GRKIGTLVTDKLLLESGLLDY+PLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V  ++PVTP V  +
Subjt:  VSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLS

Query:  EAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGESPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKM
         AQ  +GPSSA PTPVIEL+ +G RS EKR R ESEALDVSPL EV+                                                     
Subjt:  EAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGESPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKM

Query:  EPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEK
                                                                                               +AKAELLK+E E+
Subjt:  EPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEK

Query:  HKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL
        HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q L+ KDA+I RL AELK  KERLTNG+LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL
Subjt:  HKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL

Query:  SDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS
         DLKK+Y+EKWASGPNGT GP SLVD YVR+LDSDYSD++E++ PSQ+  E+GTTQE VPSQQDGSQEVNLLGSQGELSSHLGSS
Subjt:  SDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]4.7e-12082.87Show/hide
Query:  RFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGE
        RF+ME SSSGVKDQVSRISATCLD CLRRAS+F                 AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATTLKGELLKAQGE
Subjt:  RFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVL++KDASI RLT ELKDLKERLT+G+LLEESFRQHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDG
        AGFKFLMKGIAADMPHLQIDLSDLKK+YSE WASGPNGTPGPQSLVD YVRELDSDYSD+EEEDAPSQ+  ++GTTQEE PSQ  G
Subjt:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]4.8e-14189.21Show/hide
Query:  MGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
        MGGT DVRTRF+MEPSSSGVKDQVSRISATCLD CL+RASKF                 AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVL+ KD SI RLTAELKDLKERLTNGSLLEESFRQH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSEKWASGPNGTPGPQSLV  YVRELDSDYSD+EEEDAPSQ+  EIGTTQEEVPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.9e-15377.18Show/hide
Query:  SDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+ E  LARRLES+LEEIEN R SDDGEDSD STSGQG EYPSR+PE YL  LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKD
         QEFL RTGLA AQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIK WV KWF+ASGEWLAKD
Subjt:  AQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKD

Query:  ESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL
        ESGR FFD+  RFGNLVSI+P+PEL QA+FDTLKYYK+ FP+GRK+GTLVTD+LLLESGLLDY+P VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL
Subjt:  ESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALD
        +    ++P TP V         GP+S  P  VIEL  SGG S EKR R+++EA+D
Subjt:  KTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.1e-24987.08Show/hide
Query:  MCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDP
        MCARKG GGIVKGPTSIK WV KWFFASGEWLAKDESGR FFD+  RFGNLVSIK IPEL QATFDTLK+YKD+FP+ RKI TLVTDKLLLESGLLDY+P
Subjt:  MCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDP

Query:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGE
        LVR IEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVP + AQGNSGPSSAVPTPVIEL+LSGGRS EKR REESEALDVSPLNEV+GE
Subjt:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGE

Query:  SPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFV
        SPLRRRRKKKKTSSSSEA ARGTLPTSHA+LVDDPEARM GTS+VR RF MEPSSSGVKDQVSRISATCLD  LRRASKF                 AF+
Subjt:  SPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFV

Query:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV
        ASIH A+MVKAELDGREALAAKERENS AALEAATTLKGELLKAQGEV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV

Query:  LDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVREL
        L+EKDASI RLT ELKDLKERLTNG+LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL+ LKKKYSEKWASGPNGTP PQSLVD YVREL
Subjt:  LDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVREL

Query:  DSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGS
        DSDYSD+EEEDAPSQ+  E+GTTQEEVPSQQ GS
Subjt:  DSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124674.7e-13457.53Show/hide
Query:  VSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLS
        +SIKPIPEL QATFDTLK+YKDNFP+GRKIGTLVTDKLLLESGLLDY+PLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V  ++PVTP V  +
Subjt:  VSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLS

Query:  EAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGESPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKM
         AQ  +GPSSA PTPVIEL+ +G RS EKR R ESEALDVSPL EV+                                                     
Subjt:  EAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGESPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKM

Query:  EPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEK
                                                                                               +AKAELLK+E E+
Subjt:  EPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEK

Query:  HKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL
        HKAHLRAAHAITKGLEKEKFQLLKEKDD+ Q L+ KDA+I RL AELK  KERLTNG+LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAAD+PHL++DL
Subjt:  HKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL

Query:  SDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS
         DLKK+Y+EKWASGPNGT GP SLVD YVR+LDSDYSD++E++ PSQ+  E+GTTQE VPSQQDGSQEVNLLGSQGELSSHLGSS
Subjt:  SDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS

A0A6J1D1N9 uncharacterized protein LOC1110161932.3e-12082.87Show/hide
Query:  RFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGE
        RF+ME SSSGVKDQVSRISATCLD CLRRAS+F                 AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATTLKGELLKAQGE
Subjt:  RFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVL++KDASI RLT ELKDLKERLT+G+LLEESFRQHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSD

Query:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDG
        AGFKFLMKGIAADMPHLQIDLSDLKK+YSE WASGPNGTPGPQSLVD YVRELDSDYSD+EEEDAPSQ+  ++GTTQEE PSQ  G
Subjt:  AGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDG

A0A6J1DF31 uncharacterized protein LOC1110199092.3e-14189.21Show/hide
Query:  MGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
        MGGT DVRTRF+MEPSSSGVKDQVSRISATCLD CL+RASKF                 AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVL+ KD SI RLTAELKDLKERLTNGSLLEESFRQH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIAADMPHLQIDLS+LKKKYSEKWASGPNGTPGPQSLV  YVRELDSDYSD+EEEDAPSQ+  EIGTTQEEVPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

A0A6J1DXS5 uncharacterized protein LOC1110255029.1e-15477.18Show/hide
Query:  SDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+ E  LARRLES+LEEIEN R SDDGEDSD STSGQG EYPSR+PE YL  LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKD
         QEFL RTGLA AQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIK WV KWF+ASGEWLAKD
Subjt:  AQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKD

Query:  ESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL
        ESGR FFD+  RFGNLVSI+P+PEL QA+FDTLKYYK+ FP+GRK+GTLVTD+LLLESGLLDY+P VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL
Subjt:  ESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHAL

Query:  KTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALD
        +    ++P TP V         GP+S  P  VIEL  SGG S EKR R+++EA+D
Subjt:  KTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-24987.08Show/hide
Query:  MCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDP
        MCARKG GGIVKGPTSIK WV KWFFASGEWLAKDESGR FFD+  RFGNLVSIK IPEL QATFDTLK+YKD+FP+ RKI TLVTDKLLLESGLLDY+P
Subjt:  MCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDP

Query:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGE
        LVR IEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVP + AQGNSGPSSAVPTPVIEL+LSGGRS EKR REESEALDVSPLNEV+GE
Subjt:  LVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGE

Query:  SPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFV
        SPLRRRRKKKKTSSSSEA ARGTLPTSHA+LVDDPEARM GTS+VR RF MEPSSSGVKDQVSRISATCLD  LRRASKF                 AF+
Subjt:  SPLRRRRKKKKTSSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKF-----------------AFV

Query:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV
        ASIH A+MVKAELDGREALAAKERENS AALEAATTLKGELLKAQGEV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV
Subjt:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQV

Query:  LDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVREL
        L+EKDASI RLT ELKDLKERLTNG+LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL+ LKKKYSEKWASGPNGTP PQSLVD YVREL
Subjt:  LDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVREL

Query:  DSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGS
        DSDYSD+EEEDAPSQ+  E+GTTQEEVPSQQ GS
Subjt:  DSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGS

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic2.6e-0422.31Show/hide
Query:  PSDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPL
        P+++E++L    +   E+ E  +     +    +  G      S   E+ L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+
Subjt:  PSDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPL

Query:  HPFAQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAK-KPGRYYMCARKGAGGIVKGPTSIKEWVDKWFF-ASGE
             E++    +A +Q+       + +L  L  +  R  +    +++  L    E +R+ K +  RYY+   KG   I   P+  + + D +FF A  +
Subjt:  HPFAQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAK-KPGRYYMCARKGAGGIVKGPTSIKEWVDKWFF-ASGE

Query:  WLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFT--------G
         + +D  G            L  ++PIP+   + F  L   K ++ K         +++     LL         E+S   ++  +    T         
Subjt:  WLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFT--------G

Query:  SVKRKSKGRAHALKTVVGTEPVTPTV-PLSEAQGNSGPSSAVPTPVIELNLSG----------GRSEEKRLREES----------EALDVSPLNEVKGES
          K   + R    + +V T  ++P   P +   GN  P +  P    E +  G              E +  E S           A+D +    ++ + 
Subjt:  SVKRKSKGRAHALKTVVGTEPVTPTV-PLSEAQGNSGPSSAVPTPVIELNLSG----------GRSEEKRLREES----------EALDVSPLNEVKGES

Query:  PLRRRRKKKKTSSSSEAEARGTLP-----TSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAEL
           +++KKKK +S SE E    LP     T+ ANL       +GG         + P  + ++ +  + + T        AS    V S  SA+    E+
Subjt:  PLRRRRKKKKTSSSSEAEARGTLP-----TSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAEL

Query:  DGREA-----LAAKERENSSAALEAATT-LKGELLKAQGEV----GILRAEVDAKAE---------LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEK
         G+ A     + A ERE + A  EAA   L+ E ++    V     I  AE + KA          L +  G +     RA     + + +     +K  
Subjt:  DGREA-----LAAKERENSSAALEAATT-LKGELLKAQGEV----GILRAEVDAKAE---------LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEK

Query:  DDLAQVLDEKDASIRRLTAELKD--LKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL-SDLKKKYSEKWASGPNGTPGPQS
        +    +LDE +     L+    +  L E L  G +LE    Q    D + KDF+DA           ++     +L  DLK    E     P G    +S
Subjt:  DDLAQVLDEKDASIRRLTAELKD--LKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAADMPHLQIDL-SDLKKKYSEKWASGPNGTPGPQS

Query:  LVDNYVRELDSDYSDVEEEDAPSQDL
        L D       +      +++ PS+DL
Subjt:  LVDNYVRELDSDYSDVEEEDAPSQDL

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related6.7e-0824.21Show/hide
Query:  RLESELEEIENFRFSDDGEDSDTSTSGQGREY------PSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQ
        R+ ++ +   N    D+ E +D + SG+ R+       P+      +        +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F  
Subjt:  RLESELEEIENFRFSDDGEDSDTSTSGQGREY------PSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQ

Query:  EFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFA
         F     +A +Q+       I   A L  L AR       LSV+ +       ++  K G++Y+ + +G   +  GP+  ++W+  +F+A
Subjt:  EFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFA

AT2G15420.1 myosin heavy chain-related5.9e-0423.76Show/hide
Query:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKR
        N P +I L  P+  +R   PPEG++ LY   F   GL  PL  F  E+  R  +A +Q+          LAIL        +    +  D         R
Subjt:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKR

Query:  IAKKPGRYYMCARKGAGGIVKGPTS-IKEWVDKWFFASGEWLAKDESGRPFFD---MSARFGNLVSIKPIPELDQAT----FDTLKYYKDNFPKGR----
        + + PG YY  A K    IV G  S I  W  ++FF      + +     F D   MS      V   P   LD          L +   +FP+ R    
Subjt:  IAKKPGRYYMCARKGAGGIVKGPTS-IKEWVDKWFFASGEWLAKDESGRPFFD---MSARFGNLVSIKPIPELDQAT----FDTLKYYKDNFPKGR----

Query:  KIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPV--TPTVPLSEAQGNSGP---SSAVPTPVIELNLSG
        ++G ++           +   L+  +E S   +E  +     G    +S GR  A ++  G   V  +   P++E +G  G     SA+P      +L G
Subjt:  KIGTLVTDKLLLESGLLDYDPLVRPIEASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPV--TPTVPLSEAQGNSGP---SSAVPTPVIELNLSG

Query:  GRSEEKRL-REESEALDVSPLNEVKGESPLRRRRKKKKT-------SSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRIS
            +KR  R+++E          KG   + +  +++ T         S + +A+    T +A    D  +R+ G SD  +           KD   +  
Subjt:  GRSEEKRL-REESEALDVSPLNEVKGESPLRRRRKKKKT-------SSSSEAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRIS

Query:  ATCLDC-CLRRASKFAFVASIHSAIMVKAELDGREAL------AAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAA
         TC       R S F    +  SA  +  E    EAL       A ERE S+   + ++ L  ++   Q  V   R +++A  +    EG    A LR +
Subjt:  ATCLDC-CLRRASKFAFVASIHSAIMVKAELDGREAL------AAKERENSSAALEAATTLKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAA

Query:  HAITKGLEKEK--FQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNG-SLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
               E++K   Q+      L +++ +K A       EL+  +  L NG   LE +     D D F +  + A    L+ GI+
Subjt:  HAITKGLEKEK--FQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNG-SLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA

AT5G38190.1 INVOLVED IN: biological_process unknown4.4e-0725.14Show/hide
Query:  RFSDD-GEDSDTSTSGQGREY------PSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLASA
        R++DD  E +D + SG+ R+       P+      +        +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A +
Subjt:  RFSDD-GEDSDTSTSGQGREY------PSRMPERYLEPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLASA

Query:  QVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFA
        Q+       I   A L  L AR       LSV+ +       ++  K G++Y+ + +G   +   P+  ++W+  +F+A
Subjt:  QVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKEWVDKWFFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCCAACTCTAGTTGCTTCTACGATTGGTTGGACAACTTTGTGATTGTTTATGCAGGAATTTGCACAACAGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGG
TTTCGACCTGAACACTAGAGTGGACCTGAACAAGAGGGTAATGGATCCGACAGTACACACGACTGGCGGTTACATGTCTTTTCTCATATCGGACCTGTCGGGTTCCGAGC
AGGTCGGACCCCAGTCAGGTCAAACTTTGGTGCCCATATTTCATCTTTTAAGGGGCAAACTCGGTCACATCGGTGGGGCCGAGGTGGACCTAAGCAATCCTTTTTATCTA
ATTTCTTCAAACACGAATAAGGGTCCTCCACGTGTCCCGGGTTGTCGGAGCACTCAAGCGCTTCGCCGTTGCGTATCCCCAGAAGATCCCAGCCGCTCGTTGATTACACG
TTTCGATCTGAAACCAGCTCGAACCCTTTCTCTAGGTCGGACGATAAGTAGTTCGCCTCCCAAACCAAGTGACTCTGAAGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGC
TTGAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCGGGAGTACCCTTCTAGGATGCCCGAGCGTTATCTTGAA
CCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTATCTCAAGAT
GTTTGAGTACGGCCTCAGGCTTCCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACTGGCTTCTGCTCAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTG
CTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAAGATGAGGCCGAGCTGCTAAGTGTTGACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCA
GGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGAATGGGTAGACAAGTGGTTCTTTGCCTCTGGAGAGTGGTTGGC
AAAGGACGAATCAGGTCGTCCCTTCTTTGACATGTCTGCTAGGTTTGGGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTTGACACCCTCAAGT
ACTACAAGGACAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACGACCCTCTAGTTCGACCAATCGAA
GCTTCAAGGCCGAACTCTGAACTCGCAATGGTGTGCGGATTCACTGGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACTGTGGTGGGGACTGAACC
GGTGACGCCTACGGTGCCACTGTCTGAGGCTCAGGGTAACTCTGGGCCTTCTTCTGCAGTCCCCACCCCTGTGATCGAACTAAACTTGTCTGGGGGTCGATCTGAAGAGA
AGCGTCTGAGGGAAGAGTCTGAGGCGCTTGACGTATCTCCCCTGAACGAGGTTAAGGGAGAGTCGCCTTTGAGGAGAAGAAGAAAGAAGAAGAAGACCTCCTCCTCCTCG
GAAGCTGAGGCTCGTGGGACTCTGCCTACGAGCCATGCTAATTTGGTGGATGACCCCGAAGCTCGGATGGGGGGGACATCCGATGTGCGAACGCGGTTCAAGATGGAACC
GTCAAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGATTGCTGCCTAAGGAGAGCATCCAAGTTCGCGTTTGTCGCTTCCATTCACTCAGCTA
TTATGGTCAAGGCTGAGCTGGATGGAAGGGAGGCTTTGGCAGCAAAGGAGAGGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTAAAG
GCCCAAGGCGAGGTGGGTATCTTAAGGGCCGAGGTGGATGCCAAGGCCGAGCTTTTGAAGAAGGAGGGTGAAAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATTAC
CAAGGGGCTGGAGAAGGAGAAGTTCCAACTCTTGAAGGAGAAGGACGATCTCGCTCAGGTCCTCGACGAGAAGGATGCCTCAATAAGGCGCCTCACAGCCGAGCTTAAAG
ACCTAAAGGAGCGCCTAACCAATGGATCTCTACTGGAGGAGTCGTTCAGGCAACACCCGGACTTCGATGGGTTTGCCAAGGACTTCAGCGACGCCGGCTTCAAGTTCCTG
ATGAAGGGCATTGCTGCTGACATGCCCCACCTTCAGATCGATCTCAGCGATCTCAAGAAGAAATATTCTGAGAAGTGGGCTTCTGGGCCTAATGGGACTCCTGGCCCTCA
ATCGCTGGTGGATAACTACGTCAGGGAGCTGGACTCTGACTACTCCGACGTGGAAGAAGAGGATGCTCCTAGCCAAGATCTCATTGAGATCGGCACGACGCAAGAGGAGG
TTCCTTCCCAGCAGGATGGATCTCAGGAGGTTAACCTTCTAGGCTCCCAAGGCGAGCTGTCCTCCCACCTCGGAAGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGCCAACTCTAGTTGCTTCTACGATTGGTTGGACAACTTTGTGATTGTTTATGCAGGAATTTGCACAACAGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGG
TTTCGACCTGAACACTAGAGTGGACCTGAACAAGAGGGTAATGGATCCGACAGTACACACGACTGGCGGTTACATGTCTTTTCTCATATCGGACCTGTCGGGTTCCGAGC
AGGTCGGACCCCAGTCAGGTCAAACTTTGGTGCCCATATTTCATCTTTTAAGGGGCAAACTCGGTCACATCGGTGGGGCCGAGGTGGACCTAAGCAATCCTTTTTATCTA
ATTTCTTCAAACACGAATAAGGGTCCTCCACGTGTCCCGGGTTGTCGGAGCACTCAAGCGCTTCGCCGTTGCGTATCCCCAGAAGATCCCAGCCGCTCGTTGATTACACG
TTTCGATCTGAAACCAGCTCGAACCCTTTCTCTAGGTCGGACGATAAGTAGTTCGCCTCCCAAACCAAGTGACTCTGAAGAGGTCTTAGCTCGTAGGTTAGAGTCCGAGC
TTGAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCGGGAGTACCCTTCTAGGATGCCCGAGCGTTATCTTGAA
CCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTATCTCAAGAT
GTTTGAGTACGGCCTCAGGCTTCCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACTGGCTTCTGCTCAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTG
CTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAAGATGAGGCCGAGCTGCTAAGTGTTGACCAGCTTCTTGGGTGTTTTGAGGCCAAGAGGATAGCCAAAAAACCA
GGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGAATGGGTAGACAAGTGGTTCTTTGCCTCTGGAGAGTGGTTGGC
AAAGGACGAATCAGGTCGTCCCTTCTTTGACATGTCTGCTAGGTTTGGGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTTGACACCCTCAAGT
ACTACAAGGACAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACGACCCTCTAGTTCGACCAATCGAA
GCTTCAAGGCCGAACTCTGAACTCGCAATGGTGTGCGGATTCACTGGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACTGTGGTGGGGACTGAACC
GGTGACGCCTACGGTGCCACTGTCTGAGGCTCAGGGTAACTCTGGGCCTTCTTCTGCAGTCCCCACCCCTGTGATCGAACTAAACTTGTCTGGGGGTCGATCTGAAGAGA
AGCGTCTGAGGGAAGAGTCTGAGGCGCTTGACGTATCTCCCCTGAACGAGGTTAAGGGAGAGTCGCCTTTGAGGAGAAGAAGAAAGAAGAAGAAGACCTCCTCCTCCTCG
GAAGCTGAGGCTCGTGGGACTCTGCCTACGAGCCATGCTAATTTGGTGGATGACCCCGAAGCTCGGATGGGGGGGACATCCGATGTGCGAACGCGGTTCAAGATGGAACC
GTCAAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGATTGCTGCCTAAGGAGAGCATCCAAGTTCGCGTTTGTCGCTTCCATTCACTCAGCTA
TTATGGTCAAGGCTGAGCTGGATGGAAGGGAGGCTTTGGCAGCAAAGGAGAGGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTAAAG
GCCCAAGGCGAGGTGGGTATCTTAAGGGCCGAGGTGGATGCCAAGGCCGAGCTTTTGAAGAAGGAGGGTGAAAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATTAC
CAAGGGGCTGGAGAAGGAGAAGTTCCAACTCTTGAAGGAGAAGGACGATCTCGCTCAGGTCCTCGACGAGAAGGATGCCTCAATAAGGCGCCTCACAGCCGAGCTTAAAG
ACCTAAAGGAGCGCCTAACCAATGGATCTCTACTGGAGGAGTCGTTCAGGCAACACCCGGACTTCGATGGGTTTGCCAAGGACTTCAGCGACGCCGGCTTCAAGTTCCTG
ATGAAGGGCATTGCTGCTGACATGCCCCACCTTCAGATCGATCTCAGCGATCTCAAGAAGAAATATTCTGAGAAGTGGGCTTCTGGGCCTAATGGGACTCCTGGCCCTCA
ATCGCTGGTGGATAACTACGTCAGGGAGCTGGACTCTGACTACTCCGACGTGGAAGAAGAGGATGCTCCTAGCCAAGATCTCATTGAGATCGGCACGACGCAAGAGGAGG
TTCCTTCCCAGCAGGATGGATCTCAGGAGGTTAACCTTCTAGGCTCCCAAGGCGAGCTGTCCTCCCACCTCGGAAGTAGCTGA
Protein sequenceShow/hide protein sequence
MHANSSCFYDWLDNFVIVYAGICTTVLHESSSNPVSGFDLNTRVDLNKRVMDPTVHTTGGYMSFLISDLSGSEQVGPQSGQTLVPIFHLLRGKLGHIGGAEVDLSNPFYL
ISSNTNKGPPRVPGCRSTQALRRCVSPEDPSRSLITRFDLKPARTLSLGRTISSSPPKPSDSEEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGREYPSRMPERYLE
PLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLASAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKP
GRYYMCARKGAGGIVKGPTSIKEWVDKWFFASGEWLAKDESGRPFFDMSARFGNLVSIKPIPELDQATFDTLKYYKDNFPKGRKIGTLVTDKLLLESGLLDYDPLVRPIE
ASRPNSELAMVCGFTGSVKRKSKGRAHALKTVVGTEPVTPTVPLSEAQGNSGPSSAVPTPVIELNLSGGRSEEKRLREESEALDVSPLNEVKGESPLRRRRKKKKTSSSS
EAEARGTLPTSHANLVDDPEARMGGTSDVRTRFKMEPSSSGVKDQVSRISATCLDCCLRRASKFAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTLKGELLK
AQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQVLDEKDASIRRLTAELKDLKERLTNGSLLEESFRQHPDFDGFAKDFSDAGFKFL
MKGIAADMPHLQIDLSDLKKKYSEKWASGPNGTPGPQSLVDNYVRELDSDYSDVEEEDAPSQDLIEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS