; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g26290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g26290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr6:19823669..19827395
RNA-Seq ExpressionMoc06g26290
SyntenyMoc06g26290
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]3.9e-14862.67Show/hide
Query:  VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQN
        +SIKPIPEL QATFDTLKFYKDNFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT S+KRKSKGRAHALK +QSS P TPAVDQN
Subjt:  VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQN

Query:  AAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRM
        AAQDQAGPSSA PT VIELDSTGERSREKRSRSESEALDVSPLREVR                                                     
Subjt:  AAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRM

Query:  EPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDIL
                                                                                                            
Subjt:  EPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDIL

Query:  RAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFK
            EAKAELLKREDERHKAH RAAHAITKGLEKEKFQLLKEKDD+LQA E KDAAIG L AELKAEKERL+NGALLEAAFRQHPDFD F K+FS AGFK
Subjt:  RAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFK

Query:  FLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVNLLGSQGELSSHLG
        FLMKGIAAD+PHL+VDLGDLKK+YAEKWASGPNGT GPASLVDKYVRDLDSDYSDL+EDEVPSQE TEVGTTQEGVPSQQ+ SQEVNLLGSQGELSSHLG
Subjt:  FLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVNLLGSQGELSSHLG

Query:  S
        S
Subjt:  S

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]7.2e-11879.58Show/hide
Query:  MQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARS
        M+FRME SSSGVKDQVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+  E F ASIHSAVM+KAELDGREAL AKEREN S  LEA TTLKGELLKA+ 
Subjt:  MQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARS

Query:  EVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFS
        EVDILRAEV+AK +LLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q  E KDA+IG LT ELK  KERL++GALLE +FRQHP+FD F K+FS
Subjt:  EVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFS

Query:  YAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQ
         AGFKFLMKGIAADMPHLQ+DL DLKK+Y+E WASGPNGT GP SLVDKYVR+LDSDYSD+EE++ PSQE T+VGTTQE  PSQ
Subjt:  YAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQ

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]7.2e-13481.53Show/hide
Query:  MGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLK
        MGGT DV+ +FRMEPSSSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+  E F ASIHSA+M+KAELDGREALAAKERENSSAALEA TTLK
Subjt:  MGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLK

Query:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDF
        GELLKA+ EV ILRAEV+AKAELLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q  EGKD +IG LTAELK  KERL+NG+LLE +FRQH DF
Subjt:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDF

Query:  DRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVN
        D F K+FS AGFKFLMKGIAADMPHLQ+DL +LKKKY+EKWASGPNGT GP SLV KYVR+LDSDYSD+EE++ PSQE  E+GTTQE VPSQQ+ SQEVN
Subjt:  DRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVN

Query:  LLGSQGELSSHLGS
        LLGS+GELSSHLGS
Subjt:  LLGSQGELSSHLGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.2e-15880.23Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHLFAQEFLN
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LR+GF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLH F QEFL 
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHLFAQEFLN

Query:  RTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPF
        RTGLA AQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESG  F
Subjt:  RTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPF

Query:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSS
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   +KRKSKGRAHAL+  QSS
Subjt:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSS

Query:  KPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALD
        KP+TPAV         GP+S  P LVIEL+S+G  SREKR R ++EA+D
Subjt:  KPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]5.7e-23280.71Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESG  FFDVP RFGNLVSIK IPEL QATFDTLK YKD+FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREG
        LVR IEASRPNSELAMVCGFT S+KRKSKGRAHALKT+  ++P TP V +  AQ  +GPSSAVPT VIELD +G RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFT
        SPL+RR+KKKKT+SSSE G RG LP+SHADLVDD EARM GTS+V+M+F MEPSSSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+V E F 
Subjt:  SPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFT

Query:  ASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQA
        ASIH AVM+KAELDGREALAAKERENS AALEA TTLKGELLKA+ EVDILRAEV+AK +LLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q 
Subjt:  ASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQA

Query:  FEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDL
         E KDA+IG LT ELK  KERL+NG LLE +FRQHPDFD F K+FS AGFKFLMKGIAADMPHLQ+DL  LKKKY+EKWASGPNGT  P SLVDKYVR+L
Subjt:  FEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDL

Query:  DSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKS
        DSDYSD+EE++ PSQE  EVGTTQE VPSQQ  S
Subjt:  DSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.9e-14862.67Show/hide
Query:  VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQN
        +SIKPIPEL QATFDTLKFYKDNFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT S+KRKSKGRAHALK +QSS P TPAVDQN
Subjt:  VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQN

Query:  AAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRM
        AAQDQAGPSSA PT VIELDSTGERSREKRSRSESEALDVSPLREVR                                                     
Subjt:  AAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRM

Query:  EPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDIL
                                                                                                            
Subjt:  EPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDIL

Query:  RAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFK
            EAKAELLKREDERHKAH RAAHAITKGLEKEKFQLLKEKDD+LQA E KDAAIG L AELKAEKERL+NGALLEAAFRQHPDFD F K+FS AGFK
Subjt:  RAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFK

Query:  FLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVNLLGSQGELSSHLG
        FLMKGIAAD+PHL+VDLGDLKK+YAEKWASGPNGT GPASLVDKYVRDLDSDYSDL+EDEVPSQE TEVGTTQEGVPSQQ+ SQEVNLLGSQGELSSHLG
Subjt:  FLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVNLLGSQGELSSHLG

Query:  S
        S
Subjt:  S

A0A6J1D1N9 uncharacterized protein LOC1110161933.5e-11879.58Show/hide
Query:  MQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARS
        M+FRME SSSGVKDQVSRISA CLDRCLRRAS+FVSDPGSVLQRTID+  E F ASIHSAVM+KAELDGREAL AKEREN S  LEA TTLKGELLKA+ 
Subjt:  MQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARS

Query:  EVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFS
        EVDILRAEV+AK +LLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q  E KDA+IG LT ELK  KERL++GALLE +FRQHP+FD F K+FS
Subjt:  EVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFS

Query:  YAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQ
         AGFKFLMKGIAADMPHLQ+DL DLKK+Y+E WASGPNGT GP SLVDKYVR+LDSDYSD+EE++ PSQE T+VGTTQE  PSQ
Subjt:  YAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQ

A0A6J1DF31 uncharacterized protein LOC1110199093.5e-13481.53Show/hide
Query:  MGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLK
        MGGT DV+ +FRMEPSSSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+  E F ASIHSA+M+KAELDGREALAAKERENSSAALEA TTLK
Subjt:  MGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLK

Query:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDF
        GELLKA+ EV ILRAEV+AKAELLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q  EGKD +IG LTAELK  KERL+NG+LLE +FRQH DF
Subjt:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDF

Query:  DRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVN
        D F K+FS AGFKFLMKGIAADMPHLQ+DL +LKKKY+EKWASGPNGT GP SLV KYVR+LDSDYSD+EE++ PSQE  E+GTTQE VPSQQ+ SQEVN
Subjt:  DRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVN

Query:  LLGSQGELSSHLGS
        LLGS+GELSSHLGS
Subjt:  LLGSQGELSSHLGS

A0A6J1DXS5 uncharacterized protein LOC1110255021.6e-15880.23Show/hide
Query:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHLFAQEFLN
        LARRLES+LEEIEN R SDDGEDSD STSGQGLEYPSR+PEHYLG LR+GF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLH F QEFL 
Subjt:  LARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHLFAQEFLN

Query:  RTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPF
        RTGLA AQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESG  F
Subjt:  RTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPF

Query:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSS
        FDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   +KRKSKGRAHAL+  QSS
Subjt:  FDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSS

Query:  KPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALD
        KP+TPAV         GP+S  P LVIEL+S+G  SREKR R ++EA+D
Subjt:  KPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256652.8e-23280.71Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESG  FFDVP RFGNLVSIK IPEL QATFDTLK YKD+FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREG
        LVR IEASRPNSELAMVCGFT S+KRKSKGRAHALKT+  ++P TP V +  AQ  +GPSSAVPT VIELD +G RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFT
        SPL+RR+KKKKT+SSSE G RG LP+SHADLVDD EARM GTS+V+M+F MEPSSSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+V E F 
Subjt:  SPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFT

Query:  ASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQA
        ASIH AVM+KAELDGREALAAKERENS AALEA TTLKGELLKA+ EVDILRAEV+AK +LLK+E E+HKAH RAAHAITKGLEKEKFQLLKEKDDL Q 
Subjt:  ASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQA

Query:  FEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDL
         E KDA+IG LT ELK  KERL+NG LLE +FRQHPDFD F K+FS AGFKFLMKGIAADMPHLQ+DL  LKKKY+EKWASGPNGT  P SLVDKYVR+L
Subjt:  FEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDL

Query:  DSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKS
        DSDYSD+EE++ PSQE  EVGTTQE VPSQQ  S
Subjt:  DSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.7e-0824.74Show/hide
Query:  RLESELEEIENFRFSDDGEDSDTSTSGQGLEY------PSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F  
Subjt:  RLESELEEIENFRFSDDGEDSDTSTSGQGLEY------PSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQ

Query:  EFLNRTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA
         F     +A++Q+       I   A L  L AR       LSV+ +       ++  K G++Y+ + +G   +  GP+  + W+G +F+A
Subjt:  EFLNRTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA

AT2G15420.1 myosin heavy chain-related7.4e-0423.17Show/hide
Query:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQEFLNRTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKR
        N P +I L  P+  +R   PPEG++ LY   F   GL  PL  F  E+  R  +A++Q+          LAIL        +    +  D         R
Subjt:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQEFLNRTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKR

Query:  IAKKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFAS--------------GEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFP
        + + PG YY  A K    IV G  S I GW  ++FF                 +W    E      D P  F  L +I  I EL    + T      +FP
Subjt:  IAKKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFAS--------------GEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFP

Query:  KGR----KIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSE--LAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIEL
        + R    ++G ++           +   L+  +E S   +E  L +      S+ R S   +     +       P  +           SA+P+L    
Subjt:  KGR----KIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSE--LAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIEL

Query:  DSTGERSREKRSRSESEALDV--SPLRE------VREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQV
         ST  + R  R  +E  +  V  +P RE      V  G   + + K    T ++         +S ADLV  +       S V      E      +   
Subjt:  DSTGERSREKRSRSESEALDV--SPLRE------VREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMGGTSDVKMQFRMEPSSSGVKDQV

Query:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALA---AKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKA
          I A        R S F+     V              S        AE +  + LA   A ERE S+   + ++ L  ++   +S VD  R ++EA  
Subjt:  SRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALA---AKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKA

Query:  ELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGA-LLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIA
        +    E  R +   R  H +    +K   Q+      L +  + K A       EL+  +  L NG   LE A     D D F +  + A    L+ GI+
Subjt:  ELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGA-LLEAAFRQHPDFDRFVKNFSYAGFKFLMKGIA

AT5G38190.1 INVOLVED IN: biological_process unknown1.8e-0521.5Show/hide
Query:  RFSDD-GEDSDTSTSGQGLEY------PSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQEFLNRTGLALA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A++
Subjt:  RFSDD-GEDSDTSTSGQGLEY------PSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHLFAQEFLNRTGLALA

Query:  QVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEW-LAKDESGHPFFDVPAR
        Q+       I   A L  L AR       LSV+ +       ++  K G++Y+ + +G   +   P+  + W+G +F+A  +  L +D S          
Subjt:  QVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEW-LAKDESGHPFFDVPAR

Query:  FGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPA
             +++    +D      LK  K N  K +K                      RP + S  N  LA       ++ R+ + R  A      SKP    
Subjt:  FGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKGRAHALKTIQSSKPSTPA

Query:  VDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPL-------KRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMG
        VD+   +D+A         V E +  G   + K+  S +E  DV+ +  V +  P+         R   ++ +     G + P  S       D   R+ 
Subjt:  VDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPL-------KRRKKKKKTTSSSEVGPRGPLPSSHADLVDDLEARMG

Query:  GTSDVKM------------QFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDP--------GSVLQRTIDHVVEVFTASIHSA----VMIKAELDGR
        G+ DV +             FR E           R+  A   R LR+     S P         +V    +  ++  +  ++ SA    ++++ +L   
Subjt:  GTSDVKM------------QFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDP--------GSVLQRTIDHVVEVFTASIHSA----VMIKAELDGR

Query:  EALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKF
        E   A   + + A LE    LK E +  ++E    R++  A  E + R+    +A       I     +EKF
Subjt:  EALAAKERENSSAALEATTTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCGGTCGCTCCTCATCATGATAACGTTATGACAGTTGAAGAAAAGTTTGGAGGCCATCTCGGAGGAATTTGCACAACGATTCTTCACGAATCGAGCTCG
AATCCGGTCTCCGGTTCCCACCTGAATACTAGAGTGGACTGGCACAAGAGGGGTAAGCACTCCGACGCTCAAGTCAGTATAGATCAGACTCCTTATTTAGTTCGA
GGTAATATGACCGTTGCGGAATACGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGCATTTCGCCGTTGCGTATCCCGAGAAGATCCCAGCCGCTCGTTGGTTA
CACGTTAGCGATAGCTTGGGTAGTTTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAATCTGAGCTT
GAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTT
GGACCCCTTCGTAAGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCAGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTAT
CTCAAGATGTTTGAGTACGGCCTCAGGCTTCCCCTTCATCTTTTCGCCCAAGAGTTCTTAAACCGAACTGGACTGGCTCTTGCTCAAGTGGCCCCCAATGGGTGG
GGTGTCATTTTTGCATTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGACGAGGCCGAGCTGCTAAGTGTTGATCAGCTCCTTGGATGTTTTGAGGCCAAG
AGGATAGCCAAAAAACCTGGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGCAAGTGGTTC
TTTGCCTCTGGAGAGTGGTTGGCAAAGGACGAATCAGGTCATCCCTTCTTTGACGTGCCTGCTAGGTTTGGGAACCTAGTATCAATCAAGCCGATTCCCGAGCTC
GATCAAGCCACATTTGACACCCTCAAATTCTACAAGGACAACTTCCCAAAGGGCCGGAAGATCGGGACCTTGGTCACCGACAAGCTGCTGCTAGAATCAGGGCTA
TTGGACTACAATCCTTTAGTTCGTCCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCTATGGTGTGTGGATTCACACGCAGCATGAAACGCAAGTCTAAGGGT
CGTGCTCACGCCCTTAAGACAATTCAGAGCTCTAAACCATCTACACCTGCCGTGGATCAGAATGCAGCTCAGGACCAGGCTGGTCCATCTTCTGCAGTTCCAACT
CTGGTGATTGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAGGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAGGGC
TCTCCTCTGAAGAGGAGGAAGAAGAAGAAGAAGACCACCTCCTCCTCGGAGGTTGGACCTCGCGGCCCCCTACCTTCAAGCCACGCCGACCTCGTAGACGACCTT
GAAGCTCGGATGGGGGGCACATCCGACGTGAAGATGCAGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAAGACCAGGTGTCACGCATCTCGGCTGCCTGCTTG
GATCGCTGTCTTAGGAGAGCCTCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTGCAGCGGACTATCGACCACGTCGTCGAGGTGTTCACTGCCTCCATCCATTCA
GCAGTCATGATCAAGGCCGAGTTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAATTCATCTGCTGCCTTGGAGGCTACCACTACGCTGAAGGGGGAG
CTGCTGAAGGCTCGGAGCGAGGTGGACATACTGAGGGCTGAGGTTGAAGCCAAAGCCGAGCTGCTGAAGAGGGAAGATGAAAGGCACAAGGCCCACTTCCGAGCT
GCCCACGCTATCACTAAGGGGCTGGAGAAGGAGAAGTTCCAACTCCTTAAAGAGAAGGACGACCTGCTCCAGGCCTTCGAAGGGAAAGACGCTGCAATTGGGTGT
CTTACTGCCGAGTTGAAGGCGGAGAAGGAACGCCTTTCTAATGGAGCTCTTCTTGAAGCAGCCTTCAGGCAACACCCAGATTTTGACAGGTTTGTCAAGAACTTC
AGCTACGCAGGCTTCAAATTCCTGATGAAGGGGATTGCTGCTGACATGCCTCACCTCCAGGTCGACCTCGGCGATCTGAAGAAGAAGTACGCTGAGAAATGGGCT
TCTGGGCCTAATGGCACCCGAGGTCCTGCATCCCTGGTGGACAAATACGTCAGAGATCTGGACTCTGACTACTCCGACTTGGAGGAAGACGAGGTCCCAAGTCAG
GAAGCTACTGAGGTCGGCACCACCCAAGAAGGAGTCCCTTCCCAGCAGAACAAATCTCAGGAGGTCAACCTTCTAGGTTCTCAAGGCGAGCTATCTTCTCACCTC
GGGAGCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCGGTCGCTCCTCATCATGATAACGTTATGACAGTTGAAGAAAAGTTTGGAGGCCATCTCGGAGGAATTTGCACAACGATTCTTCACGAATCGAGCTCG
AATCCGGTCTCCGGTTCCCACCTGAATACTAGAGTGGACTGGCACAAGAGGGGTAAGCACTCCGACGCTCAAGTCAGTATAGATCAGACTCCTTATTTAGTTCGA
GGTAATATGACCGTTGCGGAATACGTTTCGACCTGCCAGGTTGTCGGAGCACTCAAGCATTTCGCCGTTGCGTATCCCGAGAAGATCCCAGCCGCTCGTTGGTTA
CACGTTAGCGATAGCTTGGGTAGTTTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAATCTGAGCTT
GAAGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAGTGATACCTCCACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTT
GGACCCCTTCGTAAGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCAGAGGAAGGGGAAAGAGCTGACAATCCCCCAGAGGGATGGGTCACTCTTTAT
CTCAAGATGTTTGAGTACGGCCTCAGGCTTCCCCTTCATCTTTTCGCCCAAGAGTTCTTAAACCGAACTGGACTGGCTCTTGCTCAAGTGGCCCCCAATGGGTGG
GGTGTCATTTTTGCATTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGACGAGGCCGAGCTGCTAAGTGTTGATCAGCTCCTTGGATGTTTTGAGGCCAAG
AGGATAGCCAAAAAACCTGGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGTATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGCAAGTGGTTC
TTTGCCTCTGGAGAGTGGTTGGCAAAGGACGAATCAGGTCATCCCTTCTTTGACGTGCCTGCTAGGTTTGGGAACCTAGTATCAATCAAGCCGATTCCCGAGCTC
GATCAAGCCACATTTGACACCCTCAAATTCTACAAGGACAACTTCCCAAAGGGCCGGAAGATCGGGACCTTGGTCACCGACAAGCTGCTGCTAGAATCAGGGCTA
TTGGACTACAATCCTTTAGTTCGTCCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCTATGGTGTGTGGATTCACACGCAGCATGAAACGCAAGTCTAAGGGT
CGTGCTCACGCCCTTAAGACAATTCAGAGCTCTAAACCATCTACACCTGCCGTGGATCAGAATGCAGCTCAGGACCAGGCTGGTCCATCTTCTGCAGTTCCAACT
CTGGTGATTGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAGGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAGGGC
TCTCCTCTGAAGAGGAGGAAGAAGAAGAAGAAGACCACCTCCTCCTCGGAGGTTGGACCTCGCGGCCCCCTACCTTCAAGCCACGCCGACCTCGTAGACGACCTT
GAAGCTCGGATGGGGGGCACATCCGACGTGAAGATGCAGTTCAGAATGGAACCGTCGAGCTCCGGGGTGAAAGACCAGGTGTCACGCATCTCGGCTGCCTGCTTG
GATCGCTGTCTTAGGAGAGCCTCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTGCAGCGGACTATCGACCACGTCGTCGAGGTGTTCACTGCCTCCATCCATTCA
GCAGTCATGATCAAGGCCGAGTTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAATTCATCTGCTGCCTTGGAGGCTACCACTACGCTGAAGGGGGAG
CTGCTGAAGGCTCGGAGCGAGGTGGACATACTGAGGGCTGAGGTTGAAGCCAAAGCCGAGCTGCTGAAGAGGGAAGATGAAAGGCACAAGGCCCACTTCCGAGCT
GCCCACGCTATCACTAAGGGGCTGGAGAAGGAGAAGTTCCAACTCCTTAAAGAGAAGGACGACCTGCTCCAGGCCTTCGAAGGGAAAGACGCTGCAATTGGGTGT
CTTACTGCCGAGTTGAAGGCGGAGAAGGAACGCCTTTCTAATGGAGCTCTTCTTGAAGCAGCCTTCAGGCAACACCCAGATTTTGACAGGTTTGTCAAGAACTTC
AGCTACGCAGGCTTCAAATTCCTGATGAAGGGGATTGCTGCTGACATGCCTCACCTCCAGGTCGACCTCGGCGATCTGAAGAAGAAGTACGCTGAGAAATGGGCT
TCTGGGCCTAATGGCACCCGAGGTCCTGCATCCCTGGTGGACAAATACGTCAGAGATCTGGACTCTGACTACTCCGACTTGGAGGAAGACGAGGTCCCAAGTCAG
GAAGCTACTGAGGTCGGCACCACCCAAGAAGGAGTCCCTTCCCAGCAGAACAAATCTCAGGAGGTCAACCTTCTAGGTTCTCAAGGCGAGCTATCTTCTCACCTC
GGGAGCGGCTGA
Protein sequenceShow/hide protein sequence
MSSVAPHHDNVMTVEEKFGGHLGGICTTILHESSSNPVSGSHLNTRVDWHKRGKHSDAQVSIDQTPYLVRGNMTVAEYVSTCQVVGALKHFAVAYPEKIPAARWL
HVSDSLGSLGRTISSSPPKPSDSGEVLARRLESELEEIENFRFSDDGEDSDTSTSGQGLEYPSRMPEHYLGPLRKGFNIPNDILLRIPEEGERADNPPEGWVTLY
LKMFEYGLRLPLHLFAQEFLNRTGLALAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWF
FASGEWLAKDESGHPFFDVPARFGNLVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSMKRKSKG
RAHALKTIQSSKPSTPAVDQNAAQDQAGPSSAVPTLVIELDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKTTSSSEVGPRGPLPSSHADLVDDL
EARMGGTSDVKMQFRMEPSSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHVVEVFTASIHSAVMIKAELDGREALAAKERENSSAALEATTTLKGE
LLKARSEVDILRAEVEAKAELLKREDERHKAHFRAAHAITKGLEKEKFQLLKEKDDLLQAFEGKDAAIGCLTAELKAEKERLSNGALLEAAFRQHPDFDRFVKNF
SYAGFKFLMKGIAADMPHLQVDLGDLKKKYAEKWASGPNGTRGPASLVDKYVRDLDSDYSDLEEDEVPSQEATEVGTTQEGVPSQQNKSQEVNLLGSQGELSSHL
GSG