; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g28560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g28560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr11:20849969..20854000
RNA-Seq ExpressionMoc11g28560
SyntenyMoc11g28560
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.9e-11952.94Show/hide
Query:  VSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQL
        +SIKPIPEL QA+++TLK+YKD F  GRK+GTLVTD+LLLESGLLDYNP+VRP+EASRPNSEL MVCGF+SSVK KSKGRAHALK +QS+ P TPA  Q 
Subjt:  VSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQL

Query:  AAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREESPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCV
        AAQD+AGPS   PTPVIELDS GE SREKR R+ESEALDVSPLREVR                                                     
Subjt:  AAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREESPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCV

Query:  EPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDIL
                                                                                                            
Subjt:  EPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDIL

Query:  KAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFK
            EAKA+LLK+EDE+HKA+LRAAHAITK LEKEKFQLLK+K+DMLQ LE KDA+I  L  EL+ EK+RL+N  LLEAAFRQHPDFD FAKDFSDAGFK
Subjt:  KAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFK

Query:  FMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVNLLGSQG
        F+M+GIAAD+PHL++DL DLKK+YAEKWA GPN TSGP SLV+KYVR+LDSDYS ++E++VPSQE  EVGTTQE  PSQQ GSQEVNLLGSQG
Subjt:  FMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVNLLGSQG

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.6e-11275.75Show/hide
Query:  MFEYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVK
        MFEYGLRLPLHP VQEFL RTGL PAQVAPNGWG+IFALAILFWLRA++ +E ELLDVDQLL CFEAKRI KKPGR+YMCARKGA GIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSS
        KWF+ASGEWLAK+ESG  FFDVP RF NLVSI+P+PEL QAS++TLKYYK+RF  GRKVGTLVTD+LLLESGLLDYNP VRP+E SRPNS L MVC F+S
Subjt:  KWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSS

Query:  SVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD
         VK KSKGRAHAL+A QS++P TPA          GP+ E P PVIEL+S+G  SREKRPR+++EA+D
Subjt:  SVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.8e-11777.12Show/hide
Query:  MRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMK
        M GT DV  RF +EPSS GVKDQVSRISA+CLDRCL+RASKFVSDP  VLQR ID+AAE  +ASIHSA+M+KAELD RE   AKE+ENSSAALEAATT+K
Subjt:  MRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMK

Query:  GELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDF
        GELLKA+ EV IL+AEV+AKA+LLKKE EKHKA+LRAAHAITK LEKEKFQLLK+K+D+ QVLE KD SI  LT EL+  K+RL+N  LLE +FRQH DF
Subjt:  GELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDF

Query:  DEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVN
        D FAKDFSDAGFKF+M+GIAADMPHLQIDLS+LKKKY+EKWA GPN T GPQSLV KYVRELDSDYS +EEED PSQE  E+GTTQEE PSQQ GSQEVN
Subjt:  DEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVN

Query:  LLGSQG
        LLGS+G
Subjt:  LLGSQG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.1e-15476.11Show/hide
Query:  SSLKSTNSGEDLAHRLESELEEIDNFRFSDDGEDSDTSTSGQGLEYPSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMFEYGLRL
        SS  S+N   DLA RLES+LEEI+N R SDDGEDSD STSGQGLEYPS+IPEHYLG LRRGF+IP++ILLR+PEE ER DNPPEGWVTLY KMFEYGLRL
Subjt:  SSLKSTNSGEDLAHRLESELEEIDNFRFSDDGEDSDTSTSGQGLEYPSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMFEYGLRL

Query:  PLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGE
        PLHP VQEFL RTGL PAQVAPNGWG+IFALAILFWLRA++ +E EL DVDQLL CFEAKRI KKPGR+YMCARKGA GIVKGPTSIKGWV+KWF+ASGE
Subjt:  PLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGE

Query:  WLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKG
        WLAK+ESG  FFDVP RF NLVSI+P+PEL QAS++TLKYYK+RF  GRKVGTLVTD+LLLESGLLDYNP VRP+E+SRPNSEL MVCGF+S VK KSKG
Subjt:  WLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKG

Query:  RAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD
        RAHAL+A QS++P TPA          GP+ E P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  RAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]9.8e-21776.59Show/hide
Query:  MCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWFFASGEWLAK+ESG  FFDVP RF NLVSIK IPEL QA+++TLK+YKD F   RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNP

Query:  IVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREE
        +VR +EASRPNSEL MVCGF+ SVK KSKGRAHALK +  T+P TP   +  AQ  +GPS  VPTPVIELD +G  S EKR R ESEALDVSPL EVR E
Subjt:  IVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREE

Query:  SPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLI
        SPL+RRRKKKKT+SSSE G RG LP SHADLVDDPEARMRGTS+V MRF +EPSS GVKDQVSRISA+CLDR LRRASKFVSDP  VLQR ID+ AE  I
Subjt:  SPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLI

Query:  ASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQV
        ASIH AVM+KAELD RE   AKE+ENS AALEAATT+KGELLKA+ EVDIL+AEV+AK  LLKKE EKHKA+LRAAHAITK LEKEKFQLLK+K+D+ QV
Subjt:  ASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQV

Query:  LEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVREL
        LEEKDASI  LT EL+  K+RL+N  LLE +FRQHPDFD FAKDFSDAGFKF+M+GIAADMPHLQIDL+ LKKKY+EKWA GPN T  PQSLV+KYVREL
Subjt:  LEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVREL

Query:  DSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGS
        DSDYS +EEED PSQE  EVGTTQEE PSQQGGS
Subjt:  DSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124679.2e-12052.94Show/hide
Query:  VSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQL
        +SIKPIPEL QA+++TLK+YKD F  GRK+GTLVTD+LLLESGLLDYNP+VRP+EASRPNSEL MVCGF+SSVK KSKGRAHALK +QS+ P TPA  Q 
Subjt:  VSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQL

Query:  AAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREESPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCV
        AAQD+AGPS   PTPVIELDS GE SREKR R+ESEALDVSPLREVR                                                     
Subjt:  AAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREESPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCV

Query:  EPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDIL
                                                                                                            
Subjt:  EPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDIL

Query:  KAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFK
            EAKA+LLK+EDE+HKA+LRAAHAITK LEKEKFQLLK+K+DMLQ LE KDA+I  L  EL+ EK+RL+N  LLEAAFRQHPDFD FAKDFSDAGFK
Subjt:  KAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFK

Query:  FMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVNLLGSQG
        F+M+GIAAD+PHL++DL DLKK+YAEKWA GPN TSGP SLV+KYVR+LDSDYS ++E++VPSQE  EVGTTQE  PSQQ GSQEVNLLGSQG
Subjt:  FMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVNLLGSQG

A0A6J1CR42 uncharacterized protein LOC1110138263.2e-11275.75Show/hide
Query:  MFEYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVK
        MFEYGLRLPLHP VQEFL RTGL PAQVAPNGWG+IFALAILFWLRA++ +E ELLDVDQLL CFEAKRI KKPGR+YMCARKGA GIVKGPTSIKGWV+
Subjt:  MFEYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVK

Query:  KWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSS
        KWF+ASGEWLAK+ESG  FFDVP RF NLVSI+P+PEL QAS++TLKYYK+RF  GRKVGTLVTD+LLLESGLLDYNP VRP+E SRPNS L MVC F+S
Subjt:  KWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSS

Query:  SVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD
         VK KSKGRAHAL+A QS++P TPA          GP+ E P PVIEL+S+G  SREKRPR+++EA+D
Subjt:  SVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD

A0A6J1DF31 uncharacterized protein LOC1110199098.7e-11877.12Show/hide
Query:  MRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMK
        M GT DV  RF +EPSS GVKDQVSRISA+CLDRCL+RASKFVSDP  VLQR ID+AAE  +ASIHSA+M+KAELD RE   AKE+ENSSAALEAATT+K
Subjt:  MRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMK

Query:  GELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDF
        GELLKA+ EV IL+AEV+AKA+LLKKE EKHKA+LRAAHAITK LEKEKFQLLK+K+D+ QVLE KD SI  LT EL+  K+RL+N  LLE +FRQH DF
Subjt:  GELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDF

Query:  DEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVN
        D FAKDFSDAGFKF+M+GIAADMPHLQIDLS+LKKKY+EKWA GPN T GPQSLV KYVRELDSDYS +EEED PSQE  E+GTTQEE PSQQ GSQEVN
Subjt:  DEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVN

Query:  LLGSQG
        LLGS+G
Subjt:  LLGSQG

A0A6J1DXS5 uncharacterized protein LOC1110255025.2e-15576.11Show/hide
Query:  SSLKSTNSGEDLAHRLESELEEIDNFRFSDDGEDSDTSTSGQGLEYPSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMFEYGLRL
        SS  S+N   DLA RLES+LEEI+N R SDDGEDSD STSGQGLEYPS+IPEHYLG LRRGF+IP++ILLR+PEE ER DNPPEGWVTLY KMFEYGLRL
Subjt:  SSLKSTNSGEDLAHRLESELEEIDNFRFSDDGEDSDTSTSGQGLEYPSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMFEYGLRL

Query:  PLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGE
        PLHP VQEFL RTGL PAQVAPNGWG+IFALAILFWLRA++ +E EL DVDQLL CFEAKRI KKPGR+YMCARKGA GIVKGPTSIKGWV+KWF+ASGE
Subjt:  PLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGE

Query:  WLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKG
        WLAK+ESG  FFDVP RF NLVSI+P+PEL QAS++TLKYYK+RF  GRKVGTLVTD+LLLESGLLDYNP VRP+E+SRPNSEL MVCGF+S VK KSKG
Subjt:  WLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKG

Query:  RAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD
        RAHAL+A QS++P TPA          GP+ E P  VIEL+S+G  SREKRPR+++EA+D
Subjt:  RAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256654.8e-21776.59Show/hide
Query:  MCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWFFASGEWLAK+ESG  FFDVP RF NLVSIK IPEL QA+++TLK+YKD F   RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNP

Query:  IVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREE
        +VR +EASRPNSEL MVCGF+ SVK KSKGRAHALK +  T+P TP   +  AQ  +GPS  VPTPVIELD +G  S EKR R ESEALDVSPL EVR E
Subjt:  IVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREE

Query:  SPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLI
        SPL+RRRKKKKT+SSSE G RG LP SHADLVDDPEARMRGTS+V MRF +EPSS GVKDQVSRISA+CLDR LRRASKFVSDP  VLQR ID+ AE  I
Subjt:  SPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCVEPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLI

Query:  ASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQV
        ASIH AVM+KAELD RE   AKE+ENS AALEAATT+KGELLKA+ EVDIL+AEV+AK  LLKKE EKHKA+LRAAHAITK LEKEKFQLLK+K+D+ QV
Subjt:  ASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQV

Query:  LEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVREL
        LEEKDASI  LT EL+  K+RL+N  LLE +FRQHPDFD FAKDFSDAGFKF+M+GIAADMPHLQIDL+ LKKKY+EKWA GPN T  PQSLV+KYVREL
Subjt:  LEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDLKKKYAEKWAFGPNSTSGPQSLVEKYVREL

Query:  DSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGS
        DSDYS +EEED PSQE  EVGTTQEE PSQQGGS
Subjt:  DSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related2.3e-0622.63Show/hide
Query:  RLESELEEIDNFRFSDDGEDSDTSTSGQGLEY------PSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+   + 
Subjt:  RLESELEEIDNFRFSDDGEDSDTSTSGQGLEY------PSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQ

Query:  EFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFA
         F     +  +Q+       I   A L  L A+    G  L V+ +       ++  K G++Y+ + +G   +  GP+  + W+  +F+A
Subjt:  EFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFA

AT2G15420.1 myosin heavy chain-related1.7e-0423.57Show/hide
Query:  PDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIV
        P +I L  P+  +R   PPEG++ LY   F   GL  PL   + E+  R  +  +Q+          LAIL        + G  +D D         R+ 
Subjt:  PDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIV

Query:  KKPGRYYMCARKGACGIVKGPTS-IKGWVKKWFFAS--------------GEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSG
        + PG YY  A K    IV G  S I GW +++FF                 +W    E      D P  F  L +I  I EL    W T  + + R    
Subjt:  KKPGRYYMCARKGACGIVKGPTS-IKGWVKKWFFAS--------------GEWLAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSG

Query:  RKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSR
        R +G ++           +   ++  VE S   +E  +      +   +S GR  A ++         ++ +  A+DK      V    +        S+
Subjt:  RKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQSTQPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSR

Query:  EKRPRNESEALDVSPLREVREESPLK-------RRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSD--VTMRFCVEPSSFGVKDQVSRISASC
        ++  R+++E       +  R E+ ++         + K K  + ++         S ADLV    +R+RG SD   ++   VE   F  KD   +   S 
Subjt:  EKRPRNESEALDVSPLREVREESPLK-------RRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSD--VTMRFCVEPSSFGVKDQVSRISASC

Query:  LDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKH
              R S F+        +A   A  +   S     + K   ++     A E+E S+   + ++ +  ++   +S VD  + ++EA    L K     
Subjt:  LDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQLLKKEDEKH

Query:  KAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLE---EKDASIKHLTI-ELEVEKKRLSNRV-LLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIA
         A LR +       E++K     Q    LQ LE   +K  +I   TI ELEV ++ L N V  LE A     D D F +  + A    ++ GI+
Subjt:  KAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLE---EKDASIKHLTI-ELEVEKKRLSNRV-LLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIA

AT5G38190.1 INVOLVED IN: biological_process unknown6.8e-0621.52Show/hide
Query:  RLESELEEIDNFRFSDDGEDSDTSTSGQGLEY------PSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQ
        R +++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+   + 
Subjt:  RLESELEEIDNFRFSDDGEDSDTSTSGQGLEY------PSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYLKMF-EYGLRLPLHPLVQ

Query:  EFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNES
         F     +  +Q+       I   A L  L A+    G  L V+ +       ++  K G++Y+ + +G   +   P+  + W+  +F+A    + +N  
Subjt:  EFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGEWLAKNES

Query:  GSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDR
          P  +  +R +  +  K +P L     N  K  K +
Subjt:  GSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTTGTTTATGCAAAGAGTTGTACAACACTATTCACGAATCTAGCTCGAACCCGGTCTCCGACTCGACCTGAACCTTGGAGTGAACCTGCACAAGAGGACAAACT
GTCAGACGATCAAGTTAGTGTAGGTGGTGGGTCCGACATCATTCACGACCGGCGGTTATCCCTGTTTTTTGTCATGTCGAACCTGTCGGGTTTGAGCAGATCGAACCCGT
TCAGGTCGAACCCCGACCTTTTACACTTAGCCTTGTATATAGACAAATTTGGTCTCCTCGGCAGGTCAAACCTTACGCTTCCTGAATTCTTAGAGTTCGATCTGAAATCA
GCTCGAACCCTCCGTAGTAGTGATAGCCTAGGTAGCGCAGGTCGGACTATAAACAGCTCGTCCCTTAAGTCAACTAACTCTGGGGAGGACTTAGCTCATAGGTTAGAGTC
TGAGCTGGAAGAGATAGATAACTTTAGGTTTTCTGATGACGGGGAAGATAGTGACACTTCCACCTCGGGCCAGGGTCTGGAATACCCTTCTAAAATACCTGAACACTATC
TCGGACCCCTCCGTAGGGGGTTTAGTATTCCTGATGATATCCTCCTTAGGATCCCGGAGGAAAGAGAAAGAGTTGACAACCCACCAGAGGGGTGGGTCACTCTTTACTTA
AAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTCTTGTCCAAGAGTTCTTAAACCGAACTGGGCTGACTCCTGCTCAAGTGGCCCCCAATGGATGGGGTATCAT
TTTTGCTTTAGCCATCCTCTTCTGGTTACGAGCTCAAGAAGAGGACGAGGGCGAGCTGCTAGATGTTGACCAGCTTCTACGGTGCTTCGAAGCCAAAAGAATAGTTAAGA
AGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCATGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTGAAGAAGTGGTTCTTTGCCTCTGGAGAATGG
CTGGCGAAGAACGAGTCTGGTAGTCCTTTTTTTGACGTTCCCGTTAGGTTTGAGAATTTAGTGTCGATCAAACCAATTCCCGAGCTAAACCAAGCATCCTGGAATACCCT
CAAGTATTATAAGGATCGCTTCCTAAGTGGTAGGAAGGTCGGAACCTTGGTGACTGACCAGCTGTTACTTGAATCTGGGTTGTTAGATTACAACCCCATAGTACGTCCAG
TCGAAGCTTCAAGGCCAAACTCTGAGCTTACGATGGTGTGCGGATTCTCAAGCAGTGTAAAACATAAGTCTAAGGGTCGTGCTCACGCCCTTAAGGCTATACAGAGCACG
CAGCCAACGACTCCCGCTGATGCTCAACTTGCGGCTCAAGACAAAGCTGGGCCATCTATCGAAGTTCCAACTCCAGTGATCGAGCTGGATTCTGCTGGGGAGCACTCCCG
AGAGAAGCGTCCGAGGAACGAGTCCGAGGCGCTGGACGTGTCACCTCTACGCGAGGTGAGAGAAGAGTCTCCTCTGAAGAGGAGAAGGAAGAAAAAGAAAACCACCTCCT
CCTCGGAGGTTGGACCTCGTGGGCCCCTACCCATGAGCCATGCTGATCTAGTGGACGACCCTGAAGCTAGGATGAGGGGGACGTCTGACGTGACTATGCGGTTTTGTGTT
GAACCGTCAAGTTTCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCATCATGCTTGGACCGCTGCCTCAGAAGGGCGTCCAAGTTTGTAAGCGACCCAAGGTTCGTGCT
GCAAAGGGCCATCGACCACGCCGCTGAGGTGTTAATTGCTTCCATTCACTCAGCCGTCATGATGAAGGCCGAGCTGGACATAAGGGAAATCTTTGTGGCAAAGGAGAAGG
AGAACTCCTCCGCTGCCTTAGAAGCCGCCACCACAATGAAGGGCGAGCTATTGAAGGCTCGTTCCGAAGTGGACATTCTGAAGGCTGAGGTGGAAGCCAAGGCTCAACTG
CTGAAGAAAGAGGATGAGAAGCACAAGGCCTACCTCCGAGCTGCCCATGCCATCACAAAAAGGCTAGAGAAGGAGAAGTTCCAACTCTTGAAGCAGAAAAACGATATGCT
TCAAGTCCTCGAAGAGAAGGATGCTTCAATCAAACACCTCACTATCGAGCTCGAGGTGGAGAAGAAGCGCCTTAGCAACAGAGTTCTTCTAGAGGCAGCGTTCAGGCAAC
ATCCAGATTTTGACGAGTTTGCCAAAGATTTCAGCGACGCGGGCTTCAAATTTATGATGAGGGGCATTGCTGCCGATATGCCTCATCTTCAGATCGACCTCAGCGATCTG
AAGAAGAAGTACGCTGAGAAATGGGCCTTTGGGCCTAATAGCACCTCGGGCCCTCAATCCTTGGTGGAGAAATACGTCAGAGAGCTGGACTCTGACTACTCTTACATAGA
AGAGGAAGATGTTCCTAGTCAGGAGCAGGTCGAGGTGGGCACTACGCAAGAAGAAGCCCCTTCGCAGCAAGGCGGATCCCAGGAGGTCAACCTTCTGGGCTCCCAAGGAG
TTACGCACTTGAGGAGAGGCAGGGAGAATCCTCGTCGGTACAATGCTCCATCTCGGAGTATGAATCGAGCTGCTCTTCGTGCCATCTTCTTGTGTTGTTTTGAGTCTTGC
GGTGGGTTTCCTTTAATGAATTCCACAATTGGATCCATCCATGAGGGTGTTGATGTGTCAATATCCATCATGTCCGGTTCCAGAATTGACGGGTTATCTAGATCGGTCTC
GTATTCTGACGCCAATTTGGCTAAGGCATCTGCGTTTGAATTCTTTGATATTGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTTGTTTATGCAAAGAGTTGTACAACACTATTCACGAATCTAGCTCGAACCCGGTCTCCGACTCGACCTGAACCTTGGAGTGAACCTGCACAAGAGGACAAACT
GTCAGACGATCAAGTTAGTGTAGGTGGTGGGTCCGACATCATTCACGACCGGCGGTTATCCCTGTTTTTTGTCATGTCGAACCTGTCGGGTTTGAGCAGATCGAACCCGT
TCAGGTCGAACCCCGACCTTTTACACTTAGCCTTGTATATAGACAAATTTGGTCTCCTCGGCAGGTCAAACCTTACGCTTCCTGAATTCTTAGAGTTCGATCTGAAATCA
GCTCGAACCCTCCGTAGTAGTGATAGCCTAGGTAGCGCAGGTCGGACTATAAACAGCTCGTCCCTTAAGTCAACTAACTCTGGGGAGGACTTAGCTCATAGGTTAGAGTC
TGAGCTGGAAGAGATAGATAACTTTAGGTTTTCTGATGACGGGGAAGATAGTGACACTTCCACCTCGGGCCAGGGTCTGGAATACCCTTCTAAAATACCTGAACACTATC
TCGGACCCCTCCGTAGGGGGTTTAGTATTCCTGATGATATCCTCCTTAGGATCCCGGAGGAAAGAGAAAGAGTTGACAACCCACCAGAGGGGTGGGTCACTCTTTACTTA
AAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTCTTGTCCAAGAGTTCTTAAACCGAACTGGGCTGACTCCTGCTCAAGTGGCCCCCAATGGATGGGGTATCAT
TTTTGCTTTAGCCATCCTCTTCTGGTTACGAGCTCAAGAAGAGGACGAGGGCGAGCTGCTAGATGTTGACCAGCTTCTACGGTGCTTCGAAGCCAAAAGAATAGTTAAGA
AGCCAGGTCGGTACTACATGTGCGCAAGGAAGGGCGCATGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTGAAGAAGTGGTTCTTTGCCTCTGGAGAATGG
CTGGCGAAGAACGAGTCTGGTAGTCCTTTTTTTGACGTTCCCGTTAGGTTTGAGAATTTAGTGTCGATCAAACCAATTCCCGAGCTAAACCAAGCATCCTGGAATACCCT
CAAGTATTATAAGGATCGCTTCCTAAGTGGTAGGAAGGTCGGAACCTTGGTGACTGACCAGCTGTTACTTGAATCTGGGTTGTTAGATTACAACCCCATAGTACGTCCAG
TCGAAGCTTCAAGGCCAAACTCTGAGCTTACGATGGTGTGCGGATTCTCAAGCAGTGTAAAACATAAGTCTAAGGGTCGTGCTCACGCCCTTAAGGCTATACAGAGCACG
CAGCCAACGACTCCCGCTGATGCTCAACTTGCGGCTCAAGACAAAGCTGGGCCATCTATCGAAGTTCCAACTCCAGTGATCGAGCTGGATTCTGCTGGGGAGCACTCCCG
AGAGAAGCGTCCGAGGAACGAGTCCGAGGCGCTGGACGTGTCACCTCTACGCGAGGTGAGAGAAGAGTCTCCTCTGAAGAGGAGAAGGAAGAAAAAGAAAACCACCTCCT
CCTCGGAGGTTGGACCTCGTGGGCCCCTACCCATGAGCCATGCTGATCTAGTGGACGACCCTGAAGCTAGGATGAGGGGGACGTCTGACGTGACTATGCGGTTTTGTGTT
GAACCGTCAAGTTTCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCATCATGCTTGGACCGCTGCCTCAGAAGGGCGTCCAAGTTTGTAAGCGACCCAAGGTTCGTGCT
GCAAAGGGCCATCGACCACGCCGCTGAGGTGTTAATTGCTTCCATTCACTCAGCCGTCATGATGAAGGCCGAGCTGGACATAAGGGAAATCTTTGTGGCAAAGGAGAAGG
AGAACTCCTCCGCTGCCTTAGAAGCCGCCACCACAATGAAGGGCGAGCTATTGAAGGCTCGTTCCGAAGTGGACATTCTGAAGGCTGAGGTGGAAGCCAAGGCTCAACTG
CTGAAGAAAGAGGATGAGAAGCACAAGGCCTACCTCCGAGCTGCCCATGCCATCACAAAAAGGCTAGAGAAGGAGAAGTTCCAACTCTTGAAGCAGAAAAACGATATGCT
TCAAGTCCTCGAAGAGAAGGATGCTTCAATCAAACACCTCACTATCGAGCTCGAGGTGGAGAAGAAGCGCCTTAGCAACAGAGTTCTTCTAGAGGCAGCGTTCAGGCAAC
ATCCAGATTTTGACGAGTTTGCCAAAGATTTCAGCGACGCGGGCTTCAAATTTATGATGAGGGGCATTGCTGCCGATATGCCTCATCTTCAGATCGACCTCAGCGATCTG
AAGAAGAAGTACGCTGAGAAATGGGCCTTTGGGCCTAATAGCACCTCGGGCCCTCAATCCTTGGTGGAGAAATACGTCAGAGAGCTGGACTCTGACTACTCTTACATAGA
AGAGGAAGATGTTCCTAGTCAGGAGCAGGTCGAGGTGGGCACTACGCAAGAAGAAGCCCCTTCGCAGCAAGGCGGATCCCAGGAGGTCAACCTTCTGGGCTCCCAAGGAG
TTACGCACTTGAGGAGAGGCAGGGAGAATCCTCGTCGGTACAATGCTCCATCTCGGAGTATGAATCGAGCTGCTCTTCGTGCCATCTTCTTGTGTTGTTTTGAGTCTTGC
GGTGGGTTTCCTTTAATGAATTCCACAATTGGATCCATCCATGAGGGTGTTGATGTGTCAATATCCATCATGTCCGGTTCCAGAATTGACGGGTTATCTAGATCGGTCTC
GTATTCTGACGCCAATTTGGCTAAGGCATCTGCGTTTGAATTCTTTGATATTGGAATTTGA
Protein sequenceShow/hide protein sequence
MVVVYAKSCTTLFTNLARTRSPTRPEPWSEPAQEDKLSDDQVSVGGGSDIIHDRRLSLFFVMSNLSGLSRSNPFRSNPDLLHLALYIDKFGLLGRSNLTLPEFLEFDLKS
ARTLRSSDSLGSAGRTINSSSLKSTNSGEDLAHRLESELEEIDNFRFSDDGEDSDTSTSGQGLEYPSKIPEHYLGPLRRGFSIPDDILLRIPEERERVDNPPEGWVTLYL
KMFEYGLRLPLHPLVQEFLNRTGLTPAQVAPNGWGIIFALAILFWLRAQEEDEGELLDVDQLLRCFEAKRIVKKPGRYYMCARKGACGIVKGPTSIKGWVKKWFFASGEW
LAKNESGSPFFDVPVRFENLVSIKPIPELNQASWNTLKYYKDRFLSGRKVGTLVTDQLLLESGLLDYNPIVRPVEASRPNSELTMVCGFSSSVKHKSKGRAHALKAIQST
QPTTPADAQLAAQDKAGPSIEVPTPVIELDSAGEHSREKRPRNESEALDVSPLREVREESPLKRRRKKKKTTSSSEVGPRGPLPMSHADLVDDPEARMRGTSDVTMRFCV
EPSSFGVKDQVSRISASCLDRCLRRASKFVSDPRFVLQRAIDHAAEVLIASIHSAVMMKAELDIREIFVAKEKENSSAALEAATTMKGELLKARSEVDILKAEVEAKAQL
LKKEDEKHKAYLRAAHAITKRLEKEKFQLLKQKNDMLQVLEEKDASIKHLTIELEVEKKRLSNRVLLEAAFRQHPDFDEFAKDFSDAGFKFMMRGIAADMPHLQIDLSDL
KKKYAEKWAFGPNSTSGPQSLVEKYVRELDSDYSYIEEEDVPSQEQVEVGTTQEEAPSQQGGSQEVNLLGSQGVTHLRRGRENPRRYNAPSRSMNRAALRAIFLCCFESC
GGFPLMNSTIGSIHEGVDVSISIMSGSRIDGLSRSVSYSDANLAKASAFEFFDIGI