; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014648 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014648
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationscaffold1096:309999..313506
RNA-Seq ExpressionMS014648
SyntenyMS014648
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]9.7e-20984.58Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ----------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ                KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR LADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ----------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADST

Query:  NLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDA
        NLDRFLEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS LSRR   DSDA
Subjt:  NLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDA

Query:  ESSKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETSSDGSSN G EKK K  LQ+EWIQ   + GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLW

Query:  QAADNWLRLLNVNHPDYRFFASHNSFWR
        Q AD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  QAADNWLRLLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]2.8e-20885.11Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ           KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  LSRR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWIQ    LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

XP_022138197.1 uncharacterized protein LOC111009428 isoform X1 [Momordica charantia]4.0e-23998.79Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSV+SRRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWI HSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKT+RSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

XP_022138198.1 uncharacterized protein LOC111009428 isoform X2 [Momordica charantia]2.2e-23798.55Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSV+ RRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWI HSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKT+RSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida]3.9e-21085.92Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ--------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ              KQ+ LDS    VAS+  ID+L+KR+EFDECRSWSTRSDCSVSDR LADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ--------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNL

Query:  DRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAES
        DRFLEHTTPLVPA CIPKTSLRGWR REV EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS LSRR   DSDA S
Subjt:  DRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAES

Query:  SKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL
        SKETSSDGSSN G EKK K  LQDEWIQ  ++ GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL
Subjt:  SKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQA
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ 
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQA

Query:  ADNWLRLLNVNHPDYRFFASHNSFWR
        ADNWLRLLNVNHPDYRFFASHNSFWR
Subjt:  ADNWLRLLNVNHPDYRFFASHNSFWR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein3.2e-21086.6Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTT
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ      KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR LADSTNLDRFLEHTT
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTT

Query:  PLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDG
        PLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS LSRR   DSDAESSKETSSDG
Subjt:  PLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDG

Query:  SSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISV
        SSN G EKK K  LQ+EWIQ   + GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSCDLSPSSWISV
Subjt:  SSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISV

Query:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLL
        AWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+WLRLL
Subjt:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLL

Query:  NVNHPDYRFFASHNSFWR
        NVNHPDYRFFASHNSFWR
Subjt:  NVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X11.4e-20885.11Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ           KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  LSRR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWIQ    LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X24.4e-20784.87Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ           KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQ-----------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  L RR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWIQ    LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

A0A6J1CAE5 uncharacterized protein LOC111009428 isoform X11.9e-23998.79Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSV+SRRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWI HSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKT+RSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

A0A6J1CCE0 uncharacterized protein LOC111009428 isoform X21.1e-23798.55Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSV+ RRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWI HSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKT+RSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)3.5e-8452.15Show/hide
Query:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLY--IDPSKSSVLSRR
        A S+N++RFL+  TP VPA  + KT +R     +V    PYF+LGD+WESF EWSAYG GVPL LN + D V QYYVP LSGIQ+Y  +D   SS+ +RR
Subjt:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLY--IDPSKSSVLSRR

Query:  CSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPEL
           +S+++  +++SS+GSS   +E +       E I       S R +      +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPEL
Subjt:  CSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPEL

Query:  KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECP
        KT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T FQG G      H  + RE        K++LP+FGLASYK +   W S G     
Subjt:  KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECP

Query:  KANSLWQAADNWLRLLNVNHPDYRFF
         ANSL+QAADNWLRL  VNHPD+ FF
Subjt:  KANSLWQAADNWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789)2.2e-7847.06Show/hide
Query:  SNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLAD--STNLDRFLEHTTPLVPAQCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAY
        +++K   ++ RID+L +R + D     S+           +D  S+NLDRFLE  TP VPAQ + KT LR  R   +  +  PYFVLGD+W+SF EWSAY
Subjt:  SNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLAD--STNLDRFLEHTTPLVPAQCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAY

Query:  GTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV----LQDEWIQHSTILGSQRVQMNVPS
        GTGVPL+LN + D V+QYYVP LS IQ+Y       SS+ SRR    SD++  +++SSD SS+  +E+    V    L+D   QH               
Subjt:  GTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV----LQDEWIQHSTILGSQRVQMNVPS

Query:  AESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD
         +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G G++
Subjt:  AESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD

Query:  -GLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHPDYRFF
          +     R  E        K+ LP+FGLASYKF+   W   G  E    NSL+QAAD WL   +V+HPD+ FF
Subjt:  -GLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789)4.2e-6148.65Show/hide
Query:  SNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLAD--STNLDRFLEHTTPLVPAQCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAY
        +++K   ++ RID+L +R + D     S+           +D  S+NLDRFLE  TP VPAQ + KT LR  R   +  +  PYFVLGD+W+SF EWSAY
Subjt:  SNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLAD--STNLDRFLEHTTPLVPAQCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAY

Query:  GTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV----LQDEWIQHSTILGSQRVQMNVPS
        GTGVPL+LN + D V+QYYVP LS IQ+Y       SS+ SRR    SD++  +++SSD SS+  +E+    V    L+D   QH               
Subjt:  GTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV----LQDEWIQHSTILGSQRVQMNVPS

Query:  AESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG
         +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G
Subjt:  AESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789)1.0e-9951.08Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECR----SWSTRSDCSVSDR-------TLADSTNLDRFLEHTTPLVPA
        RIRGENRFY+PP M R+LQQ++++K+   +   KE   +  I  LD++ + +E         + SDCSV  R       T   S+NL RFL+ TTP+V  
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECR----SWSTRSDCSVSDR-------TLADSTNLDRFLEHTTPLVPA

Query:  QCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNCG
        Q +P TS +GWRTRE  E  PYF+L DLW+SF+EWSAYG GVPLLLNG DSVVQYYVPYLSGIQLY DPS++    RR   +SD +S ++ SSDGS++C 
Subjt:  QCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNCG

Query:  --TEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESD-SCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWY
          ++   +A L+++                 P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+TYRSCDLSPSSW+SVAWY
Subjt:  --TEKKMKAVLQDEWIQHSTILGSQRVQMNVPSAESSSDESD-SCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWY

Query:  PIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECPKANSLWQAADNWLRLLNV
        PIYRIP G +LQ+LDACFLTFHSLST  +G   +  Q   S ++ V +A    KL LP FGLASYKFK+  W+  +  +E  +  +L + A+ WLR L V
Subjt:  PIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECPKANSLWQAADNWLRLLNV

Query:  NHPDYRFFASHN-SFWR
          PD+R F SH+ S WR
Subjt:  NHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789)3.8e-9449.43Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRR-----RLQQQQQQKQN-------ALDSNSKEVAS----STR----IDELDKRTEFDECRSWSTRSDCSV-SD
        MS SGGVSIAR  IRGENRFY+PP MRR     +LQQQ ++KQ         +D   ++ A+    +TR    + E   R         +  SD S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRR-----RLQQQQQQKQN-------ALDSNSKEVAS----STR----IDELDKRTEFDECRSWSTRSDCSV-SD

Query:  RTLADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGV-----PLLLNGSDSVVQYYVPYLSGIQLYIDPSKSS
        R L+D +NLDRFLEHTTP+VPA+  P  S    +TRE ++   YFVL DLWESF EWSAYG GV     PL ++G+DS VQYYVPYLSGIQLY+DP K  
Subjt:  RTLADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGV-----PLLLNGSDSVVQYYVPYLSGIQLYIDPSKSS

Query:  VLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV--LQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITIL
                  +     E SS+GSSN  T     +V  L    ++  +I GS            SS E++   P G+L+FEYLE +PPF REPL +KI+ L
Subjt:  VLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV--LQDEWIQHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITIL

Query:  ASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNS
        ASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLSTA               A     +    KL LP FGLASYK K+  WN 
Subjt:  ASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNS

Query:  TGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHN
           +E  K  SL QAAD WL+ L V+HPDYRFF S++
Subjt:  TGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCAGAATCCGTGGCGAGAATCGGTTCTACCATCCACCTGCGATGCGGCGACGTTTGCAGCAGCAGCAGCAGCAGAAGCAGAA
CGCCTTGGATTCTAATTCTAAGGAGGTTGCTTCTTCTACTAGGATCGATGAGTTGGACAAGAGGACTGAGTTCGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCT
CCGTTTCGGATCGAACACTTGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCCTCGTTCCGGCTCAATGTATTCCTAAGACGAGCCTGAGGGGTTGG
AGGACTCGTGAAGTCACAGAGGCGCCTCCTTATTTTGTGCTTGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAACGGGTGTCCCTCTTTTGTTAAATGG
TAGCGACTCTGTGGTACAGTACTACGTTCCCTATCTGTCTGGCATTCAACTCTATATAGATCCATCTAAGTCCTCTGTTCTAAGTAGAAGGTGTAGTGTGGATAGTGATG
CTGAGTCCTCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTGTGGAACAGAAAAGAAGATGAAGGCCGTTCTTCAAGATGAGTGGATCCAGCACTCCACTATTCTGGGG
TCACAAAGAGTTCAAATGAATGTTCCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCCTCATGGTCAGCTTGTGTTTGAATACTTGGAGCGAGATCCACC
ATTTTGTCGTGAACCATTGACTGATAAGATCACTATCCTTGCATCGCGTTTTCCTGAATTGAAGACATATAGGAGCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGG
CATGGTATCCAATCTATCGGATTCCAACGGGTCCGACTCTACAAAGTCTAGATGCTTGTTTCTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATTGGCACCGAT
GGTTTGCAGTTCCATTGGTCAAGAGCTAGAGAGGTGTACACTGCCGATTGCCCCCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTT
TTGGAATTCGACTGGTGCGGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAAGCTGCTGACAACTGGCTCAGGTTATTAAACGTAAACCATCCTGATTATAGATTTTTCG
CATCTCACAATTCATTCTGGAGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCAGAATCCGTGGCGAGAATCGGTTCTACCATCCACCTGCGATGCGGCGACGTTTGCAGCAGCAGCAGCAGCAGAAGCAGAA
CGCCTTGGATTCTAATTCTAAGGAGGTTGCTTCTTCTACTAGGATCGATGAGTTGGACAAGAGGACTGAGTTCGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCT
CCGTTTCGGATCGAACACTTGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCCTCGTTCCGGCTCAATGTATTCCTAAGACGAGCCTGAGGGGTTGG
AGGACTCGTGAAGTCACAGAGGCGCCTCCTTATTTTGTGCTTGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAACGGGTGTCCCTCTTTTGTTAAATGG
TAGCGACTCTGTGGTACAGTACTACGTTCCCTATCTGTCTGGCATTCAACTCTATATAGATCCATCTAAGTCCTCTGTTCTAAGTAGAAGGTGTAGTGTGGATAGTGATG
CTGAGTCCTCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTGTGGAACAGAAAAGAAGATGAAGGCCGTTCTTCAAGATGAGTGGATCCAGCACTCCACTATTCTGGGG
TCACAAAGAGTTCAAATGAATGTTCCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCCTCATGGTCAGCTTGTGTTTGAATACTTGGAGCGAGATCCACC
ATTTTGTCGTGAACCATTGACTGATAAGATCACTATCCTTGCATCGCGTTTTCCTGAATTGAAGACATATAGGAGCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGG
CATGGTATCCAATCTATCGGATTCCAACGGGTCCGACTCTACAAAGTCTAGATGCTTGTTTCTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATTGGCACCGAT
GGTTTGCAGTTCCATTGGTCAAGAGCTAGAGAGGTGTACACTGCCGATTGCCCCCTCAAACTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTT
TTGGAATTCGACTGGTGCGGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAAGCTGCTGACAACTGGCTCAGGTTATTAAACGTAAACCATCCTGATTATAGATTTTTCG
CATCTCACAATTCATTCTGGAGA
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPAQCIPKTSLRGW
RTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVLSRRCSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIQHSTILG
SQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD
GLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHNSFWR