CuGenDBv2

MC05g0656 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0656
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF789)
Genome location: MC05:5019221..5023998
RNA-Seq Expression: MC05g0656
Synteny: MC05g0656
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
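
The genome location field above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'CHR:START..END' string into its parts plus the span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so the length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

# Location string taken from the record above.
print(parse_location("MC05:5019221..5023998"))
```

Under the inclusive-coordinate assumption, the gene spans 4778 bp on MC05.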


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus] | e-value: 2.62e-264 | identity: 84.35%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ--------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ              KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR LADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ--------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADST

Query:  NLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDA
        NLDRFLEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS +SRR   DSDA
Subjt:  NLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDA

Query:  ESSKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSC
        ESSKETSSDGSSN G EKK K  LQ+EWI    + GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKT+RSC
Subjt:  ESSKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLW

Query:  QAADNWLRLLNVNHPDYRFFASHNSFWR
        Q AD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  QAADNWLRLLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo] | e-value: 8.85e-264 | identity: 84.87%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ         KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  +SRR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWI     LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKT+RSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

XP_022138197.1 uncharacterized protein LOC111009428 isoform X1 [Momordica charantia] | e-value: 2.00e-310 | identity: 100%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

XP_022138198.1 uncharacterized protein LOC111009428 isoform X2 [Momordica charantia] | e-value: 3.73e-308 | identity: 99.76%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVI RRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida] | e-value: 3.61e-266 | identity: 85.68%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ            KQ+ LDS    VAS+  ID+L+KR+EFDECRSWSTRSDCSVSDR LADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ------------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNL

Query:  DRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAES
        DRFLEHTTPLVPA CIPKTSLRGWR REV EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS +SRR   DSDA S
Subjt:  DRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAES

Query:  SKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDL
        SKETSSDGSSN G EKK K  LQDEWI   ++ GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKITILASRFPELKT+RSCDL
Subjt:  SKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQA
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ 
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQA

Query:  ADNWLRLLNVNHPDYRFFASHNSFWR
        ADNWLRLLNVNHPDYRFFASHNSFWR
Subjt:  ADNWLRLLNVNHPDYRFFASHNSFWR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L5V4 Uncharacterized protein | e-value: 2.62e-266 | identity: 86.36%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ----KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTT
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ    KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR LADSTNLDRFLEHTT
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ----KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTT

Query:  PLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDG
        PLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKSS +SRR   DSDAESSKETSSDG
Subjt:  PLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDG

Query:  SSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISV
        SSN G EKK K  LQ+EWI    + GSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKT+RSCDLSPSSWISV
Subjt:  SSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISV

Query:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLL
        AWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+WLRLL
Subjt:  AWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLL

Query:  NVNHPDYRFFASHNSFWR
        NVNHPDYRFFASHNSFWR
Subjt:  NVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X1 | e-value: 4.28e-264 | identity: 84.87%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ         KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  +SRR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWI     LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKT+RSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X2 | e-value: 5.60e-262 | identity: 84.63%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ         KQ+ALDS     A+++ ID+L+KR+EFDECRSWSTRSDCSVSDR L DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQ---------KQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRF

Query:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE
        LEHTTPLVPA CIPKTSLRGWR REV+EA PYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+DPSKS  + RR   DSDAESSKE
Subjt:  LEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKE

Query:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS
        TSSDGSSN G EKK K  LQ+EWI     LGSQR +QMNVPS+ESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKT+RSCDLSPS
Subjt:  TSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQR-VQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN
        SWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHW R REVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEEC KA+SLWQ AD+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADN

Query:  WLRLLNVNHPDYRFFASHNSFWR
        WLRLLNVNHPDYRFFASHNSFWR
Subjt:  WLRLLNVNHPDYRFFASHNSFWR

A0A6J1CAE5 uncharacterized protein LOC111009428 isoform X1 | e-value: 9.69e-311 | identity: 100%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

A0A6J1CCE0 uncharacterized protein LOC111009428 isoform X2 | e-value: 1.81e-308 | identity: 99.76%
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVP

Query:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC
        AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVI RRCSVDSDAESSKETSSDGSSNC
Subjt:  AQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNC

Query:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
        GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI
Subjt:  GTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPI

Query:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
        YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP
Subjt:  YRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHP

Query:  DYRFFASHNSFWR
        DYRFFASHNSFWR
Subjt:  DYRFFASHNSFWR

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e-value: 7.9e-84 | identity: 52.15%
Query:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLY--IDPSKSSVISRR
        A S+N++RFL+  TP VPA  + KT +R     +V    PYF+LGD+WESF EWSAYG GVPL LN + D V QYYVP LSGIQ+Y  +D   SS+ +RR
Subjt:  ADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLY--IDPSKSSVISRR

Query:  CSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPEL
           +S+++  +++SS+GSS   +E +       E I       S R +      +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPEL
Subjt:  CSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPEL

Query:  KTHRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECP
        KT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T FQG G      H  + RE        K++LP+FGLASYK +   W S G     
Subjt:  KTHRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECP

Query:  KANSLWQAADNWLRLLNVNHPDYRFF
         ANSL+QAADNWLRL  VNHPD+ FF
Subjt:  KANSLWQAADNWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789) | e-value: 4.2e-77 | identity: 44.55%
Query:  VSGGVSIARIR-GENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPA
        +  G  + R R G++ FY     RR  Q+  Q ++ Q+ + SN    A S    +L+              SD S        S+NLDRFLE  TP VPA
Subjt:  VSGGVSIARIR-GENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPA

Query:  QCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVISRRCSVDSDAESSKETSSDGS
        Q + KT LR  R   +  +  PYFVLGD+W+SF EWSAYGTGVPL+LN + D V+QYYVP LS IQ+Y       SS+ SRR    SD++  +++SSD S
Subjt:  QCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVISRRCSVDSDAESSKETSSDGS

Query:  SNCGTEKKMKAV----LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWI
        S+  +E+    V    L+D+                    +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW 
Subjt:  SNCGTEKKMKAV----LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWI

Query:  SVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD-GLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWL
        SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G G++  +     R  E        K+ LP+FGLASYKF+   W   G  E    NSL+QAAD WL
Subjt:  SVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD-GLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWL

Query:  RLLNVNHPDYRFF
           +V+HPD+ FF
Subjt:  RLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789) | e-value: 8.0e-60 | identity: 45.37%
Query:  VSGGVSIARIR-GENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPA
        +  G  + R R G++ FY     RR  Q+  Q ++ Q+ + SN    A S    +L+              SD S        S+NLDRFLE  TP VPA
Subjt:  VSGGVSIARIR-GENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPA

Query:  QCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVISRRCSVDSDAESSKETSSDGS
        Q + KT LR  R   +  +  PYFVLGD+W+SF EWSAYGTGVPL+LN + D V+QYYVP LS IQ+Y       SS+ SRR    SD++  +++SSD S
Subjt:  QCIPKTSLRGWRT-REVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGS-DSVVQYYVPYLSGIQLYIDPS--KSSVISRRCSVDSDAESSKETSSDGS

Query:  SNCGTEKKMKAV----LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWI
        S+  +E+    V    L+D+                    +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW 
Subjt:  SNCGTEKKMKAV----LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWI

Query:  SVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG
        SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G
Subjt:  SVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789) | e-value: 1.3e-97 | identity: 49.4%
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVAS----STRIDELDKRTEFDECRSWSTRSDCSVSDR-------TLADSTNLDRFLEHTTPLV
        RIRGENRFY+PP MR+  Q++++++ +   ++   K+         +++E + + + +EC    + SDCSV  R       T   S+NL RFL+ TTP+V
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVAS----STRIDELDKRTEFDECRSWSTRSDCSVSDR-------TLADSTNLDRFLEHTTPLV

Query:  PAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSN
          Q +P TS +GWRTRE  E  PYF+L DLW+SF+EWSAYG GVPLLLNG DSVVQYYVPYLSGIQLY DPS++    RR   +SD +S ++ SSDGS++
Subjt:  PAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSN

Query:  CG--TEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESD-SCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVA
        C   ++   +A L+++                 P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+T+RSCDLSPSSW+SVA
Subjt:  CG--TEKKMKAVLQDEWIHHSTILGSQRVQMNVPSAESSSDESD-SCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVA

Query:  WYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECPKANSLWQAADNWLRLL
        WYPIYRIP G +LQ+LDACFLTFHSLST  +G   +  Q   S ++ V +A    KL LP FGLASYKFK+  W+  +  +E  +  +L + A+ WLR L
Subjt:  WYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECPKANSLWQAADNWLRLL

Query:  NVNHPDYRFFASHN-SFWR
         V  PD+R F SH+ S WR
Subjt:  NVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789) | e-value: 3.2e-93 | identity: 49.43%
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ---KQN-------ALDSNSKEVAS----STR----IDELDKRTEFDECRSWSTRSDCSV-SD
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ   KQ         +D   ++ A+    +TR    + E   R         +  SD S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ---KQN-------ALDSNSKEVAS----STR----IDELDKRTEFDECRSWSTRSDCSV-SD

Query:  RTLADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGV-----PLLLNGSDSVVQYYVPYLSGIQLYIDPSKSS
        R L+D +NLDRFLEHTTP+VPA+  P  S    +TRE ++   YFVL DLWESF EWSAYG GV     PL ++G+DS VQYYVPYLSGIQLY+DP K  
Subjt:  RTLADSTNLDRFLEHTTPLVPAQCIPKTSLRGWRTREVTEAPPYFVLGDLWESFKEWSAYGTGV-----PLLLNGSDSVVQYYVPYLSGIQLYIDPSKSS

Query:  VISRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV--LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITIL
                  +     E SS+GSSN  T     +V  L    +   +I GS            SS E++   P G+L+FEYLE +PPF REPL +KI+ L
Subjt:  VISRRCSVDSDAESSKETSSDGSSNCGTEKKMKAV--LQDEWIHHSTILGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITIL

Query:  ASRFPELKTHRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNS
        ASR PEL T+RSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLSTA               A     +    KL LP FGLASYK K+  WN 
Subjt:  ASRFPELKTHRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNS

Query:  TGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHN
           +E  K  SL QAAD WL+ L V+HPDYRFF S++
Subjt:  TGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHN
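
In the pairwise alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these per block. The function name `block_stats` is hypothetical, and the counts for one block are only a partial view of the whole-alignment % identity reported for each hit.

```python
def block_stats(query_line: str, midline: str):
    """Count identities and positives in one alignment block.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a block) are often lost when the page
    is copied as text.
    """
    midline = midline.ljust(len(query_line))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query_line)

# Final block of the first GenBank hit above (Cucumis sativus).
query = "QAADNWLRLLNVNHPDYRFFASHNSFWR"
mid   = "Q AD+WLRLLNVNHPDYRFFASHNSFWR"
print(block_stats(query, mid))  # (26, 27, 28)
```

Summing identities and block lengths over all blocks of a hit, then dividing, gives a figure comparable to the % identity column, assuming the displayed blocks cover the full alignment.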


Sequences
CDS sequence
ATGTCAGTCTCCGGTGGGGTTTCGATTGCCAGAATCCGTGGCGAGAATCGGTTCTACCATCCACCTGCGATGCGGCGACGTTTGCAGCAGCAGCAGCAGCAGCAGCAGAA
GCAGAACGCCTTGGATTCTAATTCTAAGGAGGTTGCTTCTTCTACTAGGATCGATGAGTTGGACAAGAGGACTGAGTTCGATGAGTGTCGTTCTTGGTCCACTCGCTCTG
ATTGCTCCGTTTCGGATCGAACACTTGCTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCCTCGTTCCGGCTCAATGTATTCCTAAGACGAGCCTGAGG
GGTTGGAGGACTCGTGAAGTCACAGAGGCGCCTCCTTATTTTGTGCTTGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAACGGGTGTCCCTCTTTTGTT
AAATGGTAGCGACTCTGTGGTACAGTACTACGTTCCCTATCTGTCTGGCATTCAACTCTATATAGATCCATCTAAGTCCTCTGTTATAAGTAGAAGGTGTAGTGTGGATA
GTGATGCTGAGTCCTCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTGTGGAACAGAAAAGAAGATGAAGGCCGTTCTTCAAGATGAGTGGATCCATCACTCCACTATT
CTGGGGTCACAAAGAGTTCAAATGAATGTTCCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCCTCATGGTCAGCTTGTGTTTGAATACTTGGAGCGAGA
TCCACCATTTTGTCGTGAACCATTGACTGATAAGATCACTATCCTTGCATCGCGTTTTCCTGAATTGAAGACACATAGGAGCTGTGATTTATCTCCTTCCAGTTGGATTT
CTGTGGCATGGTATCCAATCTATCGGATTCCAACGGGTCCGACTCTACAAAGTCTAGATGCTTGTTTCTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATTGGC
ACCGATGGTTTGCAGTTCCATTGGTCAAGAGCTAGAGAGGTGTACACTGCTGATTGCCCCCTCAAATTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAAT
TCCTTTTTGGAATTCGACTGGTGCGGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAAGCTGCTGACAACTGGCTCAGGTTATTAAACGTAAACCATCCTGATTATAGAT
TTTTCGCATCTCACAATTCATTCTGGAGATGA
mRNA sequence
GACGAAGTGTGAGAGCCCATATATCTAAATTGGGCCGTGAAAATGGGCCGCCTTTGAGCCCAATGAGCCCGTAATCAGTCCAGGCAGCCGTCAGGAGAGAAGATTCGTGG
AATCGAAACTCCAAAGATCCAGATTCGAGGAAGGATAGATCGGTCATTAAGAGAAAAGATGGAAGGTCGAAATGGTACTTTCAGTGGCAGATACCACACAGCCATTACGC
AAGGATACGGAGATTGGTGGGGAAGATGACTAAAAAGGCGCTCAAAATTATCATTCTGGACGCTTCTCGTCCTCCTCTGACCTCACCCTCTTCCCTAAAATCTCCCTTCC
TTTTTCCTAATTCCACAAAATAGGGCACAAATTTTCCCAAATCTTTCAGTTAGTTATCTTCTCACTAGATTTTACAAAACCCTAGTTTCTCCTTCTCCTTCTTCTTCTTC
TTCTTCTTCTTCTTCCCACTGTATATTCAAACAAAACCTAGGCTCCTTTTTCCCCTCCCACGGTGCCCTATCGTTTGTTGCAATGTCAGTCTCCGGTGGGGTTTCGATTG
CCAGAATCCGTGGCGAGAATCGGTTCTACCATCCACCTGCGATGCGGCGACGTTTGCAGCAGCAGCAGCAGCAGCAGCAGAAGCAGAACGCCTTGGATTCTAATTCTAAG
GAGGTTGCTTCTTCTACTAGGATCGATGAGTTGGACAAGAGGACTGAGTTCGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCCGTTTCGGATCGAACACTTGC
TGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCCCTCGTTCCGGCTCAATGTATTCCTAAGACGAGCCTGAGGGGTTGGAGGACTCGTGAAGTCACAGAGG
CGCCTCCTTATTTTGTGCTTGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATATGGAACGGGTGTCCCTCTTTTGTTAAATGGTAGCGACTCTGTGGTACAGTAC
TACGTTCCCTATCTGTCTGGCATTCAACTCTATATAGATCCATCTAAGTCCTCTGTTATAAGTAGAAGGTGTAGTGTGGATAGTGATGCTGAGTCCTCAAAGGAAACAAG
CAGTGATGGAAGCAGTAATTGTGGAACAGAAAAGAAGATGAAGGCCGTTCTTCAAGATGAGTGGATCCATCACTCCACTATTCTGGGGTCACAAAGAGTTCAAATGAATG
TTCCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCCTCATGGTCAGCTTGTGTTTGAATACTTGGAGCGAGATCCACCATTTTGTCGTGAACCATTGACT
GATAAGATCACTATCCTTGCATCGCGTTTTCCTGAATTGAAGACACATAGGAGCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCAATCTATCGGAT
TCCAACGGGTCCGACTCTACAAAGTCTAGATGCTTGTTTCTTGACCTTCCATTCTCTGTCAACAGCATTTCAAGGCATTGGCACCGATGGTTTGCAGTTCCATTGGTCAA
GAGCTAGAGAGGTGTACACTGCTGATTGCCCCCTCAAATTGCAGTTGCCAATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCGGAG
GAATGTCCGAAGGCCAACTCTTTGTGGCAAGCTGCTGACAACTGGCTCAGGTTATTAAACGTAAACCATCCTGATTATAGATTTTTCGCATCTCACAATTCATTCTGGAG
ATGATAACAAGGGTAGTATGCATGATTTCTTAAATGTGGGATTACAGTGTCTTAAGAGGAGGATCTAATTCCTAAGAAACTCGGTTCTCCTGAATGTTGTGAAAATTTAT
ATGGCATCTTTAGCTTTTTTTTTTTTCTTTGAATTTGTACGTTTGACCATATAAAGGGTTTTGTACAGGGAGAATGATGTTGATAGTAATATTTTACGGTTGACATGGGG
ATTGGGTTTTAGGATAGTAGTTATTCTGGGAGCGGCCAATCCCCTATGGTGTTAGTCTATTCTTCCATGTGTTGAGATATGAGTGAGTGGTGATGGGAATAGGGCGGTTG
TGGACGAGCAGAGAAACTGCTGCCAATAACAACACCGCCTTTTTTTTTTCTTTTTACTTTAAATTGTGGATTAGATTTGTAACTTTCACTGGCACCTCGATAATGATGAA
AAGTAAATCTAGTGTGTTAACTACAATCTCGCCGTATTTAACAGTTGATGTGATAATTTATTTACTGCTGGGTTTTAAGTGTTTTGAATACCATCTTTCTGCTCTGTATG
ACTTGCTATTGTGATCAAATCGTCAACATTTTCTATGTGCATTTGTTCTGTATTGTCATGTCATTTCATTCTTTTTATTTGGGATTCCCTTCATAGTTGATATTTTTATT
CGCTAAGAAAGATGATGTTTTTTGTTTCTTTTTTCAAGTTACTTTAGTATTGGACAATTATTTATGTTTCTTATTT
Protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQKQNALDSNSKEVASSTRIDELDKRTEFDECRSWSTRSDCSVSDRTLADSTNLDRFLEHTTPLVPAQCIPKTSLR
GWRTREVTEAPPYFVLGDLWESFKEWSAYGTGVPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSVISRRCSVDSDAESSKETSSDGSSNCGTEKKMKAVLQDEWIHHSTI
LGSQRVQMNVPSAESSSDESDSCYPHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTHRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIG
TDGLQFHWSRAREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECPKANSLWQAADNWLRLLNVNHPDYRFFASHNSFWR
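
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch for checking that, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is illustrative and not something the database provides.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 51 nt of the CDS listed above.
prefix = "ATGTCAGTCTCCGGTGGGGTTTCGATTGCCAGAATCCGTGGCGAGAATCGG"
print(translate(prefix))  # MSVSGGVSIARIRGENR
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, would confirm that the two entries agree.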