; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031629 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031629
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationscaffold11:37441309..37445628
RNA-Seq ExpressionSpg031629
SyntenySpg031629
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus]7.0e-21383.33Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-----------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ           KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-----------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV
        NLDRFLEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV
Subjt:  NLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV

Query:  PYLSGIQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERD
        PYLSGIQLYVDPSKSSAL RRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERD
Subjt:  PYLSGIQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERD

Query:  PPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPI
        PPFCREPLTDKIT+LASRF ELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPI
Subjt:  PPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPI

Query:  FGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        FGLA YKFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  FGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo]2.4e-21384.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ      KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
        LEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
Subjt:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG

Query:  IQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCR
        IQLYVDPSKS AL RRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCR
Subjt:  IQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCR

Query:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLAC
        EPLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLA 
Subjt:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLAC

Query:  YKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        YKFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  YKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo]9.7e-21584.22Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ      KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
        LEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
Subjt:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG

Query:  IQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCRE
        IQLYVDPSKS ALRRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCRE
Subjt:  IQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCRE

Query:  PLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACY
        PLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLA Y
Subjt:  PLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACY

Query:  KFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        KFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  KFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]2.8e-21483.52Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-----------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ           KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-----------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV
        NLDRFLEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV
Subjt:  NLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYV

Query:  PYLSGIQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDP
        PYLSGIQLYVDPSKSSALRRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDP
Subjt:  PYLSGIQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDP

Query:  PFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIF
        PFCREPLTDKIT+LASRF ELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIF
Subjt:  PFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIF

Query:  GLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        GLA YKFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  GLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida]1.3e-21183.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ---------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ         KQS LDSK+V+ A+ A IDDLEKRS+FDECRSWSTRSDCSVSDRGLADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ---------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPY
        DRFLEHTTPLVPA CIPK                            TS RGWR REV EA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPY
Subjt:  DRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPY

Query:  LSGIQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPP
        LSGIQLYVDPSKSS+L RRRG +SDA SSKETSSDGSSN GAE KTKTALQDEWIQD S  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPP
Subjt:  LSGIQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPP

Query:  FCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFG
        FCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFG
Subjt:  FCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFG

Query:  LACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        LA YKFKIPFW+S GAEEC KA+SLWQ AD+WLR L VNHPDYRFFASHNSFWR
Subjt:  LACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

TrEMBL top hitse value%identityAlignment
A0A0A0L5V4 Uncharacterized protein2.3e-21485.2Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTT
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTT
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ-KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTT

Query:  PLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYV
        PLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYV
Subjt:  PLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYV

Query:  DPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTD
        DPSKSSAL RRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTD
Subjt:  DPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTD

Query:  KITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKI
        KIT+LASRF ELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFGLA YKFKI
Subjt:  KITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKI

Query:  PFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        PFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  PFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X11.2e-21384.04Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ      KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
        LEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
Subjt:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG

Query:  IQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCR
        IQLYVDPSKS AL RRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCR
Subjt:  IQLYVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCR

Query:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLAC
        EPLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLA 
Subjt:  EPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLAC

Query:  YKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        YKFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  YKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X24.7e-21584.22Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ      KQSALDSK+VVAAA + IDDLEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQ------KQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
        LEHTTPLVPA CIPK                            TS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG
Subjt:  LEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSG

Query:  IQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCRE
        IQLYVDPSKS ALRRRGA+SDAESSKETSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCRE
Subjt:  IQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCRE

Query:  PLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACY
        PLTDKIT+LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLA Y
Subjt:  PLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACY

Query:  KFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        KFKIPFW+S GAEEC KA+SLWQ AD WLR L VNHPDYRFFASHNSFWR
Subjt:  KFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

A0A6J1CCE0 uncharacterized protein LOC111009428 isoform X21.2e-20581.31Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTP
        MSVSGGVSIARIRGENRFYHPPAMRRRL   QQQQQQQQKQ+ALDS     A++ RID+L+KR++FDECRSWSTRSDCSVSDR LADSTNLDRFLEHTTP
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTP

Query:  LVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVD
        LVPAQCIPK                            TS RGWRTREV+EAPPYFVLGDLWESFKEWSAYG G+PLLLNGSDSVVQYYVPYLSGIQLY+D
Subjt:  LVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVD

Query:  PSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKI
        PSKSS +RR   +SDAESSKETSSDGSSNCG E K K  LQDEWI   + LGSQR +QMN PSAESSSDESDSCY HGQLVFEYLERDPPFCREPLTDKI
Subjt:  PSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKI

Query:  TILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKIPF
        TILASRFPELKT+RSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FHSLSTAFQGIGTDG+Q HW R REVYTADCPLKLQLPIFGLA YKFKIPF
Subjt:  TILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKIPF

Query:  WSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        W+S GAEECPKANSLWQ AD+WLR L VNHPDYRFFASHNSFWR
Subjt:  WSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

A0A6J1IRT8 uncharacterized protein LOC111479394 isoform X16.2e-20782.59Show/hide
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQ---QQKQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEH
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQ   QQKQSALDSK+VVAAA+ARIDDLEKRS+FDECRSWSTRSDCSVSDRGLADSTNLDRFLE 
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQ---QQKQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEH

Query:  TTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQL
        TTPLV A CIPK                            T  RGWRTREVSEAPPYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQL
Subjt:  TTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQL

Query:  YVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPL
        Y+ PSKSSAL RRRG +SDAESSKETSSDGSSNCGAE KTK  LQDE IQD+S  GSQRALQMN PSAESSSDESDSCYRHGQLVFEY+ERDPPFCREPL
Subjt:  YVDPSKSSAL-RRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPL

Query:  TDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKF
        TDKI ILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI +DG+Q  WPR+REVYTADCPLKLQLPIFGLA YKF
Subjt:  TDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKF

Query:  KIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR
        K+PFW+S G EEC KA SLWQ A+ WLR L VNHPDYRFF+SH+SF R
Subjt:  KIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHNSFWR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.2e-7747.46Show/hide
Query:  ADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSV
        A S+N++RFL+  TP VPA  + K       T+V+                   +RG    +V    PYF+LGD+WESF EWSAYG G+PL LN + D V
Subjt:  ADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSV

Query:  VQYYVPYLSGIQLY--VDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVF
         QYYVP LSGIQ+Y  VD   SS   RR  E      +++SS+GSS   +E++       E    ISA   + +L+      +SSSD+ +     G+L+F
Subjt:  VQYYVPYLSGIQLY--VDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVF

Query:  EYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPL
        EYLERD P+ REP  DK++ LASRFPELKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFL++HSL T FQG G     +H  + RE        
Subjt:  EYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPL

Query:  KLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFF
        K++LP+FGLA YK +   W+S G      ANSL+Q AD+WLR  +VNHPD+ FF
Subjt:  KLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789)1.8e-7343.19Show/hide
Query:  AAARIDDLEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEA
        A  RID L + +SD     S +        +     S+NLDRFLE  TP VPAQ + K                         +L   +R     + ++ 
Subjt:  AAARIDDLEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEA

Query:  PPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPS--KSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDI
         PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y       SS   RR  +S     +++SSD SS+  +E  +          D 
Subjt:  PPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPS--KSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDI

Query:  SALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL
         +L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFL
Subjt:  SALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL

Query:  SFHSLSTAFQGIGTD-GMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFF
        ++HSL T+F G G++  M L  PR  E        K+ LP+FGLA YKF+   W+  G  E    NSL+Q AD WL S  V+HPD+ FF
Subjt:  SFHSLSTAFQGIGTD-GMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789)9.8e-5643.09Show/hide
Query:  AAARIDDLEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEA
        A  RID L + +SD     S +        +     S+NLDRFLE  TP VPAQ + K                         +L   +R     + ++ 
Subjt:  AAARIDDLEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEA

Query:  PPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPS--KSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDI
         PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y       SS   RR  +S     +++SSD SS+  +E  +          D 
Subjt:  PPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYVDPS--KSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDI

Query:  SALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL
         +L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFL
Subjt:  SALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFL

Query:  SFHSLSTAFQG
        ++HSL T+F G
Subjt:  SFHSLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789)4.5e-9347.32Show/hide
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVVAAAAARIDDLEKR-SDFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEHTTPLV
        RIRGENRFY+PP M R+LQQ++++++ + ++   + K+       +I   EK     +EC    + SDCSV  R  + +T       NL RFL+ TTP+V
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVVAAAAARIDDLEKR-SDFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEHTTPLV

Query:  PAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPS
          Q +P                             TS +GWRTRE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS
Subjt:  PAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPS

Query:  KSSALRRR-GAESDAESSKETSSDGSSNCG--AENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTD
        ++   RRR G ESD +S ++ SSDGS++C   ++N  + +L+++                  P   SSSDES+ S    G+LVFEYLE   PF REPLTD
Subjt:  KSSALRRR-GAESDAESSKETSSDGSSNCG--AENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTD

Query:  KITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKI
        KI+ L+S+FP L+TYRSCDLSPSSW+SVAWYPIYRIP G +LQ+LDACFL+FHSLST  +G   +  Q      + V +A    KL LP FGLA YKFK+
Subjt:  KITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKI

Query:  PFWS-SAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHN-SFWR
          WS  +  +E  +  +L + A++WLR LKV  PD+R F SH+ S WR
Subjt:  PFWS-SAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789)5.2e-8946.68Show/hide
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVV--------AAAAAR-------IDDLEKR---SDFDECRSWSTRSDCSV
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ ++KQ   D  EV+        A  A R       + + + R   S  + C   S  S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVV--------AAAAAR-------IDDLEKR---SDFDECRSWSTRSDCSV

Query:  SDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGI-----P
        S R L+D +NLDRFLEHTTP+VPA+  P  +    W L                         +TRE S+   YFVL DLWESF EWSAYGAG+     P
Subjt:  SDRGLADSTNLDRFLEHTTPLVPAQCIPKVNCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGI-----P

Query:  LLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCY
        L ++G+DS VQYYVPYLSGIQLYVDP K    + R    D     E SS+GSSN      ++T   D  + +++ +    +L+    +   SS E++   
Subjt:  LLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRRRGAESDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCY

Query:  RHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTA--FQGIGTDGMQLHWPRIR
          G+L+FEYLE +PPF REPL +KI+ LASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFL+FHSLSTA     +G    Q       
Subjt:  RHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTA--FQGIGTDGMQLHWPRIR

Query:  EVYTADCPLKLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHN
                 KL LP FGLA YK K+  W+    +E  K  SL Q AD WL+ L+V+HPDYRFF S++
Subjt:  EVYTADCPLKLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAACAGCAACAGCAACAGCAACA
GCAGCAGAAGCAGAGCGCCTTGGATTCCAAGGAGGTTGTCGCTGCTGCTGCTGCTAGGATCGATGACTTGGAGAAGAGGAGTGACTTTGATGAGTGTCGTTCTTGGTCCA
CTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGACTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCTCTTGTTCCGGCTCAATGTATTCCTAAGGTT
AATTGTTTTGTTGTGTGGACTTTGGTTAAGTACGTTATTGTGGTGGAAATTGATGTTGTTTGTATAAAGATTAGCCTTCGCACGAGCCAGAGGGGATGGAGAACTCGTGA
AGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCCGGTATCCCTCTATTGTTAAATGGTAGCGATTCTG
TAGTACAGTACTATGTTCCGTATCTGTCCGGCATTCAACTCTATGTTGATCCATCAAAGTCCTCTGCCCTAAGAAGGCGTGGTGCAGAGAGTGATGCTGAGTCCTCAAAG
GAAACCAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAATAAAACAAAGACTGCTCTTCAGGATGAGTGGATCCAGGATATTAGTGCTCTGGGGTCACAAAGAGCTCT
TCAAATGAATGGACCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGCCATGGTCAGCTTGTGTTTGAATACTTGGAGCGTGATCCACCATTTTGTCGTG
AACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAGACATATCGGAGCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCA
ATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTGGATGCTTGTTTCTTGAGCTTCCATTCTCTGTCAACGGCATTTCAAGGCATTGGTACCGATGGGATGCAACT
CCATTGGCCAAGAATTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAATATTTGGACTTGCTTGCTATAAGTTCAAAATTCCTTTTTGGAGTTCGG
CTGGTGCTGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAATATGCTGACGACTGGCTGAGGTCATTAAAAGTGAACCATCCTGATTACAGATTTTTCGCATCTCATAAT
TCATTCTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAACAGCAACAGCAACAGCAACA
GCAGCAGAAGCAGAGCGCCTTGGATTCCAAGGAGGTTGTCGCTGCTGCTGCTGCTAGGATCGATGACTTGGAGAAGAGGAGTGACTTTGATGAGTGTCGTTCTTGGTCCA
CTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGACTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCTCTTGTTCCGGCTCAATGTATTCCTAAGGTT
AATTGTTTTGTTGTGTGGACTTTGGTTAAGTACGTTATTGTGGTGGAAATTGATGTTGTTTGTATAAAGATTAGCCTTCGCACGAGCCAGAGGGGATGGAGAACTCGTGA
AGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCCGGTATCCCTCTATTGTTAAATGGTAGCGATTCTG
TAGTACAGTACTATGTTCCGTATCTGTCCGGCATTCAACTCTATGTTGATCCATCAAAGTCCTCTGCCCTAAGAAGGCGTGGTGCAGAGAGTGATGCTGAGTCCTCAAAG
GAAACCAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAATAAAACAAAGACTGCTCTTCAGGATGAGTGGATCCAGGATATTAGTGCTCTGGGGTCACAAAGAGCTCT
TCAAATGAATGGACCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGCCATGGTCAGCTTGTGTTTGAATACTTGGAGCGTGATCCACCATTTTGTCGTG
AACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAGACATATCGGAGCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCA
ATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTGGATGCTTGTTTCTTGAGCTTCCATTCTCTGTCAACGGCATTTCAAGGCATTGGTACCGATGGGATGCAACT
CCATTGGCCAAGAATTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAATATTTGGACTTGCTTGCTATAAGTTCAAAATTCCTTTTTGGAGTTCGG
CTGGTGCTGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAATATGCTGACGACTGGCTGAGGTCATTAAAAGTGAACCATCCTGATTACAGATTTTTCGCATCTCATAAT
TCATTCTGGAGATGA
Protein sequenceShow/hide protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQKQSALDSKEVVAAAAARIDDLEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAQCIPKV
NCFVVWTLVKYVIVVEIDVVCIKISLRTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALRRRGAESDAESSK
ETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYP
IYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLACYKFKIPFWSSAGAEECPKANSLWQYADDWLRSLKVNHPDYRFFASHN
SFWR