CuGenDBv2

Lag0021438 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021438
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF789)
Genome location: chr7:7706113..7709617
RNA-Seq Expression: Lag0021438
Synteny: Lag0021438
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
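The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and the genomic span computed as follows. The names `Location` and `parse_location` are hypothetical helpers, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr7:7706113..7709617".
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'chrom:start..end' string into its three fields."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["chrom"], int(match["start"]), int(match["end"]))


loc = parse_location("chr7:7706113..7709617")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on the chromosome, so it is expected to be longer than the CDS whenever introns are present.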


Homology
GenBank top hits (e value, %identity, alignment)
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus] | e value: 5.4e-223 | %identity: 91.36
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ-----KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ     KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ-----KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLW

Query:  QYADDWLRSLNVNHPDYRFFASHNSFWR
        Q AD WLR LNVNHPDYRFFASHNSFWR
Subjt:  QYADDWLRSLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo] | e value: 4.9e-224 | %identity: 91.96
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ AD 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFFASHNSFWR
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo] | e value: 2.7e-222 | %identity: 91.73
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ AD 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFFASHNSFWR
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus] | e value: 3.0e-221 | %identity: 91.12
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ-----KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ     KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGLADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ-----KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST

Query:  NLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLW

Query:  QYADDWLRSLNVNHPDYRFFASHNSFWR
        Q AD WLR LNVNHPDYRFFASHNSFWR
Subjt:  QYADDWLRSLNVNHPDYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida] | e value: 3.5e-222 | %identity: 91.31
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ---KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ   KQS LDSKDV+ A+ A ID+LEKRS+FDECRSWSTRSDCSVSDRGLADSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ---KQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNL

Query:  DRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTS RGWR REV EA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS+LSRRRG DSDA S
Subjt:  DRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL
        SKETSSDGSSN GAE KTKTALQDEWIQD S  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL
Subjt:  SKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQY
        SPSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ 
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQY

Query:  ADDWLRSLNVNHPDYRFFASHNSFWR
        AD+WLR LNVNHPDYRFFASHNSFWR
Subjt:  ADDWLRSLNVNHPDYRFFASHNSFWR

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L5V4 Uncharacterized protein | e value: 3.2e-221 | %identity: 91.02
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQ KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGLADSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN GAE KTKTALQ+EWIQD +  GSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRF ELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ AD 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFFASHNSFWR
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X1 | e value: 2.4e-224 | %identity: 91.96
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ AD 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFFASHNSFWR
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X2 | e value: 1.3e-222 | %identity: 91.73
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ KQSALDSKDVVAA  + ID+LEKRS+FDECRSWSTRSDCSVSDRGL DSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LEHTTPLVPAHCIPKTS RGWR REVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSN GAE KTKTALQ+EWIQD +ALGSQRALQMN PS+ESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTA QG  TDG+Q HWPR+REVYTADCPLKLQLPIFGLASYKFKIPFW+STGAEEC KA+SLWQ AD 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFFASHNSFWR
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

A0A6J1GUB7 uncharacterized protein LOC111457542 isoform X1 | e value: 5.9e-215 | %identity: 89.18
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ--QKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLD
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ  QKQSALDSKD VAA +ARID+LEKRS+FDECRSWSTRSDCSVSDRGLADSTNLD
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQ--QKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLD

Query:  RFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS
        RFLEHTTPLV AHCIPKT  RGWRTREV EA PYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+ PSKSSALSRRRG DSDAESS
Subjt:  RFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESS

Query:  KETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLS
        KETSSDGSSNCGAE KTK  LQDE IQD S  GSQRALQMN PSAESSSDESDSCY HGQLVFEY+ERDPPFCREPLTDKITILASRFPELKTYRSCDLS
Subjt:  KETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLS

Query:  PSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYA
        PSSWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI +DG+Q  WPR+REVYTADCPLKLQLPIFGLASYKFK+PFW+STG EEC KA SLWQ A
Subjt:  PSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYA

Query:  DDWLRSLNVNHPDYRFFASHNSFWR
        + WLR LNVNHPDYRFF+SH+SF R
Subjt:  DDWLRSLNVNHPDYRFFASHNSFWR

A0A6J1IRT8 uncharacterized protein LOC111479394 isoform X1 | e value: 5.9e-215 | %identity: 89.13
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF
        MSVSGGVSIARIRGENRFYHPPAMRRRL   QQQQQQQQQQQ QQKQSALDSKDVVAA +ARID+LEKRS+FDECRSWSTRSDCSVSDRGLADSTNLDRF
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRF

Query:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE
        LE TTPLV AHCIPKT  RGWRTREVSEAPPYFVLGDLWES+KEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+ PSKSSALSRRRG DSDAESSKE
Subjt:  LEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKE

Query:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        TSSDGSSNCGAE KTK  LQDE IQD+S  GSQRALQMN PSAESSSDESDSCYRHGQLVFEY+ERDPPFCREPLTDKI ILASRFPELKTYRSCDLSPS
Subjt:  TSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD
        SWISVAWYPIYRIPTGPTLQSLDACFL+FH+LSTAFQGI +DG+Q  WPR+REVYTADCPLKLQLPIFGLASYKFK+PFW+STG EEC KA SLWQ A+ 
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADD

Query:  WLRSLNVNHPDYRFFASHNSFWR
        WLR LNVNHPDYRFF+SH+SF R
Subjt:  WLRSLNVNHPDYRFFASHNSFWR

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e value: 1.4e-83 | %identity: 51.68
Query:  ADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR
        A S+N++RFL+  TP VPAH + KT  R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y  VD   SS  +RR
Subjt:  ADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRR

Query:  RGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPE
        +G +S+++  +++SS+GSS   +E++       E    ISA   + +L+      +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPE
Subjt:  RGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPE

Query:  LKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEEC
        LKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFL++HSL T FQG G     +H  + RE        K++LP+FGLASYK +   W+S G    
Subjt:  LKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEEC

Query:  PKANSLWQYADDWLRSLNVNHPDYRFF
          ANSL+Q AD+WLR   VNHPD+ FF
Subjt:  PKANSLWQYADDWLRSLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789) | e value: 7.9e-79 | %identity: 47.08
Query:  RIDELEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRT-REVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L + +SD     S +        +     S+NLDRFLE  TP VPA  + KT  R  R   + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRT-REVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHG
         D V+QYYVP LS IQ+Y    +  S+L  RR  DS     +++SSD SS+  +E  +          D  +L  Q          +SSSD+ +     G
Subjt:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHG

Query:  QLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTD-GMQLHWPRIREVYT
        +L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFL++HSL T+F G G++  M L  PR  E   
Subjt:  QLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTD-GMQLHWPRIREVYT

Query:  ADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADDWLRSLNVNHPDYRFF
             K+ LP+FGLASYKF+   W+  G  E    NSL+Q AD WL S +V+HPD+ FF
Subjt:  ADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADDWLRSLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789) | e value: 2.8e-60 | %identity: 47.69
Query:  RIDELEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRT-REVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L + +SD     S +        +     S+NLDRFLE  TP VPA  + KT  R  R   + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEK-RSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRT-REVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHG
         D V+QYYVP LS IQ+Y    +  S+L  RR  DS     +++SSD SS+  +E  +          D  +L  Q          +SSSD+ +     G
Subjt:  -DSVVQYYVPYLSGIQLYV-DPSKSSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHG

Query:  QLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQG
        +L+FEYLERD P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFL++HSL T+F G
Subjt:  QLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 7.8e-95 | %identity: 49.41
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEH
        RIRGENRFY+PP M R+LQQ++++++ + ++ +++K+ A +  D       +++E E +   +EC    + SDCSV  R  + +T       NL RFL+ 
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADST-------NLDRFLEH

Query:  TTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSS
        TTP+V    +P TS +GWRTRE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++    RR G +SD +S ++ SS
Subjt:  TTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSS

Query:  DGSSNCG--AENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS
        DGS++C   ++N  + +L+++                  P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+TYRSCDLSPS
Subjt:  DGSSNCG--AENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPS

Query:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWS-STGAEECPKANSLWQYAD
        SW+SVAWYPIYRIP G +LQ+LDACFL+FHSLST  +G   +  Q      + V +A    KL LP FGLASYKFK+  WS  +  +E  +  +L + A+
Subjt:  SWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWS-STGAEECPKANSLWQYAD

Query:  DWLRSLNVNHPDYRFFASHN-SFWR
        +WLR L V  PD+R F SH+ S WR
Subjt:  DWLRSLNVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789) | e value: 6.2e-92 | %identity: 49.32
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ-QQQQQQQQQKQSALDSKDVVAATAA--------RIDELEKR---SDFDECRSWSTRSDCSV
        MS SGGVSIAR  IRGENRFY+PP MRR  Q+ Q QQQ +++Q++  + +  +D +   AAT A         + E + R   S  + C   S  S  S 
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQ-QQQQQQQQQKQSALDSKDVVAATAA--------RIDELEKR---SDFDECRSWSTRSDCSV

Query:  SDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSK
        S R L+D +NLDRFLEHTTP+VPA   P  S+   +TRE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVDP  
Subjt:  SDRGLADSTNLDRFLEHTTPLVPAHCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSK

Query:  SSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITI
           L + R    D     E SS+GSSN      ++T   D  + +++ +    +L+    +   SS E++     G+L+FEYLE +PPF REPL +KI+ 
Subjt:  SSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQDEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITI

Query:  LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTA--FQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPF
        LASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFL+FHSLSTA     +G    Q                KL LP FGLASYK K+  
Subjt:  LASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFHSLSTA--FQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPF

Query:  WSSTGAEECPKANSLWQYADDWLRSLNVNHPDYRFFASHN
        W+    +E  K  SL Q AD WL+ L V+HPDYRFF S++
Subjt:  WSSTGAEECPKANSLWQYADDWLRSLNVNHPDYRFFASHN
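
In the alignment blocks above, the unlabelled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. As a rough sketch, percent identity for a single block can be estimated from the query and middle lines alone. The function name `midline_identity` is a hypothetical helper, and the result covers only the block passed in, so it will generally differ from the whole-alignment %identity listed for each hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity over the aligned columns of one BLAST-style block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same number of columns")
    if not query:
        raise ValueError("empty alignment block")
    # A letter in the midline that equals the query residue marks an identity;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)


# Final block of the first GenBank hit, with the "Query:" label
# and the leading padding removed.
query = "QYADDWLRSLNVNHPDYRFFASHNSFWR"
midline = "Q AD WLR LNVNHPDYRFFASHNSFWR"
print(f"{midline_identity(query, midline):.2f}")
```

To approximate the per-hit figure, the same count would have to be accumulated over every block of that hit before dividing by the total number of aligned columns.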


Sequences
CDS sequence
ATGTCTGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAACAGCAACAGCA
ACAGCAGCAGCAGCAGCAGCAGCAGAAGCAGAGCGCCTTGGATTCCAAGGACGTTGTCGCTGCTACTGCTGCTAGGATCGATGAGTTGGAGAAGAGGAGTGACTTTGATG
AGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGACTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCTCTTGTTCCGGCT
CATTGTATTCCTAAGACGAGCCAGAGGGGATGGAGAACTCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGC
ATACGGAGCGGGTATCCCTCTATTGCTAAATGGTAGTGATTCTGTAGTACAGTACTATGTTCCGTATCTCTCCGGCATTCAACTCTATGTTGATCCATCAAAGTCCTCTG
CCCTAAGTAGAAGGCGTGGTGCAGATAGTGATGCTGAGTCCTCAAAGGAAACCAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAATAAAACAAAGACTGCTCTTCAG
GATGAGTGGATCCAGGATATTAGTGCTCTGGGGTCACAAAGAGCTCTTCAAATGAATGGACCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGCCATGG
TCAGCTTGTGTTTGAATACTTGGAGCGTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAGACATATAGGA
GCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTGGATGCTTGTTTCTTGAGCTTCCAT
TCTCTGTCAACGGCATTTCAAGGCATTGGTACCGATGGGATGCAACTCCATTGGCCAAGAATTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAAT
ATTTGGACTTGCTTCCTATAAGTTCAAAATTCCCTTTTGGAGTTCGACTGGTGCTGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAATATGCTGACGACTGGCTGAGGT
CATTAAACGTGAACCATCCTGATTACAGATTTTTTGCATCTCATAATTCATTCTGGAGATGA
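
The CDS above can be checked against the listed protein sequence by translating it codon by codon. The sketch below assumes the standard genetic code (the page does not say which table was used) and uses a hypothetical helper named `translate`. Only the first 24 and last 12 nucleotides of the CDS are used as inputs here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


print(translate("ATGTCTGTCTCCGGTGGGGTTTCG"))  # start of the CDS
print(translate("TTCTGGAGATGA"))              # end of the CDS, including the stop
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence followed by a trailing `*` for the stop codon.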
mRNA sequence
ATGTCTGTCTCCGGTGGGGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAACAGCAACAGCA
ACAGCAGCAGCAGCAGCAGCAGCAGAAGCAGAGCGCCTTGGATTCCAAGGACGTTGTCGCTGCTACTGCTGCTAGGATCGATGAGTTGGAGAAGAGGAGTGACTTTGATG
AGTGTCGTTCTTGGTCCACTCGCTCTGATTGCTCTGTTTCGGATCGTGGACTTGCTGACTCTACTAATTTGGATCGGTTCTTGGAGCACACTACTCCTCTTGTTCCGGCT
CATTGTATTCCTAAGACGAGCCAGAGGGGATGGAGAACTCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAGTGC
ATACGGAGCGGGTATCCCTCTATTGCTAAATGGTAGTGATTCTGTAGTACAGTACTATGTTCCGTATCTCTCCGGCATTCAACTCTATGTTGATCCATCAAAGTCCTCTG
CCCTAAGTAGAAGGCGTGGTGCAGATAGTGATGCTGAGTCCTCAAAGGAAACCAGCAGTGATGGAAGCAGTAATTGTGGGGCAGAAAATAAAACAAAGACTGCTCTTCAG
GATGAGTGGATCCAGGATATTAGTGCTCTGGGGTCACAAAGAGCTCTTCAAATGAATGGACCTTCTGCTGAGTCATCAAGTGATGAAAGTGACTCTTGCTACCGCCATGG
TCAGCTTGTGTTTGAATACTTGGAGCGTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGAAGACATATAGGA
GCTGTGATTTATCTCCTTCCAGTTGGATTTCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTGGATGCTTGTTTCTTGAGCTTCCAT
TCTCTGTCAACGGCATTTCAAGGCATTGGTACCGATGGGATGCAACTCCATTGGCCAAGAATTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTACAGTTGCCAAT
ATTTGGACTTGCTTCCTATAAGTTCAAAATTCCCTTTTGGAGTTCGACTGGTGCTGAGGAATGTCCGAAGGCCAACTCTTTGTGGCAATATGCTGACGACTGGCTGAGGT
CATTAAACGTGAACCATCCTGATTACAGATTTTTTGCATCTCATAATTCATTCTGGAGATGA
Protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQKQSALDSKDVVAATAARIDELEKRSDFDECRSWSTRSDCSVSDRGLADSTNLDRFLEHTTPLVPA
HCIPKTSQRGWRTREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSNCGAENKTKTALQ
DEWIQDISALGSQRALQMNGPSAESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLSFH
SLSTAFQGIGTDGMQLHWPRIREVYTADCPLKLQLPIFGLASYKFKIPFWSSTGAEECPKANSLWQYADDWLRSLNVNHPDYRFFASHNSFWR