CuGenDBv2

Spg029538 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029538
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: scaffold2:22117149..22126848
RNA-Seq Expression: Spg029538
Synteny: Spg029538
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide


Homology
GenBank top hits (e value, % identity, alignment)
XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo] (e value: 2.5e-223; % identity: 87.24)
Query:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM
        +  SISIS+   F L FCFV+ QRF+LVCGLNYTY+++S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK 
Subjt:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM

Query:  KKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
        K VKEN E+  ERR GSGA  AFQTWRVNGTRCP+G+IPVRR+TV DVLR+KSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
Subjt:  KKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDI
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISP+SS SG QYDI
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDI

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS VQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        NTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  NTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus] (e value: 9.0e-221; % identity: 86.11)
Query:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPK
        +  SISIS+   F L FCFV+ QRF+LVCGLNYTY + +S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK
Subjt:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPK

Query:  MKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA
         K  KEN E+ SERR GSGA  +FQTWRVNGTRCP+G++PVRR+TV DVLR+KSLFDFGKKKRPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKA
Subjt:  MKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA

Query:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYD
        TINVWDPSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISP+SS +G QYD
Subjt:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYD

Query:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA
        ITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS+VQ+IS +A
Subjt:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA

Query:  ENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        ENTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  ENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

XP_022151674.1 uncharacterized protein LOC111019590 [Momordica charantia] (e value: 1.9e-223; % identity: 88.73)
Query:  PFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNE-
        P   AL    V+ +RFSLV GLNYTY+QVS LRLDRIQRHLDNINKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWP++KK+KE NE 
Subjt:  PFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNE-

Query:  ---DGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
           DGSE R GSGAGGA+QTWRVNGTRCP+GSIPVRRSTV+DVLRAKSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---DGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIW
        PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISP+SSF+G QYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCY
        KDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSGRFA +GFG+ASYFRNLEIVDSDNSLSAVQEISTLAEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCY

Query:  NIMSAYNDQWGTHFYYGGPGRNSECQ
        NIMS+YNDQWGTHFYYGGPGRN ECQ
Subjt:  NIMSAYNDQWGTHFYYGGPGRNSECQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida] (e value: 3.6e-222; % identity: 86.55)
Query:  IMVSISI---SSP---FVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILT-------IQSPDGDIIDCVHKRKQPALDHPLL
        I +SISI   SSP     FAL F     +LQRF+LVCGLNYTY+QVS LRL+RIQRHLD+INKP +LT       IQSPDGDIIDCVHKRKQPALDHPLL
Subjt:  IMVSISI---SSP---FVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILT-------IQSPDGDIIDCVHKRKQPALDHPLL

Query:  KNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAI
        KNHKIQRGP EWPK KKV EN E  S R  GSGAGGA QTWRVNGTRCP+GSIPVRRSTV+DVLR+KSLFDFGKKKRPILLDRR DAPDVVSGNGHEHAI
Subjt:  KNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAI

Query:  AYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGA
        AYT SSEEMYGAKATINVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGA
Subjt:  AYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGA

Query:  AISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDS
        AISP+SSFSG QYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDS
Subjt:  AISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDS

Query:  DNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        DNSLSAVQ+IS +AENTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  DNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida] (e value: 3.0e-224; % identity: 87.93)
Query:  IMVSISI---SSP---FVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        I +SISI   SSP     FAL F     +LQRF+LVCGLNYTY+QVS LRL+RIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  IMVSISI---SSP---FVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSE
        GP EWPK KKV EN E  S R  GSGAGGA QTWRVNGTRCP+GSIPVRRSTV+DVLR+KSLFDFGKKKRPILLDRR DAPDVVSGNGHEHAIAYT SSE
Subjt:  GPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSS
        EMYGAKATINVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISP+SS
Subjt:  EMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSS

Query:  FSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAV
        FSG QYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLSAV
Subjt:  FSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAV

Query:  QEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        Q+IS +AENTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  QEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXY9 Uncharacterized protein (e value: 4.3e-221; % identity: 86.11)
Query:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPK
        +  SISIS+   F L FCFV+ QRF+LVCGLNYTY + +S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK
Subjt:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPK

Query:  MKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA
         K  KEN E+ SERR GSGA  +FQTWRVNGTRCP+G++PVRR+TV DVLR+KSLFDFGKKKRPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKA
Subjt:  MKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA

Query:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYD
        TINVWDPSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISP+SS +G QYD
Subjt:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYD

Query:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA
        ITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS+VQ+IS +A
Subjt:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA

Query:  ENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        ENTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  ENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

A0A1S3B8R5 uncharacterized protein LOC103487273 (e value: 1.2e-223; % identity: 87.24)
Query:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM
        +  SISIS+   F L FCFV+ QRF+LVCGLNYTY+++S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK 
Subjt:  IMVSISISSPFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM

Query:  KKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
        K VKEN E+  ERR GSGA  AFQTWRVNGTRCP+G+IPVRR+TV DVLR+KSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
Subjt:  KKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDI
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISP+SS SG QYDI
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDI

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS VQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
        NTNCYNIMS+YNDQWGTHFYYGGPGRN +CQ
Subjt:  NTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ

A0A6J1DDR6 uncharacterized protein LOC111019590 (e value: 9.4e-224; % identity: 88.73)
Query:  PFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNE-
        P   AL    V+ +RFSLV GLNYTY+QVS LRLDRIQRHLDNINKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWP++KK+KE NE 
Subjt:  PFVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNE-

Query:  ---DGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
           DGSE R GSGAGGA+QTWRVNGTRCP+GSIPVRRSTV+DVLRAKSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---DGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIW
        PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISP+SSF+G QYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCY
        KDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSGRFA +GFG+ASYFRNLEIVDSDNSLSAVQEISTLAEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCY

Query:  NIMSAYNDQWGTHFYYGGPGRNSECQ
        NIMS+YNDQWGTHFYYGGPGRN ECQ
Subjt:  NIMSAYNDQWGTHFYYGGPGRNSECQ

A0A6J1HAZ7 uncharacterized protein LOC111461698 (e value: 8.2e-220; % identity: 87.89)
Query:  FALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDG
        FALF   F  ++QRF LVCGLN++ +QVS LRLDRIQRHLD INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK KKVKEN ED 
Subjt:  FALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDG

Query:  SERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQV
         + R GSGAGG FQTWRVNGTRCP+GSIPVRRSTV+DVLR KS+FD+GKKKRPILLDR+ DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWDPSIQV
Subjt:  SERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQV

Query:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKL
        VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISPVSSFSG QYDITILIWKDPKL
Subjt:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKL

Query:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSA
        G+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLSAVQ+IS +AENTNCYNIMS+
Subjt:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSA

Query:  YNDQWGTHFYYGGPGRNSECQ
        YNDQWGTHFYYGGPGRN +CQ
Subjt:  YNDQWGTHFYYGGPGRNSECQ

A0A6J1JIQ1 uncharacterized protein LOC111485460 (e value: 9.1e-219; % identity: 87.86)
Query:  FALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDG
        FALF   F  ++QRF LVCGLN++ +QVS LRLDRIQRHLD INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK KKVKEN ED 
Subjt:  FALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDG

Query:  SERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQV
         + R GSGAGG FQTWRVNGTRCP+GSIPVRRSTV+DVLRAKS+FD+GKKKRPILLDR+ DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWDPSIQV
Subjt:  SERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQV

Query:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKL
        VNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISPVSSFSG QYDITILIWKDPKL
Subjt:  VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKL

Query:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSA
        G+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLS VQ+IS +AENTNCYNIMS+
Subjt:  GNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSA

Query:  YNDQWGTHFYYGGPGRNSEC
        YNDQWGTHFYYGGPGRN +C
Subjt:  YNDQWGTHFYYGGPGRNSEC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) (e value: 5.8e-133; % identity: 54.37)
Query:  LQRFSLVC-------GLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWP-------KMKKVKENN
        L R  LVC        L+Y  R     +   +++HL+ +NKP++ +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P       K+   K N 
Subjt:  LQRFSLVC-------GLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWP-------KMKKVKENN

Query:  EDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPS
        ++G             Q W   G +C  G+IP+RR+  DDVLRA S+  +GKKKR  +   ++  PD+++ +GH+HAIAY    ++ YGAKATINVW+P 
Subjt:  EDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPS

Query:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKD
        IQ  NEFSLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISPVS +   QYDI+ILIWKD
Subjt:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKD

Query:  PKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNI
        PK G+WWM FG+  ++GYWP+ LF++L + A+M+EWGGEVVNS+++GQHTSTQMGSG+F  +GF +ASYFRN+++VD  N+L A + + T  E +NCY++
Subjt:  PKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNI

Query:  MSAYNDQWGTHFYYGGPGRNSEC
         +  ND WG +FYYGGPG+N +C
Subjt:  MSAYNDQWGTHFYYGGPGRNSEC

AT3G13510.1 Protein of Unknown Function (DUF239) (e value: 2.7e-130; % identity: 56.88)
Query:  IQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTV
        +++HL+ +NKP + TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P   P  + + ++N+  +E +         Q W   G +C  G+IP+RR+  
Subjt:  IQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTV

Query:  DDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELY
        DDVLRA S+  +GKKK   +   ++  PD+++ NGH+HAIAY    ++ YGAKAT+NVW+P IQ  NEFSLSQIW+L GSF G DLNSIEAGWQVSP+LY
Subjt:  DDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELY

Query:  GDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGG
        GD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISPVS +   QYDI+ILIWKDPK G+WWM FG+  ++GYWP+ LF++L + A+M+EWGG
Subjt:  GDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGG

Query:  EVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSEC
        EVVNS++ G HT TQMGSG F  +GF +ASYFRN+++VD  N+L A + + T  E +NCY++ +  ND WG +FYYGGPG+N  C
Subjt:  EVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSEC

AT5G18460.1 Protein of Unknown Function (DUF239) (e value: 1.7e-180; % identity: 73.48)
Query:  YRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCP
        YRQVS LRL RIQ+HL+ INK  + TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P++ PKMK      +D   +   +   GA+Q W VNGTRCP
Subjt:  YRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKVKENNEDGSERRVGSGAGGAFQTWRVNGTRCP

Query:  RGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSI
        +G++P+RR+T++DVLRAKSLFDFGKK+R I LD+RT+ PD +  NGHEHAIAYT SS E+YGAKATINVWDP I+ VNEFSLSQIWILSGSF G DLNSI
Subjt:  RGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSI

Query:  EAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL
        EAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+F G+Q+DITILIWKDPK+GNWWMG GD+TLVGYWPAELFTHL
Subjt:  EAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL

Query:  ADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSEC
        ADHAT VEWGGEVVN+RA+G+HT+TQMGSG F ++GFG+ASYFRNLE+VDSDNSL  V ++  LAENT CY+I S+Y+++WGT+FYYGGPG N  C
Subjt:  ADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSEC

AT5G56530.1 Protein of Unknown Function (DUF239) (e value: 2.0e-133; % identity: 55.45)
Query:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM----KKVKENNED
        F ++FCF     SL C    +  + +      + +HL+ +NKP++ +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP   P+      KV E  ++
Subjt:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM----KKVKENNED

Query:  GSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQ
                      Q W  NG  C  G+IPVRR+  +DVLRA S+  +GKKK   +   R+  PD+++ +GH+HAIAY     + YGAKATINVW+P +Q
Subjt:  GSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQ

Query:  VVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPK
          NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISPVS F   QYDI+I IWKDPK
Subjt:  VVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPK

Query:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMS
         G+WWM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+TQMGSG+F ++GF +ASYFRN+++VDS N+L   + ++T  E +NCY++  
Subjt:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMS

Query:  AYNDQWGTHFYYGGPGRNSECQ
          ND WG +FYYGGPGRN  CQ
Subjt:  AYNDQWGTHFYYGGPGRNSECQ

AT5G56530.2 Protein of Unknown Function (DUF239) (e value: 2.0e-133; % identity: 55.45)
Query:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM----KKVKENNED
        F ++FCF     SL C    +  + +      + +HL+ +NKP++ +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP   P+      KV E  ++
Subjt:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKM----KKVKENNED

Query:  GSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQ
                      Q W  NG  C  G+IPVRR+  +DVLRA S+  +GKKK   +   R+  PD+++ +GH+HAIAY     + YGAKATINVW+P +Q
Subjt:  GSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQ

Query:  VVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPK
          NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISPVS F   QYDI+I IWKDPK
Subjt:  VVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPK

Query:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMS
         G+WWM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+TQMGSG+F ++GF +ASYFRN+++VDS N+L   + ++T  E +NCY++  
Subjt:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMS

Query:  AYNDQWGTHFYYGGPGRNSECQ
          ND WG +FYYGGPGRN  CQ
Subjt:  AYNDQWGTHFYYGGPGRNSECQ


Sequences
CDS sequence
ATGAAGTTTGTGAGAGAGAAAATAATTATGGTTTCAATTTCAATTTCATCACCTTTTGTTTTTGCTTTGTTCTTCTGTTTTGTTCTTCAAAGATTCTCTTTGGTTTGTGG
CCTCAATTATACTTATAGACAAGTTAGTGGCTTGAGACTTGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTTCTATTCTCACCATTCAGAGCCCAGATGGTG
ATATTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTTTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAAGGGAGTGGCCAAAGATGAAGAAGGTG
AAAGAGAATAATGAAGATGGGAGTGAAAGGAGGGTGGGATCCGGTGCGGGTGGTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAGAGGGAGTATTCCAGT
GCGACGCAGCACAGTGGATGATGTGTTAAGAGCCAAGTCTTTGTTTGACTTTGGAAAGAAGAAACGGCCGATCCTCCTCGATCGGCGAACGGACGCTCCAGATGTGGTTA
GTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCATCGATCCAAGTGGTCAACGAG
TTCAGCCTCTCACAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACC
TAGATTATTCACATATTGGACGAGTGACGCATATCAGGCAACGGGTTGCTACAATCTTTTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCA
TTTCTCCCGTCTCTTCTTTTTCCGGCGACCAATATGACATCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAACACATTGGTG
GGTTACTGGCCGGCGGAGCTGTTCACTCACCTGGCCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGGCCAATGGCCAACACACTTCCACCCA
AATGGGCTCCGGCCGCTTCGCCAACGACGGTTTCGGCCAAGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCGTCCAAGAGATCTCGA
CCTTGGCTGAGAACACCAATTGTTACAACATTATGAGCGCCTACAACGATCAATGGGGTACTCACTTCTACTACGGCGGTCCTGGTAGAAACTCTGAATGTCAATGA
mRNA sequence
ATGAAGTTTGTGAGAGAGAAAATAATTATGGTTTCAATTTCAATTTCATCACCTTTTGTTTTTGCTTTGTTCTTCTGTTTTGTTCTTCAAAGATTCTCTTTGGTTTGTGG
CCTCAATTATACTTATAGACAAGTTAGTGGCTTGAGACTTGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTTCTATTCTCACCATTCAGAGCCCAGATGGTG
ATATTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTTTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAAGGGAGTGGCCAAAGATGAAGAAGGTG
AAAGAGAATAATGAAGATGGGAGTGAAAGGAGGGTGGGATCCGGTGCGGGTGGTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAGAGGGAGTATTCCAGT
GCGACGCAGCACAGTGGATGATGTGTTAAGAGCCAAGTCTTTGTTTGACTTTGGAAAGAAGAAACGGCCGATCCTCCTCGATCGGCGAACGGACGCTCCAGATGTGGTTA
GTGGGAATGGTCACGAGCATGCGATCGCGTACACTGGATCATCGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCATCGATCCAAGTGGTCAACGAG
TTCAGCCTCTCACAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACC
TAGATTATTCACATATTGGACGAGTGACGCATATCAGGCAACGGGTTGCTACAATCTTTTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCCGCCA
TTTCTCCCGTCTCTTCTTTTTCCGGCGACCAATATGACATCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAACACATTGGTG
GGTTACTGGCCGGCGGAGCTGTTCACTCACCTGGCCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGGCCAATGGCCAACACACTTCCACCCA
AATGGGCTCCGGCCGCTTCGCCAACGACGGTTTCGGCCAAGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCGTCCAAGAGATCTCGA
CCTTGGCTGAGAACACCAATTGTTACAACATTATGAGCGCCTACAACGATCAATGGGGTACTCACTTCTACTACGGCGGTCCTGGTAGAAACTCTGAATGTCAATGA
Protein sequence
MKFVREKIIMVSISISSPFVFALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPREWPKMKKV
KENNEDGSERRVGSGAGGAFQTWRVNGTRCPRGSIPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNE
FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPVSSFSGDQYDITILIWKDPKLGNWWMGFGDNTLV
GYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTNCYNIMSAYNDQWGTHFYYGGPGRNSECQ
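
The CDS and protein sequences listed for Spg029538 can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    can be pasted as-is. Codons not in the table (e.g. containing N)
    are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGAAGTTTGTGAGAGAGAAAATAATTATGGTTTCA"))  # MKFVREKIIMVS
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence (with line breaks removed) is one way to confirm that the two records agree.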