; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035417 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035417
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr3:21194030..21202298
RNA-Seq ExpressionLag0035417
SyntenyLag0035417
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo]3.5e-22588.17Show/hide
Query:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKM
        VS SISIS+ L F L FCFV+ QRF+LVCGLNYTY+++S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK 
Subjt:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKM

Query:  KKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
        K VKEN E   ERR GSGA  AFQTWRVNGTRCP+GT+PVRR+TV DVLR+KSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
Subjt:  KKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDI
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS SG+QYDI
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDI

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS VQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        NT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  NTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus]7.2e-22387.5Show/hide
Query:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPK
        VS SISIS+ L F L FCFV+ QRF+LVCGLNYTY + +S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK
Subjt:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPK

Query:  MKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA
         K  KEN E  SERR GSGA  +FQTWRVNGTRCP+GTVPVRR+TV DVLR+KSLFDFGKKKRPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKA
Subjt:  MKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA

Query:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYD
        TINVWDPSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS +G+QYD
Subjt:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYD

Query:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA
        ITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS+VQ+IS +A
Subjt:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA

Query:  ENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        ENT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  ENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

XP_022151674.1 uncharacterized protein LOC111019590 [Momordica charantia]8.5e-22488.73Show/hide
Query:  PLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE-
        PL  AL    V+ +RFSLV GLNYTY+QVS LRLDRIQRHLDNINKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWP++KK+KE NE 
Subjt:  PLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE-

Query:  ---GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
            GSE R GSGAGGA+QTWRVNGTRCP+G++PVRRSTV+DVLRAKSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIW
        PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF+G+QYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCY
        KDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSGRFA +GFG+ASYFRNLEIVDSDNSLSAVQEISTLAEN  CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCY

Query:  NIMSAYNDQWGTHFYYGGPGRNPECQ
        NIMS+YNDQWGTHFYYGGPGRNPECQ
Subjt:  NIMSAYNDQWGTHFYYGGPGRNPECQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]2.9e-22487Show/hide
Query:  VSISISI---SSP---LVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILT-------IQSPDGDIIDCVHKRKQPALDHPLL
        +SISISI   SSP   L FAL F     +LQRF+LVCGLNYTY+QVS LRL+RIQRHLD+INKP +LT       IQSPDGDIIDCVHKRKQPALDHPLL
Subjt:  VSISISI---SSP---LVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILT-------IQSPDGDIIDCVHKRKQPALDHPLL

Query:  KNHKIQRGPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAI
        KNHKIQRGP EWPK KKV EN EG S R  GSGAGGA QTWRVNGTRCP+G++PVRRSTV+DVLR+KSLFDFGKKKRPILLDRR DAPDVVSGNGHEHAI
Subjt:  KNHKIQRGPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAI

Query:  AYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGA
        AYT SSEEMYGAKATINVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGA
Subjt:  AYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGA

Query:  AISPISSFSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDS
        AISPISSFSG+QYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDS
Subjt:  AISPISSFSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDS

Query:  DNSLSAVQEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        DNSLSAVQ+IS +AENT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  DNSLSAVQEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]2.4e-22688.38Show/hide
Query:  VSISISI---SSP---LVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        +SISISI   SSP   L FAL F     +LQRF+LVCGLNYTY+QVS LRL+RIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  VSISISI---SSP---LVFALFF---CFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSE
        GP EWPK KKV EN EG S R  GSGAGGA QTWRVNGTRCP+G++PVRRSTV+DVLR+KSLFDFGKKKRPILLDRR DAPDVVSGNGHEHAIAYT SSE
Subjt:  GPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS
        EMYGAKATINVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISPISS
Subjt:  EMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS

Query:  FSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAV
        FSG+QYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLSAV
Subjt:  FSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAV

Query:  QEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        Q+IS +AENT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  QEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein3.5e-22387.5Show/hide
Query:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPK
        VS SISIS+ L F L FCFV+ QRF+LVCGLNYTY + +S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK
Subjt:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTY-RQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPK

Query:  MKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA
         K  KEN E  SERR GSGA  +FQTWRVNGTRCP+GTVPVRR+TV DVLR+KSLFDFGKKKRPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKA
Subjt:  MKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKA

Query:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYD
        TINVWDPSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS +G+QYD
Subjt:  TINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYD

Query:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA
        ITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS+VQ+IS +A
Subjt:  ITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLA

Query:  ENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        ENT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  ENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

A0A1S3B8R5 uncharacterized protein LOC1034872731.7e-22588.17Show/hide
Query:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKM
        VS SISIS+ L F L FCFV+ QRF+LVCGLNYTY+++S LRLDRIQRHLD+INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK 
Subjt:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKM

Query:  KKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
        K VKEN E   ERR GSGA  AFQTWRVNGTRCP+GT+PVRR+TV DVLR+KSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSSEEMYGAKAT
Subjt:  KKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDI
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS SG+QYDI
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDI

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSG F +DGF +ASYFRNLEIVDSDNSLS VQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ
        NT+CYNIMS+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  NTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ

A0A6J1DDR6 uncharacterized protein LOC1110195904.1e-22488.73Show/hide
Query:  PLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE-
        PL  AL    V+ +RFSLV GLNYTY+QVS LRLDRIQRHLDNINKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWP++KK+KE NE 
Subjt:  PLVFALFFCFVL-QRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE-

Query:  ---GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD
            GSE R GSGAGGA+QTWRVNGTRCP+G++PVRRSTV+DVLRAKSLFDFGKK+RPILLDR+ DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  ---GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIW
        PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF+G+QYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCY
        KDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSGRFA +GFG+ASYFRNLEIVDSDNSLSAVQEISTLAEN  CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCY

Query:  NIMSAYNDQWGTHFYYGGPGRNPECQ
        NIMS+YNDQWGTHFYYGGPGRNPECQ
Subjt:  NIMSAYNDQWGTHFYYGGPGRNPECQ

A0A6J1HAZ7 uncharacterized protein LOC1114616984.0e-21986.76Show/hide
Query:  LVFALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE
        L FALF   F  ++QRF LVCGLN++ +QVS LRLDRIQRHLD INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK KKVKEN E
Subjt:  LVFALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE

Query:  GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSI
           + R GSGAGG FQTWRVNGTRCP+G++PVRRSTV+DVLR KS+FD+GKKKRPILLDR+ DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWDPSI
Subjt:  GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDP
        QVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSFSG+QYDITILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIM
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLSAVQ+IS +AENT+CYNIM
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIM

Query:  SAYNDQWGTHFYYGGPGRNPECQ
        S+YNDQWGTHFYYGGPGRNP+CQ
Subjt:  SAYNDQWGTHFYYGGPGRNPECQ

A0A6J1JIQ1 uncharacterized protein LOC1114854604.4e-21886.73Show/hide
Query:  LVFALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE
        L FALF   F  ++QRF LVCGLN++ +QVS LRLDRIQRHLD INKP +LTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP EWPK KKVKEN E
Subjt:  LVFALF---FCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNE

Query:  GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSI
           + R GSGAGG FQTWRVNGTRCP+G++PVRRSTV+DVLRAKS+FD+GKKKRPILLDR+ DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWDPSI
Subjt:  GGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDP
        QVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSFSG+QYDITILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIM
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANGQHTSTQMGSG F +DGFG+ASYFRNLEIVDSDNSLS VQ+IS +AENT+CYNIM
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIM

Query:  SAYNDQWGTHFYYGGPGRNPEC
        S+YNDQWGTHFYYGGPGRNP+C
Subjt:  SAYNDQWGTHFYYGGPGRNPEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.4e-13158Show/hide
Query:  NYTYRQVSGL-RLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNG
        N T R +  L +L  I +HL  INKPSI TI SPDGDIIDCV    QPA DHP L+  K    P++ P  ++ + +N  G   +       +FQ W + G
Subjt:  NYTYRQVSGL-RLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSERRVGSGAGGAFQTWRVNG

Query:  TRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSD
          CP GTVP+RR+  +D+LRA S+  FGKK R    D         S NGHEHA+ Y  S E+ YGAKA+INVW P +Q   EFSLSQIWI+SGSF G+D
Subjt:  TRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSD

Query:  LNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAEL
        LN+IEAGWQVSPELYGD+ PR FTYWT+DAYQATGCYNLLC+GFVQTNS+IAIGAAISP SS+ G Q+DIT+LIWKDPK GNWW+ FG   LVGYWP+ L
Subjt:  LNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAEL

Query:  FTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPEC
        FTHL +HA+MV++GGE+VNS   G HTSTQMGSG FA +GF ++SYFRN+++VD DN+L     +  LA++ +CY+I    N  WG++FYYGGPG+NP+C
Subjt:  FTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPEC

AT1G55360.1 Protein of Unknown Function (DUF239)4.8e-13254.57Show/hide
Query:  LQRFSLVC-------GLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSERR
        L R  LVC        L+Y  R     +   +++HL+ +NKP++ +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P  + + ++N+  S  +
Subjt:  LQRFSLVC-------GLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSERR

Query:  VGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEF
             G   Q W   G +C  GT+P+RR+  DDVLRA S+  +GKKKR  +   ++  PD+++ +GH+HAIAY    ++ YGAKATINVW+P IQ  NEF
Subjt:  VGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEF

Query:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNWW
        SLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  +QYDI+ILIWKDPK G+WW
Subjt:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNWW

Query:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYNDQ
        M FG+  ++GYWP+ LF++L + A+M+EWGGEVVNS+++GQHTSTQMGSG+F  +GF +ASYFRN+++VD  N+L A + + T  E ++CY++ +  ND 
Subjt:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYNDQ

Query:  WGTHFYYGGPGRNPEC
        WG +FYYGGPG+N +C
Subjt:  WGTHFYYGGPGRNPEC

AT5G18460.1 Protein of Unknown Function (DUF239)1.9e-18170.37Show/hide
Query:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYT--YRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWP
        V++  S  S L+F L     L Q+   +   N T  YRQVS LRL RIQ+HL+ INK  + TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + P
Subjt:  VSISISISSPLVFALFFCFVL-QRFSLVCGLNYT--YRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWP

Query:  KMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAK
        KMK      +    +   +   GA+Q W VNGTRCP+GTVP+RR+T++DVLRAKSLFDFGKK+R I LD+RT+ PD +  NGHEHAIAYT SS E+YGAK
Subjt:  KMKKVKENNEGGSERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAK

Query:  ATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQY
        ATINVWDP I+ VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+F GNQ+
Subjt:  ATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQY

Query:  DITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTL
        DITILIWKDPK+GNWWMG GD+TLVGYWPAELFTHLADHAT VEWGGEVVN+RA+G+HT+TQMGSG F ++GFG+ASYFRNLE+VDSDNSL  V ++  L
Subjt:  DITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTL

Query:  AENTDCYNIMSAYNDQWGTHFYYGGPGRNPEC
        AENT+CY+I S+Y+++WGT+FYYGGPG NP C
Subjt:  AENTDCYNIMSAYNDQWGTHFYYGGPGRNPEC

AT5G56530.1 Protein of Unknown Function (DUF239)1.5e-13355.26Show/hide
Query:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSER
        F ++FCF     SL C    +  + +      + +HL+ +NKP++ +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP   P+        E     
Subjt:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSER

Query:  RVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNE
        +         Q W  NG  C  GT+PVRR+  +DVLRA S+  +GKKK   +   R+  PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NE
Subjt:  RVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNE

Query:  FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNW
        FSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYDI+I IWKDPK G+W
Subjt:  FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNW

Query:  WMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYND
        WM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+TQMGSG+F ++GF +ASYFRN+++VDS N+L   + ++T  E ++CY++    ND
Subjt:  WMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYND

Query:  QWGTHFYYGGPGRNPECQ
         WG +FYYGGPGRNP CQ
Subjt:  QWGTHFYYGGPGRNPECQ

AT5G56530.2 Protein of Unknown Function (DUF239)1.5e-13355.26Show/hide
Query:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSER
        F ++FCF     SL C    +  + +      + +HL+ +NKP++ +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP   P+        E     
Subjt:  FALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGGSER

Query:  RVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNE
        +         Q W  NG  C  GT+PVRR+  +DVLRA S+  +GKKK   +   R+  PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NE
Subjt:  RVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNE

Query:  FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNW
        FSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYDI+I IWKDPK G+W
Subjt:  FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNW

Query:  WMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYND
        WM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+TQMGSG+F ++GF +ASYFRN+++VDS N+L   + ++T  E ++CY++    ND
Subjt:  WMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYND

Query:  QWGTHFYYGGPGRNPECQ
         WG +FYYGGPGRNP CQ
Subjt:  QWGTHFYYGGPGRNPECQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAATTTCAATTTCAATTTCATCACCTCTTGTTTTTGCTTTGTTCTTCTGTTTTGTTCTTCAAAGATTCTCTTTGGTTTGTGGCCTCAATTATACTTATAGACA
AGTTAGTGGCTTGAGACTTGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTTCTATTCTCACCATTCAGAGCCCAGATGGTGATATTATAGATTGTGTTCATA
AAAGAAAACAGCCAGCTTTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAATGGAGTGGCCAAAAATGAAGAAGGTGAAAGAGAATAATGAAGGTGGG
AGTGAAAGGAGGGTGGGATCCGGTGCGGGTGGTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAGAGGGACTGTTCCAGTGCGACGCAGCACAGTGGATGA
TGTGTTAAGAGCCAAGTCTTTGTTTGACTTTGGGAAGAAGAAACGGCCGATCCTCCTCGATCGGCGAACGGACGCTCCAGATGTGGTTAGTGGGAATGGTCACGAGCATG
CGATCGCGTACACTGGATCATCGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCATCAATCCAAGTGGTCAACGAGTTCAGCCTCTCTCAGATTTGG
ATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACCTAGATTATTCACATATTGGAC
GAGTGACGCATATCAGGCAACGGGTTGCTACAATCTTTTATGCGCTGGATTTGTTCAAACAAACAGCAAAATCGCCATCGGAGCCGCCATTTCTCCCATCTCTTCTTTTT
CCGGCAACCAATATGACATCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAACACATTGGTGGGTTACTGGCCGGCGGAGCTG
TTCACTCACCTGGCCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGGCCAATGGCCAACATACTTCCACCCAAATGGGCTCCGGTCGATTCGC
CAACGACGGTTTTGGCCAGGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCGTCCAAGAGATCTCGACCTTGGCTGAGAACACCGATT
GTTACAACATTATGAGCGCCTACAACGATCAATGGGGCACTCACTTCTATTACGGCGGTCCTGGTAGAAACCCTGAATGTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCAATTTCAATTTCAATTTCATCACCTCTTGTTTTTGCTTTGTTCTTCTGTTTTGTTCTTCAAAGATTCTCTTTGGTTTGTGGCCTCAATTATACTTATAGACA
AGTTAGTGGCTTGAGACTTGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTTCTATTCTCACCATTCAGAGCCCAGATGGTGATATTATAGATTGTGTTCATA
AAAGAAAACAGCCAGCTTTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAATGGAGTGGCCAAAAATGAAGAAGGTGAAAGAGAATAATGAAGGTGGG
AGTGAAAGGAGGGTGGGATCCGGTGCGGGTGGTGCATTTCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAGAGGGACTGTTCCAGTGCGACGCAGCACAGTGGATGA
TGTGTTAAGAGCCAAGTCTTTGTTTGACTTTGGGAAGAAGAAACGGCCGATCCTCCTCGATCGGCGAACGGACGCTCCAGATGTGGTTAGTGGGAATGGTCACGAGCATG
CGATCGCGTACACTGGATCATCGGAAGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCATCAATCCAAGTGGTCAACGAGTTCAGCCTCTCTCAGATTTGG
ATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACCTAGATTATTCACATATTGGAC
GAGTGACGCATATCAGGCAACGGGTTGCTACAATCTTTTATGCGCTGGATTTGTTCAAACAAACAGCAAAATCGCCATCGGAGCCGCCATTTCTCCCATCTCTTCTTTTT
CCGGCAACCAATATGACATCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAACACATTGGTGGGTTACTGGCCGGCGGAGCTG
TTCACTCACCTGGCCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGGCCAATGGCCAACATACTTCCACCCAAATGGGCTCCGGTCGATTCGC
CAACGACGGTTTTGGCCAGGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCGTCCAAGAGATCTCGACCTTGGCTGAGAACACCGATT
GTTACAACATTATGAGCGCCTACAACGATCAATGGGGCACTCACTTCTATTACGGCGGTCCTGGTAGAAACCCTGAATGTCAATGA
Protein sequenceShow/hide protein sequence
MVSISISISSPLVFALFFCFVLQRFSLVCGLNYTYRQVSGLRLDRIQRHLDNINKPSILTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPMEWPKMKKVKENNEGG
SERRVGSGAGGAFQTWRVNGTRCPRGTVPVRRSTVDDVLRAKSLFDFGKKKRPILLDRRTDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWDPSIQVVNEFSLSQIW
ILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFSGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAEL
FTHLADHATMVEWGGEVVNSRANGQHTSTQMGSGRFANDGFGQASYFRNLEIVDSDNSLSAVQEISTLAENTDCYNIMSAYNDQWGTHFYYGGPGRNPECQ