CuGenDBv2

MC06g1200 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1200
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: MC06:15543175..15551904
RNA-Seq Expression: MC06g1200
Synteny: MC06g1200
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin; IPR025521 - Neprosin activation peptide
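
The genome location field can be split into a chromosome name and a coordinate pair. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("MC06:15543175..15551904")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the gene region spans 8730 bp on MC06, including introns and untranslated regions.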


Homology
GenBank top hits | e value | % identity | Alignment
KAG6590299.1 hypothetical protein SDJN03_15722, partial [Cucurbita argyrosperma subsp. sororia] | 4.48e-278 | 86.38
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLRAKS+FD+GKK+RPILLDRQ+DAPD+VSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

XP_022151674.1 uncharacterized protein LOC111019590 [Momordica charantia] | 0.0 | 100
Query:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
        PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
Subjt:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE

Query:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
        MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
Subjt:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF

Query:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
        TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
Subjt:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ

Query:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

XP_022961080.1 uncharacterized protein LOC111461698 [Cucurbita moschata] | 6.36e-278 | 86.38
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLR KS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida] | 1.90e-279 | 86.08
Query:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQ-------SPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMK
        AL  +L++++ +RF+LV GLNYTYKQVSSLRL+RIQRHLD+INKPPLLTIQ       SPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+ 
Subjt:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQ-------SPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMK

Query:  EKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKAT
        E  E + G G+    GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYT SS+EMYGAKAT
Subjt:  EKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDV
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISPISSF+GSQYD+
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDV

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        N +CYNIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida] | 2.65e-282 | 87.5
Query:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKD
        AL  +L++++ +RF+LV GLNYTYKQVSSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+ E  E + 
Subjt:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKD

Query:  GDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPS
        G G+    GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYT SS+EMYGAKATINVWDPS
Subjt:  GDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPS

Query:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKD
        IQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISPISSF+GSQYD+TILIWKD
Subjt:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKD

Query:  PKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNI
        PKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CYNI
Subjt:  PKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNI

Query:  MSSYNDQWGTHFYYGGPGRNPECQ
        MSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  MSSYNDQWGTHFYYGGPGRNPECQ

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LXY9 Uncharacterized protein | 1.03e-270 | 85.61
Query:  VVFERFSLVFGLNYTY-KQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGR
        V+ +RF+LV GLNYTY K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K  KE  E    + SE R
Subjt:  VVFERFSLVFGLNYTY-KQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGR

Query:  RGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEF
         GSGA  ++QTWRVNGTRCPKG++PVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWDPSI++VNEF
Subjt:  RGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEF

Query:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWW
        SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS  GSQYD+TILIWKDPKLGNWW
Subjt:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWW

Query:  MGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQ
        MGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS+VQ+IS +AEN +CYNIMSSYNDQ
Subjt:  MGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQ

Query:  WGTHFYYGGPGRNPECQ
        WGTHFYYGGPGRNP+CQ
Subjt:  WGTHFYYGGPGRNPECQ

A0A1S3B8R5 uncharacterized protein LOC103487273 | 4.25e-274 | 86.78
Query:  VVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRR
        VV +RF+LV GLNYTY+++SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K +KE  E    +  E R 
Subjt:  VVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRR

Query:  GSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFS
        GSGA  A+QTWRVNGTRCPKG+IPVRR+TV DVLR+KSLFDFGKKRRPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWDPSIQ+VNEFS
Subjt:  GSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFS

Query:  LSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWM
        LSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS +GSQYD+TILIWKDPKLGNWWM
Subjt:  LSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWM

Query:  GFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQW
        GFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS VQ+IS +AEN +CYNIMSSYNDQW
Subjt:  GFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQW

Query:  GTHFYYGGPGRNPECQ
        GTHFYYGGPGRNP+CQ
Subjt:  GTHFYYGGPGRNPECQ

A0A6J1DDR6 uncharacterized protein LOC111019590 | 0.0 | 100
Query:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
        PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
Subjt:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE

Query:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
        MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
Subjt:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF

Query:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
        TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
Subjt:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ

Query:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1HAZ7 uncharacterized protein LOC111461698 | 3.08e-278 | 86.38
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLR KS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1JIQ1 uncharacterized protein LOC111485460 | 1.46e-276 | 86.12
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL  +L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLRAKS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLS VQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPEC
        NIMSSYNDQWGTHFYYGGPGRNP+C
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPEC

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G55360.1 Protein of Unknown Function (DUF239) | 1.1e-131 | 55.17
Query:  FGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQ
        F L+Y  +   S +   +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P  + + + N+V     S  +     G   Q
Subjt:  FGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQ

Query:  TWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSG
         W   G +C +G+IP+RR+  +DVLRA S+  +GKK+R  +   +   PD+++ +GH+HAIAY     + YGAKATINVW+P IQ  NEFSLSQIW+L G
Subjt:  TWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSG

Query:  SFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVG
        SF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYD++ILIWKDPK G+WWM FG+  ++G
Subjt:  SFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVG

Query:  YWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGP
        YWP+ LF++L + A+M+EWGGEVVNS++ G+HTSTQMGSG+F EEGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND WG +FYYGGP
Subjt:  YWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGP

Query:  GRNPEC
        G+N +C
Subjt:  GRNPEC

AT3G13510.1 Protein of Unknown Function (DUF239) | 4.1e-131 | 56.17
Query:  SSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK-DGDGSEGRRGSGAGGAWQTWRVNGTRC
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P  + + + N+V  +  G E           Q W   G +C
Subjt:  SSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK-DGDGSEGRRGSGAGGAWQTWRVNGTRC

Query:  PKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNS
         +G+IP+RR+  +DVLRA S+  +GKK+   +   +   PD+++ NGH+HAIAY     + YGAKAT+NVW+P IQ  NEFSLSQIW+L GSF G DLNS
Subjt:  PKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNS

Query:  IEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTH
        IEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYD++ILIWKDPK G+WWM FG+  ++GYWP+ LF++
Subjt:  IEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTH

Query:  LVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPEC
        L + A+M+EWGGEVVNS++ G HT TQMGSG F EEGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND WG +FYYGGPG+N  C
Subjt:  LVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPEC

AT5G18460.1 Protein of Unknown Function (DUF239) | 7.4e-181 | 71.09
Query:  LALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDG
        L ++L L    R    F     Y+QVSSLRL RIQ+HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + P++K        KD 
Subjt:  LALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDG

Query:  DGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSI
        D  E    +   GAWQ W VNGTRCPKG++P+RR+T+NDVLRAKSLFDFGKKRR I LD++ + PD +  NGHEHAIAYT SS E+YGAKATINVWDP I
Subjt:  DGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDP
        + VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+F G+Q+D+TILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIM
        K+GNWWMG GD+TLVGYWPAELFTHL DHAT VEWGGEVVN++A G HT+TQMGSG F +EGFGKASYFRNLE+VDSDNSL  V ++  LAEN  CY+I 
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIM

Query:  SSYNDQWGTHFYYGGPGRNPEC
        SSY+++WGT+FYYGGPG NP C
Subjt:  SSYNDQWGTHFYYGGPGRNPEC

AT5G56530.1 Protein of Unknown Function (DUF239) | 1.7e-132 | 57.8
Query:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P     + K   K  +              Q W  NG  C +G+IPVR
Subjt:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR

Query:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV
        R+   DVLRA S+  +GKK+   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQV
Subjt:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV

Query:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM
        SP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYD++I IWKDPK G+WWM FGD  ++GYWP+ LF++L D A++
Subjt:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM

Query:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        VEWGGEVVN +  G HT+TQMGSG+F +EGF KASYFRN+++VDS N+L   + ++T  E ++CY++    ND WG +FYYGGPGRNP CQ
Subjt:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

AT5G56530.2 Protein of Unknown Function (DUF239) | 1.7e-132 | 57.8
Query:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P     + K   K  +              Q W  NG  C +G+IPVR
Subjt:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR

Query:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV
        R+   DVLRA S+  +GKK+   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQV
Subjt:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV

Query:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM
        SP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYD++I IWKDPK G+WWM FGD  ++GYWP+ LF++L D A++
Subjt:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM

Query:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        VEWGGEVVN +  G HT+TQMGSG+F +EGF KASYFRN+++VDS N+L   + ++T  E ++CY++    ND WG +FYYGGPGRNP CQ
Subjt:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
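
In the alignments above, the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below counts these for one block. The function name `midline_stats` is hypothetical, and the example uses only the final block of the KAG6590299.1 alignment, so its percentage is not the whole-alignment % identity reported in the table.

```python
def midline_stats(query, midline):
    """Return (identical, positives, aligned_length) for one alignment block."""
    # Pad the midline in case trailing spaces were trimmed during extraction.
    midline = midline.ljust(len(query))
    # A midline letter equal to the query residue marks an identical position.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # Any non-space midline character (a letter or '+') counts as a positive.
    positives = sum(1 for m in midline[:len(query)] if m != " ")
    return identical, positives, len(query)

query = "NIMSSYNDQWGTHFYYGGPGRNPECQ"
midline = "NIMSSYNDQWGTHFYYGGPGRNP+CQ"
identical, positives, length = midline_stats(query, midline)
print(identical, positives, length, round(100 * identical / length, 2))
```

For this 26-residue block, 25 positions are identical and the remaining one is a conservative substitution. Summing the counts over all blocks of a hit would be needed to approximate the tabulated % identity.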


Sequences
CDS sequence
ATGGGGTGTGGAAAGAGATTTTTTTTGAGCCCTCCTCCTCTGGCTCTTGCTCTTGCTCTTGTTCTTGTTGTTTTTGAAAGATTCAGTCTGGTTTTTGGGCTGAATTATAC
ATATAAACAAGTCAGCAGCTTGAGATTGGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACCATTCAGAGCCCTGATGGTGATATTATAGATT
GTGTTCATAAACGAAAACAGCCGGCTCTTGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAATGGCCAAGAGTGAAGAAGATGAAGGAGAAGAAT
GAAGTAAAGGATGGCGATGGCAGTGAAGGGAGGCGGGGATCAGGTGCGGGAGGTGCATGGCAAACTTGGCGTGTGAACGGAACACGGTGTCCGAAGGGGAGTATTCCAGT
GCGACGGAGCACAGTGAACGACGTGCTACGAGCCAAGTCTTTGTTTGACTTTGGCAAGAAACGACGGCCGATTCTTCTTGACCGCCAAATGGACGCTCCTGATGTGGTCA
GCGGGAATGGTCACGAGCATGCGATCGCGTACACAGGATCCTCGCAGGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATCCAAGTGGTCAACGAG
TTCAGCCTCTCCCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGGGACAGCAGACC
AAGATTATTCACATATTGGACGAGTGACGCGTATCAGGCAACGGGGTGCTACAACCTTCTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCGGCGA
TTTCTCCCATCTCTTCATTTACCGGCAGCCAATATGACGTCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAATACTTTGGTA
GGGTATTGGCCGGCGGAACTGTTCACTCACCTCGTCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAAGGCCAGGGGTGAGCACACCTCCACCCA
AATGGGCTCCGGTCGTTTCGCTGAGGAGGGCTTTGGCAAGGCTAGCTACTTTCGAAACCTCGAGATCGTTGACTCCGACAATAGCCTTAGTGCTGTCCAAGAAATCTCAA
CGTTGGCGGAGAACAACCATTGCTACAATATTATGAGCTCCTACAATGACCAATGGGGCACCCACTTCTACTACGGCGGCCCTGGAAGAAATCCTGAATGCCAATGA
mRNA sequence
TGTTTCAGTTTGGTAGGTTTCAAAATTTTTATTAAATCTCTCTCTCTCTCTCCCTTCTCTCTCTCTCTCTCTTCTTTTTCTTTTTTCTTCCCATTCATCTGAATCCAATG
CAATCATGATCTTGATGCTGAAAGTCCTTTAATTTAATGGTGTTTTTGTTCATTGAACTCTCTCTCTCTACAACTTGTTTGTTCTCTCTCTAGAGAAATGGGGTGTGGAA
AGAGATTTTTTTTGAGCCCTCCTCCTCTGGCTCTTGCTCTTGCTCTTGTTCTTGTTGTTTTTGAAAGATTCAGTCTGGTTTTTGGGCTGAATTATACATATAAACAAGTC
AGCAGCTTGAGATTGGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACCATTCAGAGCCCTGATGGTGATATTATAGATTGTGTTCATAAACG
AAAACAGCCGGCTCTTGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAATGGCCAAGAGTGAAGAAGATGAAGGAGAAGAATGAAGTAAAGGATG
GCGATGGCAGTGAAGGGAGGCGGGGATCAGGTGCGGGAGGTGCATGGCAAACTTGGCGTGTGAACGGAACACGGTGTCCGAAGGGGAGTATTCCAGTGCGACGGAGCACA
GTGAACGACGTGCTACGAGCCAAGTCTTTGTTTGACTTTGGCAAGAAACGACGGCCGATTCTTCTTGACCGCCAAATGGACGCTCCTGATGTGGTCAGCGGGAATGGTCA
CGAGCATGCGATCGCGTACACAGGATCCTCGCAGGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATCCAAGTGGTCAACGAGTTCAGCCTCTCCC
AGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGGGACAGCAGACCAAGATTATTCACA
TATTGGACGAGTGACGCGTATCAGGCAACGGGGTGCTACAACCTTCTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCGGCGATTTCTCCCATCTC
TTCATTTACCGGCAGCCAATATGACGTCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAATACTTTGGTAGGGTATTGGCCGG
CGGAACTGTTCACTCACCTCGTCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAAGGCCAGGGGTGAGCACACCTCCACCCAAATGGGCTCCGGT
CGTTTCGCTGAGGAGGGCTTTGGCAAGGCTAGCTACTTTCGAAACCTCGAGATCGTTGACTCCGACAATAGCCTTAGTGCTGTCCAAGAAATCTCAACGTTGGCGGAGAA
CAACCATTGCTACAATATTATGAGCTCCTACAATGACCAATGGGGCACCCACTTCTACTACGGCGGCCCTGGAAGAAATCCTGAATGCCAATGATCATTGTCTCGTCCAC
TCATGCGAGATCTATTAAAAGGAAAATTCAATTCTCATTTTTTAATTTGACAAGGTGGATCAATTTTCACGGTATATATTTTCAATTTCATCAAATGGAGTAAAATGATA
AAGAATTATAACATTTATCTAAATTTGATGAAATTGAAAATTTATACCGTAAATATTGATTGAGGGGTAGGAATTGAATTTTTCTTTTATTCTCCTATATTATAACTATT
TTAAGGGCTATTTTAGTAACTGAAATTAATTCTTTAATTGTGTAATTAATGATTAATGTGGGCAGCTATTTCCACTCCTGTTTTTGCTTCACTTTCAAAGGTTTCGATTG
GGAGAGAAATGAACTTGTACCATTCGAATAAAGGCATTGTATCGCTTAAACATGCTTGGAAATAGCAATATTTTGTAAGCTTATTACCCACTTAAT
Protein sequence
MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKN
EVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNE
FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLV
GYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
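
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative, and only the first six and last four codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGGGTGTGGAAAGAGA"  # first 18 nt of the CDS sequence
cds_end = "GAATGCCAATGA"          # last 12 nt of the CDS sequence
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGCGKR` and `ECQ*`, which match the first six residues and the last three residues of the listed protein sequence, followed by the stop codon.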