; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g19780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g19780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr6:15515217..15523249
RNA-Seq ExpressionMoc06g19780
SyntenyMoc06g19780
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6590299.1 hypothetical protein SDJN03_15722, partial [Cucurbita argyrosperma subsp. sororia]1.4e-21886.38Show/hide
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLRAKS+FD+GKK+RPILLDRQ+DAPD+VSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

XP_022151674.1 uncharacterized protein LOC111019590 [Momordica charantia]2.0e-260100Show/hide
Query:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
        PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
Subjt:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE

Query:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
        MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
Subjt:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF

Query:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
        TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
Subjt:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ

Query:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

XP_022961080.1 uncharacterized protein LOC111461698 [Cucurbita moschata]1.9e-21886.38Show/hide
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLR KS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]7.6e-22086.08Show/hide
Query:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLT-------IQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMK
        AL  +L++++ +RF+LV GLNYTYKQVSSLRL+RIQRHLD+INKPPLLT       IQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+ 
Subjt:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLT-------IQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMK

Query:  EKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKAT
        E  E + G G+    GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYT SS+EMYGAKAT
Subjt:  EKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKAT

Query:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDV
        INVWDPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISPISSF+GSQYD+
Subjt:  INVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDV

Query:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAE
        TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AE
Subjt:  TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAE

Query:  NNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        N +CYNIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]6.2e-22287.5Show/hide
Query:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKD
        AL  +L++++ +RF+LV GLNYTYKQVSSLRL+RIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+ E  E + 
Subjt:  ALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKD

Query:  GDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPS
        G G+    GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYT SS+EMYGAKATINVWDPS
Subjt:  GDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPS

Query:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKD
        IQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIGAAISPISSF+GSQYD+TILIWKD
Subjt:  IQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKD

Query:  PKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNI
        PKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CYNI
Subjt:  PKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNI

Query:  MSSYNDQWGTHFYYGGPGRNPECQ
        MSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  MSSYNDQWGTHFYYGGPGRNPECQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein3.3e-21384.27Show/hide
Query:  LALALALVLVVFERFSLVFGLNYTY-KQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        L   L    V+ +RF+LV GLNYTY K +SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K  KE  E 
Subjt:  LALALALVLVVFERFSLVFGLNYTY-KQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           + SE R GSGA  ++QTWRVNGTRCPKG++PVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS  GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLGNWWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS+VQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

A0A1S3B8R5 uncharacterized protein LOC1034872739.4e-21685.41Show/hide
Query:  LALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK
        L   L    VV +RF+LV GLNYTY+++SSLRLDRIQRHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ K +KE  E  
Subjt:  LALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK

Query:  DGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDP
          +  E R GSGA  A+QTWRVNGTRCPKG+IPVRR+TV DVLR+KSLFDFGKKRRPILLDR++DAPDVVSGNGHEHAIAYTGSS+EMYGAKATINVWDP
Subjt:  DGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDP

Query:  SIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWK
        SIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS +GSQYD+TILIWK
Subjt:  SIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWK

Query:  DPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYN
        DPKLGNWWMGFG+NTLVGYWPAELFTHL DHATMVEWGGEVVNS+  G+HTSTQMGSG F ++GF KASYFRNLEIVDSDNSLS VQ+IS +AEN +CYN
Subjt:  DPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYN

Query:  IMSSYNDQWGTHFYYGGPGRNPECQ
        IMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  IMSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1DDR6 uncharacterized protein LOC1110195909.6e-261100Show/hide
Query:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
        PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE
Subjt:  PRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQE

Query:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
        MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF
Subjt:  MYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSF

Query:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
        TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ
Subjt:  TGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQ

Query:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1HAZ7 uncharacterized protein LOC1114616989.1e-21986.38Show/hide
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL L+L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLR KS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLSAVQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPECQ
        NIMSSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1JIQ1 uncharacterized protein LOC1114854601.7e-21786.12Show/hide
Query:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV
        P AL  +L L++ +RF LV GLN++ KQVSSLRLDRIQRHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP+ KK+KE  E 
Subjt:  PLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEV

Query:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD
           D  +GR GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLRAKS+FD+GKK+RPILLDRQ+DAPDVVSGNGHEHAIAYT SS EMYGAKATINVWD
Subjt:  KDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWD

Query:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW
        PSIQVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SSF+GSQYD+TILIW
Subjt:  PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIW

Query:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY
        KDPKLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A G+HTSTQMGSG F ++GFGKASYFRNLEIVDSDNSLS VQ+IS +AEN +CY
Subjt:  KDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCY

Query:  NIMSSYNDQWGTHFYYGGPGRNPEC
        NIMSSYNDQWGTHFYYGGPGRNP+C
Subjt:  NIMSSYNDQWGTHFYYGGPGRNPEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.1e-13155.17Show/hide
Query:  FGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQ
        F L+Y  +   S +   +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P  + + + N+V     S  +     G   Q
Subjt:  FGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQ

Query:  TWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSG
         W   G +C +G+IP+RR+  +DVLRA S+  +GKK+R  +   +   PD+++ +GH+HAIAY     + YGAKATINVW+P IQ  NEFSLSQIW+L G
Subjt:  TWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSG

Query:  SFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVG
        SF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYD++ILIWKDPK G+WWM FG+  ++G
Subjt:  SFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVG

Query:  YWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGP
        YWP+ LF++L + A+M+EWGGEVVNS++ G+HTSTQMGSG+F EEGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND WG +FYYGGP
Subjt:  YWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGP

Query:  GRNPEC
        G+N +C
Subjt:  GRNPEC

AT3G13510.1 Protein of Unknown Function (DUF239)4.1e-13156.17Show/hide
Query:  SSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK-DGDGSEGRRGSGAGGAWQTWRVNGTRC
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P  + + + N+V  +  G E           Q W   G +C
Subjt:  SSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVK-DGDGSEGRRGSGAGGAWQTWRVNGTRC

Query:  PKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNS
         +G+IP+RR+  +DVLRA S+  +GKK+   +   +   PD+++ NGH+HAIAY     + YGAKAT+NVW+P IQ  NEFSLSQIW+L GSF G DLNS
Subjt:  PKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNS

Query:  IEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTH
        IEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S +  SQYD++ILIWKDPK G+WWM FG+  ++GYWP+ LF++
Subjt:  IEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTH

Query:  LVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPEC
        L + A+M+EWGGEVVNS++ G HT TQMGSG F EEGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND WG +FYYGGPG+N  C
Subjt:  LVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPEC

AT5G18460.1 Protein of Unknown Function (DUF239)7.4e-18171.09Show/hide
Query:  LALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDG
        L ++L L    R    F     Y+QVSSLRL RIQ+HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + P++K        KD 
Subjt:  LALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDG

Query:  DGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSI
        D  E    +   GAWQ W VNGTRCPKG++P+RR+T+NDVLRAKSLFDFGKKRR I LD++ + PD +  NGHEHAIAYT SS E+YGAKATINVWDP I
Subjt:  DGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDP
        + VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+F G+Q+D+TILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIM
        K+GNWWMG GD+TLVGYWPAELFTHL DHAT VEWGGEVVN++A G HT+TQMGSG F +EGFGKASYFRNLE+VDSDNSL  V ++  LAEN  CY+I 
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIM

Query:  SSYNDQWGTHFYYGGPGRNPEC
        SSY+++WGT+FYYGGPG NP C
Subjt:  SSYNDQWGTHFYYGGPGRNPEC

AT5G56530.1 Protein of Unknown Function (DUF239)1.7e-13257.8Show/hide
Query:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P     + K   K  +              Q W  NG  C +G+IPVR
Subjt:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR

Query:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV
        R+   DVLRA S+  +GKK+   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQV
Subjt:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV

Query:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM
        SP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYD++I IWKDPK G+WWM FGD  ++GYWP+ LF++L D A++
Subjt:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM

Query:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        VEWGGEVVN +  G HT+TQMGSG+F +EGF KASYFRN+++VDS N+L   + ++T  E ++CY++    ND WG +FYYGGPGRNP CQ
Subjt:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ

AT5G56530.2 Protein of Unknown Function (DUF239)1.7e-13257.8Show/hide
Query:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR
        + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P     + K   K  +              Q W  NG  C +G+IPVR
Subjt:  IQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKNEVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVR

Query:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV
        R+   DVLRA S+  +GKK+   + L R  D PD+++ +GH+HAIAY     + YGAKATINVW+P +Q  NEFSLSQ+WIL GSF G DLNSIEAGWQV
Subjt:  RSTVNDVLRAKSLFDFGKKRR-PILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQV

Query:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM
        SP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S F   QYD++I IWKDPK G+WWM FGD  ++GYWP+ LF++L D A++
Subjt:  SPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLVDHATM

Query:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ
        VEWGGEVVN +  G HT+TQMGSG+F +EGF KASYFRN+++VDS N+L   + ++T  E ++CY++    ND WG +FYYGGPGRNP CQ
Subjt:  VEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGTGGAAAGAGATTTTTTTTGAGCCCTCCTCCTCTGGCTCTTGCTCTTGCTCTTGTTCTTGTTGTTTTTGAAAGATTCAGTCTGGTTTTTGGGCTGAATTATAC
ATATAAACAAGTCAGCAGCTTGAGATTGGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACCATTCAGAGCCCTGATGGTGATATTATAGATT
GTGTTCATAAACGAAAACAGCCGGCTCTTGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAATGGCCAAGAGTGAAGAAGATGAAGGAGAAGAAT
GAAGTAAAGGATGGCGATGGCAGTGAAGGGAGGCGGGGATCAGGTGCGGGAGGTGCATGGCAAACTTGGCGTGTGAACGGAACACGGTGTCCGAAGGGGAGTATTCCAGT
GCGACGGAGCACAGTGAACGACGTGCTACGAGCCAAGTCTTTGTTTGACTTTGGCAAGAAACGACGGCCGATTCTTCTTGACCGCCAAATGGACGCTCCTGATGTGGTCA
GCGGGAATGGTCACGAGCATGCGATCGCGTACACAGGATCCTCGCAGGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATCCAAGTGGTCAACGAG
TTCAGCCTCTCCCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGGGACAGCAGACC
AAGATTATTCACATATTGGACGAGTGACGCGTATCAGGCAACGGGGTGCTACAACCTTCTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCGGCGA
TTTCTCCCATCTCTTCATTTACCGGCAGCCAATATGACGTCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAATACTTTGGTA
GGGTATTGGCCGGCGGAACTGTTCACTCACCTCGTCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAAGGCCAGGGGTGAGCACACCTCCACCCA
AATGGGCTCCGGTCGTTTCGCTGAGGAGGGCTTTGGCAAGGCTAGCTACTTTCGAAACCTCGAGATCGTTGACTCCGACAATAGCCTTAGTGCTGTCCAAGAAATCTCAA
CGTTGGCGGAGAACAACCATTGCTACAATATTATGAGCTCCTACAATGACCAATGGGGCACCCACTTCTACTACGGCGGCCCTGGAAGAAATCCTGAATGCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGTGGAAAGAGATTTTTTTTGAGCCCTCCTCCTCTGGCTCTTGCTCTTGCTCTTGTTCTTGTTGTTTTTGAAAGATTCAGTCTGGTTTTTGGGCTGAATTATAC
ATATAAACAAGTCAGCAGCTTGAGATTGGACAGGATTCAAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACCATTCAGAGCCCTGATGGTGATATTATAGATT
GTGTTCATAAACGAAAACAGCCGGCTCTTGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACGGAATGGCCAAGAGTGAAGAAGATGAAGGAGAAGAAT
GAAGTAAAGGATGGCGATGGCAGTGAAGGGAGGCGGGGATCAGGTGCGGGAGGTGCATGGCAAACTTGGCGTGTGAACGGAACACGGTGTCCGAAGGGGAGTATTCCAGT
GCGACGGAGCACAGTGAACGACGTGCTACGAGCCAAGTCTTTGTTTGACTTTGGCAAGAAACGACGGCCGATTCTTCTTGACCGCCAAATGGACGCTCCTGATGTGGTCA
GCGGGAATGGTCACGAGCATGCGATCGCGTACACAGGATCCTCGCAGGAGATGTACGGAGCGAAGGCGACAATAAACGTGTGGGACCCGTCAATCCAAGTGGTCAACGAG
TTCAGCCTCTCCCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGGGACAGCAGACC
AAGATTATTCACATATTGGACGAGTGACGCGTATCAGGCAACGGGGTGCTACAACCTTCTATGCGCTGGATTTGTACAAACAAACAGCAAAATCGCGATCGGAGCGGCGA
TTTCTCCCATCTCTTCATTTACCGGCAGCCAATATGACGTCACCATTCTCATTTGGAAGGATCCAAAGCTGGGAAACTGGTGGATGGGATTTGGGGACAATACTTTGGTA
GGGTATTGGCCGGCGGAACTGTTCACTCACCTCGTCGACCACGCCACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAAGGCCAGGGGTGAGCACACCTCCACCCA
AATGGGCTCCGGTCGTTTCGCTGAGGAGGGCTTTGGCAAGGCTAGCTACTTTCGAAACCTCGAGATCGTTGACTCCGACAATAGCCTTAGTGCTGTCCAAGAAATCTCAA
CGTTGGCGGAGAACAACCATTGCTACAATATTATGAGCTCCTACAATGACCAATGGGGCACCCACTTCTACTACGGCGGCCCTGGAAGAAATCCTGAATGCCAATGA
Protein sequenceShow/hide protein sequence
MGCGKRFFLSPPPLALALALVLVVFERFSLVFGLNYTYKQVSSLRLDRIQRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPRVKKMKEKN
EVKDGDGSEGRRGSGAGGAWQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKRRPILLDRQMDAPDVVSGNGHEHAIAYTGSSQEMYGAKATINVWDPSIQVVNE
FSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSFTGSQYDVTILIWKDPKLGNWWMGFGDNTLV
GYWPAELFTHLVDHATMVEWGGEVVNSKARGEHTSTQMGSGRFAEEGFGKASYFRNLEIVDSDNSLSAVQEISTLAENNHCYNIMSSYNDQWGTHFYYGGPGRNPECQ