; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G001140 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G001140
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCmo_Chr05:472047..475615
RNA-Seq ExpressionCmoCh05G001140
SyntenyCmoCh05G001140
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598306.1 hypothetical protein SDJN03_08084, partial [Cucurbita argyrosperma subsp. sororia]8.1e-26895.55Show/hide
Query:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK
        MLLVHLLALQGAAVPPKHHAAM            VELPNKLFQFCVEQP+L REFFMGKRFLLGML+VTL VIVCGAVEGGSLSKQKS+RMNSLRKQAIK
Subjt:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK

Query:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK
        SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRI+KHVLLKA+SIERYGRK
Subjt:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK

Query:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
        KPMFSQENAQLP TRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
Subjt:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA

Query:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT
        SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYST IGRPPHTRT
Subjt:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT

Query:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

KAG7029277.1 hypothetical protein SDJN02_07615, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-24698.8Show/hide
Query:  MGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE
        MGKRFLLGML+VTL VIVCGAVEGGSLSKQKS+RMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE
Subjt:  MGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE

Query:  EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY
        EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKA+SIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY
Subjt:  EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY

Query:  STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ
        STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ
Subjt:  STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ

Query:  YGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD
        YGESINIGYWPPELFSALSHTAETVQWGGEVYST IGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD
Subjt:  YGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD

Query:  DPELYYGGPGKNPRCP
        DPELYYGGPGKNPRCP
Subjt:  DPELYYGGPGKNPRCP

XP_022961958.1 uncharacterized protein LOC111462577 isoform X1 [Cucurbita moschata]2.3e-270100Show/hide
Query:  MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF
        MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF
Subjt:  MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF

Query:  DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL
        DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL
Subjt:  DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL

Query:  TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI
        TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI
Subjt:  TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI

Query:  ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM
        ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM
Subjt:  ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM

Query:  RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

XP_022997646.1 uncharacterized protein LOC111492515 isoform X1 [Cucurbita maxima]4.7e-27697.46Show/hide
Query:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK
        ML VHLLALQGAAVPPKHHAA+AAFGKGAQ DARVELPNKLFQFCVEQPEL REFFMGKRFLLGML+VTLTVIVCGAVEGGSLSKQKS+RMNSLRKQAIK
Subjt:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK

Query:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK
        SIRSEDGDIIDCVSIYDQPAFDHPAL NHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKA+S+ERYGRK
Subjt:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK

Query:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
        KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
Subjt:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA

Query:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT
        S+KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELF ALSHTAETVQWGGEVYST IGRPPHTRT
Subjt:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT

Query:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

XP_023546232.1 uncharacterized protein LOC111805388 isoform X2 [Cucurbita pepo subsp. pepo]3.9e-24698.56Show/hide
Query:  MGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE
        MGKRFLLGML+VTLTVIVCGAVEGGSLSKQKS+RMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE
Subjt:  MGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKE

Query:  EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY
        EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKA+SIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY
Subjt:  EKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEY

Query:  STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ
        STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADAS+KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ
Subjt:  STSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQ

Query:  YGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD
        YGESINIGYWPPELF ALSHTAETVQWGGEVYST IGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD
Subjt:  YGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLD

Query:  DPELYYGGPGKNPRCP
        DPELYYGGPGKNPRCP
Subjt:  DPELYYGGPGKNPRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ6 Uncharacterized protein2.3e-19680.88Show/hide
Query:  MLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVK
        ML+  LTVIVCG VE GSLSK K+ +M+SLRKQA KSI+SEDGDIIDCVSIYDQPAFDHPALRNHTIQ+APTYDPT+D+HSKKATAE EG  EK+SM VK
Subjt:  MLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVK

Query:  QTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALL
        Q WRKSGSCP+ TIPIRRIRKHV LKA S+  YG+K+P    E AQL  +RSSH LL N SKA LL  G+N+N AKGDIKVCNP VEFDDEYSTSQVALL
Subjt:  QTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALL

Query:  TGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIG
        TGPYYN+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST   LP+EI MFLFRDF+T+NWWVQYGESINIG
Subjt:  TGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIG

Query:  YWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGG
        YWP ELF AL +TAETVQWGGEVYST +G PPHT T MG+G+FPD+ISG SGWVKR+RVRDNSM+L FP +VEHYSDEYDCYDVDFIR+YLDDPELYYGG
Subjt:  YWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGG

Query:  PGKNPRCP
        PGKN RCP
Subjt:  PGKNPRCP

A0A6J1HDC5 uncharacterized protein LOC111462577 isoform X11.1e-270100Show/hide
Query:  MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF
        MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF
Subjt:  MAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAF

Query:  DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL
        DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL
Subjt:  DHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLL

Query:  TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI
        TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI
Subjt:  TNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEI

Query:  ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM
        ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM
Subjt:  ALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRM

Query:  RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  RVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

A0A6J1HFH9 uncharacterized protein LOC111462577 isoform X23.8e-223100Show/hide
Query:  SEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPM
        SEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPM
Subjt:  SEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPM

Query:  FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEK
        FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEK
Subjt:  FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEK

Query:  TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMG
        TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMG
Subjt:  TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMG

Query:  SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

A0A6J1K5P5 uncharacterized protein LOC111492515 isoform X24.0e-22098.37Show/hide
Query:  SEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPM
        SEDGDIIDCVSIYDQPAFDHPAL NHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKA+S+ERYGRKKPM
Subjt:  SEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPM

Query:  FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEK
        FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADAS+K
Subjt:  FSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEK

Query:  TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMG
        TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELF ALSHTAETVQWGGEVYST IGRPPHTRTAMG
Subjt:  TGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMG

Query:  SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  SGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

A0A6J1K820 uncharacterized protein LOC111492515 isoform X12.3e-27697.46Show/hide
Query:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK
        ML VHLLALQGAAVPPKHHAA+AAFGKGAQ DARVELPNKLFQFCVEQPEL REFFMGKRFLLGML+VTLTVIVCGAVEGGSLSKQKS+RMNSLRKQAIK
Subjt:  MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIK

Query:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK
        SIRSEDGDIIDCVSIYDQPAFDHPAL NHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKA+S+ERYGRK
Subjt:  SIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRK

Query:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
        KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA
Subjt:  KPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADA

Query:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT
        S+KTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELF ALSHTAETVQWGGEVYST IGRPPHTRT
Subjt:  SEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRT

Query:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
        AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
Subjt:  AMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)3.1e-8439.8Show/hide
Query:  LSKQK---SMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIP
        +SKQK      +N L K A+KSI+S DGD+IDCV I  QPAFDHP L++H IQ+ P Y P       K +A +  ++E +   + Q W + G C EGTIP
Subjt:  LSKQK---SMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIP

Query:  IRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRS---KAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPY-YNFEAIE
        +RR ++  +L+A S++RYG+KK         +P+ +S+   L N+S    A     G+ Y  AK  I V  P+++  +E+S SQ+ LL G +  +  +IE
Subjt:  IRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRS---KAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPY-YNFEAIE

Query:  SGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALS
        +GW V+P +YGD  TRLF YWT+DA + TGC++L C GF+Q +++IA+G++I P+S      Y+I + +++D K  +WW+Q+G    +GYWP  LFS L+
Subjt:  SGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALS

Query:  HTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP
         +A  ++WGGEV ++      HT T MGSG+FP+     + + + ++V D S  L  P  +  ++++ +CYDV    +       YYGGPGKN +CP
Subjt:  HTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP

AT2G44210.1 Protein of Unknown Function (DUF239)4.1e-8439.08Show/hide
Query:  LIVTLTVIVCGAVEGGSLSKQKSMR--MNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVV
        L++T+ ++    V G +      +R  +  L K A+KSI+S DGD+IDCV I DQPAF HP L NHT+Q+ P+ +P       K +++ + ++   S  +
Subjt:  LIVTLTVIVCGAVEGGSLSKQKSMR--MNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVV

Query:  KQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLT--NRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQV
         Q W  +G CP+ TIPIRR R+  L +A S+E YG K    +Q++   P +     +LT      A +      +  AK  I V  P VE  +E+S +Q+
Subjt:  KQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLT--NRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQV

Query:  ALLTGPY-YNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGES
         +L G +  +  +IE+GW V+P +YGD +TRLF YWT+DA + TGC++L C GFVQ + EIA+G +I P+S      Y+I + +++D K  +WW+Q+GE 
Subjt:  ALLTGPY-YNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGES

Query:  INIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPEL
          IGYWP  LFS LS +A  ++WGGEV ++      HT T MGSGRF +   G + + K ++V D S  L  P  ++ ++D+ +CY+V            
Subjt:  INIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPEL

Query:  YYGGPGKNPRCP
        YYGGPG+NP CP
Subjt:  YYGGPGKNPRCP

AT3G13510.1 Protein of Unknown Function (DUF239)1.8e-8440Show/hide
Query:  CGAVEGGSLSKQK---SMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSG
        C A   GS S+QK      +N L K  +K+I+S DGDIIDC+ I  QPAFDHP L++H IQ+ P+Y P       K +AE +GKE      + Q W + G
Subjt:  CGAVEGGSLSKQK---SMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSG

Query:  SCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRS---KAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPY
         C EGTIP+RR R+  +L+A S++RYG+KK      +  +PI +S+   L N++    A     G+ Y  AK  + V  P+++  +E+S SQ+ LL G +
Subjt:  SCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRS---KAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPY

Query:  -YNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWP
          +  +IE+GW V+P +YGD  TRLF YWT+DA + TGC++L C GF+Q +++IA+G++I P+S      Y+I + +++D K  +WW+Q+G    +GYWP
Subjt:  -YNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWP

Query:  PELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGK
          LFS L+ +A  ++WGGEV ++      HT T MGSG FP+     + + + ++V D S  L  P  +  ++++ +CYDV    +       YYGGPGK
Subjt:  PELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGK

Query:  NPRCP
        N  CP
Subjt:  NPRCP

AT5G25950.1 Protein of Unknown Function (DUF239)2.8e-9341.94Show/hide
Query:  MLIVTLTVIVCGAVEG--GSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMV
        M    + VI+CG      G  S    +++ +L K A+K+I+SEDGDIIDC+ IY Q AFDHPAL+NH IQ+     P++   +KK T    G  E    +
Subjt:  MLIVTLTVIVCGAVEG--GSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMV

Query:  VKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKP---------MFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCN-PRVEF
          Q W KSG CP GTIP+RR+ +  + +A S   +GRK P         +  + N  +   + +      RS+AF++  G N+  A+ DI + N PRVE 
Subjt:  VKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKP---------MFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCN-PRVEF

Query:  DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDN
         D YST+Q+ L+ G   NFE++E GW VNP V+GD +TRLF+ WT D   KTGC +L C GFVQTS + ALG+ + P+S+ +   Y I + +F D  + N
Subjt:  DDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDN

Query:  WWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIG-RPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFI
        WW+    ++ +GYWP  LF+ L H+A  VQWGGEV+S N+  + PHT TAMGSG++  +I   + +   +R++D SM L +P ++  Y+DEY+CY     
Subjt:  WWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIG-RPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFI

Query:  R-DYLDDPELYYGGPGKNPRCP
        R  Y+ +P  Y+GGPG+N RCP
Subjt:  R-DYLDDPELYYGGPGKNPRCP

AT5G25960.1 Protein of Unknown Function (DUF239)4.8e-8540.78Show/hide
Query:  MLIVTLTVIVCGAVEG--GSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMV
        M +  +  I+CG      G  S    M++ SL K ++K+I+SEDGDIIDC+ IY Q AFDHPALRNH IQ+     P++D  +KK T    G  E+   +
Subjt:  MLIVTLTVIVCGAVEG--GSLSKQKSMRMNSLRKQAIKSIRSEDGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMV

Query:  VKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCN-PRVEFDDEYSTSQV
          Q W KSG+CP+GTIP                                              +A L+  G N+  A+ DI V N PRV+  D YS++Q+
Subjt:  VKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQLPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCN-PRVEFDDEYSTSQV

Query:  ALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESI
         LL G    FE+IE+GWAVNP V+GD +TRLF YWT D   KTGC +L C GFVQT+ + ALG+AI P+ST +   + I      D  + NWW+    ++
Subjt:  ALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFVQTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESI

Query:  NIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIR-DYLDDPEL
         IGYWP  LF+ L H+A  VQ GGEV+S N+G+ PHTRT+MGSG++  ++   + +   +R++D S+ + +P ++  Y+DEY+CY     R  Y+ +P  
Subjt:  NIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKRMRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIR-DYLDDPEL

Query:  YYGGPGKNPRCP
        Y+GGPG+N RCP
Subjt:  YYGGPGKNPRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTTGGTCCATTTGCTTGCTCTACAAGGAGCTGCCGTGCCTCCCAAGCACCACGCGGCCATGGCGGCTTTTGGAAAGGGCGCGCAGAATGACGCCAGGGTT
GAACTTCCAAATAAACTATTTCAGTTCTGCGTCGAGCAACCCGAGCTTCATCGAGAGTTTTTCATGGGGAAGCGATTCTTGCTCGGAATGTTGATTGTGACATTG
ACAGTCATAGTTTGTGGAGCTGTGGAGGGTGGTTCGCTTTCTAAACAGAAGAGTATGAGAATGAACTCTCTTAGGAAGCAAGCAATAAAAAGCATTCGGAGCGAA
GATGGCGACATCATAGATTGTGTAAGCATTTACGATCAGCCTGCTTTTGATCATCCTGCTCTGAGGAATCACACCATTCAGTTGGCACCCACTTATGATCCCACC
ATAGATGAGCATTCAAAGAAAGCAACAGCAGAAAGGGAAGGGAAGGAGGAAAAGAATTCCATGGTTGTGAAACAAACATGGAGGAAAAGTGGCAGTTGCCCTGAA
GGAACAATACCGATTCGAAGGATCCGAAAGCATGTCCTACTCAAAGCTAAATCGATAGAACGCTATGGACGAAAGAAACCGATGTTTTCCCAAGAAAACGCTCAG
CTTCCCATCACCCGAAGTTCGCATACTCTGCTAACGAATCGTTCGAAGGCATTTCTGCTTACTGCAGGAAACAACTACAATGCAGCTAAAGGAGACATTAAAGTA
TGTAACCCAAGGGTCGAGTTCGATGATGAATATAGTACTTCCCAAGTCGCTCTATTAACCGGCCCTTACTATAATTTTGAGGCTATCGAATCCGGATGGGCAGTA
AATCCAGGTGTTTACGGGGATCGACAGACTCGACTGTTCGTGTATTGGACTGCCGATGCTTCCGAAAAAACAGGCTGCTTTGATCTCACTTGCCCAGGTTTTGTT
CAAACAAGCAATGAAATTGCTCTTGGTTCAGCTATTTATCCAATCTCAACTCCCAATGGGCTTCCATATGAAATAATTATGTTCCTTTTTAGAGATTTCAAGACA
GATAATTGGTGGGTGCAATATGGGGAAAGCATTAACATAGGTTATTGGCCTCCTGAGCTATTCAGTGCACTAAGTCACACAGCAGAAACAGTACAATGGGGAGGG
GAAGTTTACAGCACAAACATAGGACGGCCTCCTCACACTAGAACAGCTATGGGCAGTGGAAGGTTTCCTGACTTCATTTCTGGTACTTCGGGTTGGGTAAAACGG
ATGCGAGTTCGAGATAATTCCATGGTTTTGATGTTTCCTGGTTGGGTTGAGCACTACTCCGATGAATACGATTGTTATGATGTCGATTTCATCCGAGATTATTTA
GATGATCCTGAACTATACTATGGAGGACCTGGTAAAAATCCAAGGTGCCCCTAA
mRNA sequenceShow/hide mRNA sequence
CGCCATCCTCATCGCCAGTACAAGTGAGAATGATAGGAACCGTCGATCGGAGCTCCCTTACACTCCACCGCCGATTTCTTCTCCGGTTTTGGATTCTTCACCGGA
ATCTCCTTCCCGACGGCGGAGACTTGGGATGAGGAGAAGCAACAGCGGAACATTCGAGAGAAGAATCTCATTTCGAAATTTTGCCCTAATCGAGATGATCCTGTA
ATGTATTCGTGGTGTTTCTGTGGAAATTGATAATTAATAATATTGAAACATTTATAGTGTGAAGATGAAGAAGGAGCAGATGGCATGCTGTTGGTCCATTTGCTT
GCTCTACAAGGAGCTGCCGTGCCTCCCAAGCACCACGCGGCCATGGCGGCTTTTGGAAAGGGCGCGCAGAATGACGCCAGGGTTGAACTTCCAAATAAACTATTT
CAGTTCTGCGTCGAGCAACCCGAGCTTCATCGAGAGTTTTTCATGGGGAAGCGATTCTTGCTCGGAATGTTGATTGTGACATTGACAGTCATAGTTTGTGGAGCT
GTGGAGGGTGGTTCGCTTTCTAAACAGAAGAGTATGAGAATGAACTCTCTTAGGAAGCAAGCAATAAAAAGCATTCGGAGCGAAGATGGCGACATCATAGATTGT
GTAAGCATTTACGATCAGCCTGCTTTTGATCATCCTGCTCTGAGGAATCACACCATTCAGTTGGCACCCACTTATGATCCCACCATAGATGAGCATTCAAAGAAA
GCAACAGCAGAAAGGGAAGGGAAGGAGGAAAAGAATTCCATGGTTGTGAAACAAACATGGAGGAAAAGTGGCAGTTGCCCTGAAGGAACAATACCGATTCGAAGG
ATCCGAAAGCATGTCCTACTCAAAGCTAAATCGATAGAACGCTATGGACGAAAGAAACCGATGTTTTCCCAAGAAAACGCTCAGCTTCCCATCACCCGAAGTTCG
CATACTCTGCTAACGAATCGTTCGAAGGCATTTCTGCTTACTGCAGGAAACAACTACAATGCAGCTAAAGGAGACATTAAAGTATGTAACCCAAGGGTCGAGTTC
GATGATGAATATAGTACTTCCCAAGTCGCTCTATTAACCGGCCCTTACTATAATTTTGAGGCTATCGAATCCGGATGGGCAGTAAATCCAGGTGTTTACGGGGAT
CGACAGACTCGACTGTTCGTGTATTGGACTGCCGATGCTTCCGAAAAAACAGGCTGCTTTGATCTCACTTGCCCAGGTTTTGTTCAAACAAGCAATGAAATTGCT
CTTGGTTCAGCTATTTATCCAATCTCAACTCCCAATGGGCTTCCATATGAAATAATTATGTTCCTTTTTAGAGATTTCAAGACAGATAATTGGTGGGTGCAATAT
GGGGAAAGCATTAACATAGGTTATTGGCCTCCTGAGCTATTCAGTGCACTAAGTCACACAGCAGAAACAGTACAATGGGGAGGGGAAGTTTACAGCACAAACATA
GGACGGCCTCCTCACACTAGAACAGCTATGGGCAGTGGAAGGTTTCCTGACTTCATTTCTGGTACTTCGGGTTGGGTAAAACGGATGCGAGTTCGAGATAATTCC
ATGGTTTTGATGTTTCCTGGTTGGGTTGAGCACTACTCCGATGAATACGATTGTTATGATGTCGATTTCATCCGAGATTATTTAGATGATCCTGAACTATACTAT
GGAGGACCTGGTAAAAATCCAAGGTGCCCCTAAGAGACTAACAAAATTGAGATGATTACCTCTTTTGCTGTCATTAGGTTCCTTCATGTAATGAATAACATAGAA
CTATCAAAAAGAAAGCTATTTCTCTTGTTTAAAATTCTCCTATAAGCATACAAGTTACTGAGACAGATGAGGGCTATTTCATTAAAAGAGCAATAATCA
Protein sequenceShow/hide protein sequence
MLLVHLLALQGAAVPPKHHAAMAAFGKGAQNDARVELPNKLFQFCVEQPELHREFFMGKRFLLGMLIVTLTVIVCGAVEGGSLSKQKSMRMNSLRKQAIKSIRSE
DGDIIDCVSIYDQPAFDHPALRNHTIQLAPTYDPTIDEHSKKATAEREGKEEKNSMVVKQTWRKSGSCPEGTIPIRRIRKHVLLKAKSIERYGRKKPMFSQENAQ
LPITRSSHTLLTNRSKAFLLTAGNNYNAAKGDIKVCNPRVEFDDEYSTSQVALLTGPYYNFEAIESGWAVNPGVYGDRQTRLFVYWTADASEKTGCFDLTCPGFV
QTSNEIALGSAIYPISTPNGLPYEIIMFLFRDFKTDNWWVQYGESINIGYWPPELFSALSHTAETVQWGGEVYSTNIGRPPHTRTAMGSGRFPDFISGTSGWVKR
MRVRDNSMVLMFPGWVEHYSDEYDCYDVDFIRDYLDDPELYYGGPGKNPRCP