; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g00400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g00400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr8:247970..249837
RNA-Seq ExpressionMoc08g00400
SyntenyMoc08g00400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444361.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103487714 [Cucumis melo]7.2e-22386.48Show/hide
Query:  DQTRLGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
        D + LG RLSMAS +  I+ T    I +LF L  STI+PVHS     +TNQT FHP++ELNKLKMIRA L++INKPAV TIQS DGD+IDCVLSH QPAF
Subjt:  DQTRLGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF

Query:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
        DHP L+GQ PLDPPERP+GHK PRT TESFQLWSM GE CPEGTVPIRRTTEE+MLRATSFQMFG+KVR+WVRRET+SDGHEHAVGYVTG+HYFGAKASI
Subjt:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI

Query:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
        NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS +GGQFDISL
Subjt:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL

Query:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
        LVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHP
Subjt:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP

Query:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        NCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_022131433.1 uncharacterized protein LOC111004647 [Momordica charantia]1.6e-259100Show/hide
Query:  MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
        MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
Subjt:  MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF

Query:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
        DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
Subjt:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI

Query:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
        NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
Subjt:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL

Query:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
        LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
Subjt:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP

Query:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
Subjt:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_022961843.1 uncharacterized protein LOC111462488 [Cucurbita moschata]3.2e-22389Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQS DGDLIDCVLSH QPAFDHP LKGQ PL
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL

Query:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQFDISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW

Query:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT
        WLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHPNCYDIQGG+NT
Subjt:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT

Query:  IWGNYFYYGGPGRNDRCP
        +WGNYFYYGGPGRN RCP
Subjt:  IWGNYFYYGGPGRNDRCP

XP_023546698.1 uncharacterized protein LOC111805725 [Cucurbita pepo subsp. pepo]4.2e-22388.76Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL
        +S   +IS+ I +    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQS DGDLIDCVLSH QPAFDHP LKGQ PL
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL

Query:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQFDISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW

Query:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT
        WLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHPNCYDIQGG+NT
Subjt:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT

Query:  IWGNYFYYGGPGRNDRCP
        +WGNYFYYGGPGRN RCP
Subjt:  IWGNYFYYGGPGRNDRCP

XP_038885684.1 uncharacterized protein LOC120075988 [Benincasa hispida]2.5e-22388Show/hide
Query:  GQRLSMASGV--LDISRTIAILFLLAG----STISPVHSS---ATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPN
        G RLSMAS +  ++IS+TI I  LL      STI+PVHS+   ATNQT FHP+EELNKL MIRA L++INKPA+ TIQS DGDLIDCVLSHQQPAFDHP 
Subjt:  GQRLSMASGV--LDISRTIAILFLLAG----STISPVHSS---ATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPN

Query:  LKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWA
        L+GQ PLDPPERP+GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFG+KV RWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWA
Subjt:  LKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWA

Query:  PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWK
        PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS +GGQFDISLLVWK
Subjt:  PRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWK

Query:  DPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYD
        DPKHGNWWLEFG GVLVGYWPSFLFTHL+DHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHPNCYD
Subjt:  DPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYD

Query:  IQGGVNTIWGNYFYYGGPGRNDRCP
        IQGG+NT+WGNYFYYGGPGRNDRCP
Subjt:  IQGGVNTIWGNYFYYGGPGRNDRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LKI2 Uncharacterized protein3.8e-22286.25Show/hide
Query:  DQTRLGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS---SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
        D + LG RLSMAS +  ++I+ TI I   LF L  STI+PVHS     TNQT FHP++ELNKLKMIRA L++INKPA+ TIQS DGD+IDCVLSH QPAF
Subjt:  DQTRLGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS---SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF

Query:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
        DHP L+GQKPLDPPERP+GHK PRT TESFQLWS  GE CPEGTVPIRRTTEE++LRATSFQMFGRKVR+WVRRET+SDGHEHAVGYVTG+HYFGAKASI
Subjt:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI

Query:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
        NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS +GGQFDISL
Subjt:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL

Query:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
        LVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHP
Subjt:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP

Query:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        NCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A1S3BA82 LOW QUALITY PROTEIN: uncharacterized protein LOC1034877143.5e-22386.48Show/hide
Query:  DQTRLGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
        D + LG RLSMAS +  I+ T    I +LF L  STI+PVHS     +TNQT FHP++ELNKLKMIRA L++INKPAV TIQS DGD+IDCVLSH QPAF
Subjt:  DQTRLGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF

Query:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
        DHP L+GQ PLDPPERP+GHK PRT TESFQLWSM GE CPEGTVPIRRTTEE+MLRATSFQMFG+KVR+WVRRET+SDGHEHAVGYVTG+HYFGAKASI
Subjt:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI

Query:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
        NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS +GGQFDISL
Subjt:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL

Query:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
        LVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHP
Subjt:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP

Query:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        NCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A6J1BTC9 uncharacterized protein LOC1110046477.9e-260100Show/hide
Query:  MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
        MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF
Subjt:  MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAF

Query:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
        DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI
Subjt:  DHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASI

Query:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
        NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL
Subjt:  NVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISL

Query:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
        LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP
Subjt:  LVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHP

Query:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        NCYDIQGGVNTIWGNYFYYGGPGRNDRCP
Subjt:  NCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A6J1HCZ9 uncharacterized protein LOC1114624881.6e-22389Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQS DGDLIDCVLSH QPAFDHP LKGQ PL
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL

Query:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQFDISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW

Query:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT
        WLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHPNCYDIQGG+NT
Subjt:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT

Query:  IWGNYFYYGGPGRNDRCP
        +WGNYFYYGGPGRN RCP
Subjt:  IWGNYFYYGGPGRNDRCP

A0A6J1KBP1 uncharacterized protein LOC1114918483.8e-22288.52Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQS DGDLIDCVLSH QPAFDHP LKGQ PL
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPL

Query:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE++LRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQFDISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNW

Query:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT
        WLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HT TEMGSGHFAGEGF KASYFRNL+VVDWDNSLVPLSNLVVLADHPNCYDIQGG+NT
Subjt:  WLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNT

Query:  IWGNYFYYGGPGRNDRCP
        +WGNYFYYGGPGRN RCP
Subjt:  IWGNYFYYGGPGRNDRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)5.7e-18673.33Show/hide
Query:  SRTIAILFLLAGSTISPVHS---SATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPR
        S  + +  LL  S+ S V S   S  NQT  P +ELNKLK I   L +INKP+++TI S DGD+IDCVL H QPAFDHP+L+GQKPLDPPERPRGH +  
Subjt:  SRTIAILFLLAGSTISPVHS---SATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPR

Query:  TVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGS
           +SFQLW M+GETCPEGTVPIRRT EE++LRA S   FG+K+R + RR+T+S+GHEHAVGYV+G+ Y+GAKASINVWAP+V +QYEFSLSQ+W+ISGS
Subjt:  TVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGS

Query:  FGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYW
        FG+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN++IAIGAAISP+SS  GGQFDI+LL+WKDPKHGNWWLEFG G+LVGYW
Subjt:  FGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYW

Query:  PSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGR
        PSFLFTHL++HA+MVQ+GGE+VNSSP G HT T+MGSGHFA EGF K+SYFRN++VVDWDN+LVP  NL VLADHPNCYDIQGG N  WG+YFYYGGPG+
Subjt:  PSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGR

Query:  NDRCP
        N +CP
Subjt:  NDRCP

AT1G23340.1 Protein of Unknown Function (DUF239)1.5e-18170.75Show/hide
Query:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTES
        T  +L  L  S  SP +S++      P+ E+ K+K+IR +L++INKPA++TI S+DGD IDCV SH QPAFDHP L+GQ+P+DPPE P G+ Q     E+
Subjt:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTES

Query:  FQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL
        FQLWS+ GE+CPEGT+PIRRTTE++MLRA S + FGRK+RR VRR+++S+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF  DL
Subjt:  FQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL

Query:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLF
        NTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS  GGQFDISLL+WKDPKHG+WWL+FG G LVGYWP  LF
Subjt:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLF

Query:  THLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        THL++H  MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P+SNL VLADHPNCYDI+GGVN +WGN+FYYGGPG+N +CP
Subjt:  THLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G23340.2 Protein of Unknown Function (DUF239)1.5e-18170.75Show/hide
Query:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTES
        T  +L  L  S  SP +S++      P+ E+ K+K+IR +L++INKPA++TI S+DGD IDCV SH QPAFDHP L+GQ+P+DPPE P G+ Q     E+
Subjt:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTES

Query:  FQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL
        FQLWS+ GE+CPEGT+PIRRTTE++MLRA S + FGRK+RR VRR+++S+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF  DL
Subjt:  FQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDL

Query:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLF
        NTIEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS  GGQFDISLL+WKDPKHG+WWL+FG G LVGYWP  LF
Subjt:  NTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLF

Query:  THLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        THL++H  MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P+SNL VLADHPNCYDI+GGVN +WGN+FYYGGPG+N +CP
Subjt:  THLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G70550.1 Protein of Unknown Function (DUF239)8.3e-18570Show/hide
Query:  QRLSMASGVLDISRTIAILFLLA------GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQK
        Q++S  S  +  S  + ++ LL        ST S  +S+A +QT  P+EEL KL +IR  L++INKPAV+TIQS+DGD IDCV +HQQPAFDHP L+GQK
Subjt:  QRLSMASGVLDISRTIAILFLLA------GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQK

Query:  PLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD
        PLDPPE P+G+ +     E+ QLWS+ GE+CPEGT+PIRRTTE++MLRA+S Q FGRK+RR V+R++T++GHEHAVGYVTG  Y+GAKASINVW+PRV  
Subjt:  PLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHG
        QYEFSLSQ+WVI+GSF  DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISP SS  GGQFDISLL+WKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHG

Query:  NWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGV
        +WWL+FG G LVGYWP+FLFTHL+ H +MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P SNL +LADHPNCYDI+GG 
Subjt:  NWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGV

Query:  NTIWGNYFYYGGPGRNDRCP
        N +WGNYFYYGGPG+N RCP
Subjt:  NTIWGNYFYYGGPGRNDRCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.1e-18472.28Show/hide
Query:  RTIAILFLLA---GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRT
        R I +L L++    ST S  +S+A +QT  P+EEL KL +IR  L++INKPAV+TIQS+DGD IDCV +HQQPAFDHP L+GQKPLDPPE P+G+ +   
Subjt:  RTIAILFLLA---GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRT

Query:  VTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF
          E+ QLWS+ GE+CPEGT+PIRRTTE++MLRA+S Q FGRK+RR V+R++T++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WVI+GSF
Subjt:  VTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSF

Query:  GDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWP
          DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISP SS  GGQFDISLL+WKDPKHG+WWL+FG G LVGYWP
Subjt:  GDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWP

Query:  SFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRN
        +FLFTHL+ H +MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P SNL +LADHPNCYDI+GG N +WGNYFYYGGPG+N
Subjt:  SFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRN

Query:  DRCP
         RCP
Subjt:  DRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGGCTGGGCGCTGCATTTCTTTGATCAAACAAGACTCGGACAGCGCCTAAGCATGGCCTCCGGCGTGTTGGACATCTCCCGAACCATTGCCATTCTCTTT
CTTCTTGCAGGCTCCACAATTAGCCCGGTGCACTCGTCTGCCACTAACCAAACCTTCCATCCCGAAGAGGAATTAAACAAGTTGAAGATGATAAGAGCTCGCTTG
GAGGAGATCAACAAGCCTGCTGTCCGCACAATTCAGAGTGCTGACGGGGACCTCATAGATTGTGTTTTATCTCATCAGCAACCAGCATTTGACCATCCTAATTTG
AAAGGACAAAAACCGTTGGATCCGCCGGAGAGACCTCGAGGACATAAGCAGCCTAGGACGGTGACAGAAAGCTTCCAATTATGGAGCATGGATGGGGAAACTTGT
CCAGAAGGAACAGTCCCCATAAGAAGAACAACCGAAGAAGAAATGCTAAGAGCTACCTCTTTTCAGATGTTTGGAAGGAAAGTAAGAAGATGGGTTAGGAGAGAA
ACAACGAGCGACGGGCATGAGCACGCAGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGTATTAACGTGTGGGCACCTCGCGTCGCCGATCAG
TATGAATTCAGTTTGTCGCAAATGTGGGTCATCTCCGGCTCTTTCGGCGACGATCTCAACACCATTGAAGCTGGTTGGCAGGTTAGCCCAGAGCTGTACGGAGAC
AATTACCCAAGATTCTTCACTTACTGGACTTCGGATGCATACCAAGCAACCGGATGCTACAATTTGCTGTGCTCGGGGTTTGTTCAGACAAATAACAAAATTGCA
ATTGGGGCTGCGATTTCTCCAACCTCATCGTTGGACGGCGGGCAATTTGACATCAGCCTGTTGGTGTGGAAGGACCCGAAGCATGGAAATTGGTGGCTGGAATTC
GGAGGGGGAGTTCTGGTGGGGTACTGGCCGTCGTTCTTGTTCACCCACCTACAGGACCACGCGACGATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCGAGCCCG
TCGGGGTTCCACACGAGGACGGAGATGGGGAGCGGGCATTTCGCCGGCGAGGGATTCAGAAAGGCTTCGTATTTCCGGAATCTGGAAGTGGTGGATTGGGACAAC
AGCTTGGTTCCGTTGTCAAACCTGGTGGTACTGGCGGATCATCCAAACTGCTACGACATTCAAGGCGGGGTCAACACCATTTGGGGTAACTACTTCTACTACGGC
GGACCTGGTCGAAACGACAGGTGTCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGGCTGGGCGCTGCATTTCTTTGATCAAACAAGACTCGGACAGCGCCTAAGCATGGCCTCCGGCGTGTTGGACATCTCCCGAACCATTGCCATTCTCTTT
CTTCTTGCAGGCTCCACAATTAGCCCGGTGCACTCGTCTGCCACTAACCAAACCTTCCATCCCGAAGAGGAATTAAACAAGTTGAAGATGATAAGAGCTCGCTTG
GAGGAGATCAACAAGCCTGCTGTCCGCACAATTCAGAGTGCTGACGGGGACCTCATAGATTGTGTTTTATCTCATCAGCAACCAGCATTTGACCATCCTAATTTG
AAAGGACAAAAACCGTTGGATCCGCCGGAGAGACCTCGAGGACATAAGCAGCCTAGGACGGTGACAGAAAGCTTCCAATTATGGAGCATGGATGGGGAAACTTGT
CCAGAAGGAACAGTCCCCATAAGAAGAACAACCGAAGAAGAAATGCTAAGAGCTACCTCTTTTCAGATGTTTGGAAGGAAAGTAAGAAGATGGGTTAGGAGAGAA
ACAACGAGCGACGGGCATGAGCACGCAGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGTATTAACGTGTGGGCACCTCGCGTCGCCGATCAG
TATGAATTCAGTTTGTCGCAAATGTGGGTCATCTCCGGCTCTTTCGGCGACGATCTCAACACCATTGAAGCTGGTTGGCAGGTTAGCCCAGAGCTGTACGGAGAC
AATTACCCAAGATTCTTCACTTACTGGACTTCGGATGCATACCAAGCAACCGGATGCTACAATTTGCTGTGCTCGGGGTTTGTTCAGACAAATAACAAAATTGCA
ATTGGGGCTGCGATTTCTCCAACCTCATCGTTGGACGGCGGGCAATTTGACATCAGCCTGTTGGTGTGGAAGGACCCGAAGCATGGAAATTGGTGGCTGGAATTC
GGAGGGGGAGTTCTGGTGGGGTACTGGCCGTCGTTCTTGTTCACCCACCTACAGGACCACGCGACGATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCGAGCCCG
TCGGGGTTCCACACGAGGACGGAGATGGGGAGCGGGCATTTCGCCGGCGAGGGATTCAGAAAGGCTTCGTATTTCCGGAATCTGGAAGTGGTGGATTGGGACAAC
AGCTTGGTTCCGTTGTCAAACCTGGTGGTACTGGCGGATCATCCAAACTGCTACGACATTCAAGGCGGGGTCAACACCATTTGGGGTAACTACTTCTACTACGGC
GGACCTGGTCGAAACGACAGGTGTCCCTGA
Protein sequenceShow/hide protein sequence
MLGWALHFFDQTRLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQSADGDLIDCVLSHQQPAFDHPNL
KGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQ
YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEF
GGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYG
GPGRNDRCP