; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000911 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000911
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold36:237391..239331
RNA-Seq ExpressionMS000911
SyntenyMS000911
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061068.1 uncharacterized protein E6C27_scaffold501G002060 [Cucumis melo var. makuwa]3.9e-21782.3Show/hide
Query:  LGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDL
        LG RLSMAS +  ++I+ TI I   LF L  STI+PVHS     +TNQT FHP++ELNKLKMIRA L++INKPAV TIQ               S DGD+
Subjt:  LGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDL

Query:  IDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYV
        IDCVLSH QPAFDHP L+GQ PLDPPERP+GHK PRT TESFQLWSM GE CPEGTVPIRRTTEE+MLRATSFQMFG+KVR+WVRRET+SDGHEHAVGYV
Subjt:  IDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYV

Query:  TGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN
        TG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACL          PELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN
Subjt:  TGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN

Query:  KIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRN
        KIAIGAAISPTSS +GGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRN
Subjt:  KIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRN

Query:  LEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        L+VVDWDNSLVPLSNLVVLADHPNCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  LEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_022131433.1 uncharacterized protein LOC111004647 [Momordica charantia]4.4e-24593.91Show/hide
Query:  QLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQ
        +LGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQ               SADGDLIDCVLSHQQ
Subjt:  QLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQ

Query:  PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK
        PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK
Subjt:  PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK

Query:  ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS
        ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS
Subjt:  ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS

Query:  PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS
        PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS
Subjt:  PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS

Query:  LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
Subjt:  LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_022961843.1 uncharacterized protein LOC111462488 [Cucurbita moschata]2.3e-21783.78Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQ               S DGDLIDCVLSH 
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ

Query:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA
        QPAFDHP LKGQ PLDPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGA
Subjt:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA

Query:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
        KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
Subjt:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI

Query:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN
        SPTSS DGGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDN
Subjt:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN

Query:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        SLVPLSNLVVLADHPNCYDIQGG+NT+WGNYFYYGGPGRN RCP
Subjt:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_023546698.1 uncharacterized protein LOC111805725 [Cucurbita pepo subsp. pepo]3.0e-21783.56Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ
        +S   +IS+ I +    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQ               S DGDLIDCVLSH 
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ

Query:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA
        QPAFDHP LKGQ PLDPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGA
Subjt:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA

Query:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
        KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
Subjt:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI

Query:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN
        SPTSS DGGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDN
Subjt:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN

Query:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        SLVPLSNLVVLADHPNCYDIQGG+NT+WGNYFYYGGPGRN RCP
Subjt:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

XP_038885684.1 uncharacterized protein LOC120075988 [Benincasa hispida]2.3e-21782.93Show/hide
Query:  GQRLSMASGV--LDISRTIAILFLLAG----STISPVHSS---ATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLI
        G RLSMAS +  ++IS+TI I  LL      STI+PVHS+   ATNQT FHP+EELNKL MIRA L++INKPA+ TIQ               S DGDLI
Subjt:  GQRLSMASGV--LDISRTIAILFLLAG----STISPVHSS---ATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLI

Query:  DCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVT
        DCVLSHQQPAFDHP L+GQ PLDPPERP+GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFG+KV RWVRRET+SDGHEHAVGYVT
Subjt:  DCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVT

Query:  GDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
        GDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
Subjt:  GDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK

Query:  IAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNL
        IAIGAAISPTSS +GGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHL+DHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL
Subjt:  IAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNL

Query:  EVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        +VVDWDNSLVPLSNLVVLADHPNCYDIQGG+NT+WGNYFYYGGPGRNDRCP
Subjt:  EVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

TrEMBL top hitse value%identityAlignment
A0A1S3BA82 LOW QUALITY PROTEIN: uncharacterized protein LOC1034877141.6e-21682.04Show/hide
Query:  LGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLI
        LG RLSMAS +  I+ T    I +LF L  STI+PVHS     +TNQT FHP++ELNKLKMIRA L++INKPAV TIQ               S DGD+I
Subjt:  LGQRLSMASGVLDISRT----IAILFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLI

Query:  DCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVT
        DCVLSH QPAFDHP L+GQ PLDPPERP+GHK PRT TESFQLWSM GE CPEGTVPIRRTTEE+MLRATSFQMFG+KVR+WVRRET+SDGHEHAVGYVT
Subjt:  DCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVT

Query:  GDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
        G+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK
Subjt:  GDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNK

Query:  IAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNL
        IAIGAAISPTSS +GGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL
Subjt:  IAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNL

Query:  EVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        +VVDWDNSLVPLSNLVVLADHPNCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  EVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A5A7V0I8 Uncharacterized protein1.9e-21782.3Show/hide
Query:  LGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDL
        LG RLSMAS +  ++I+ TI I   LF L  STI+PVHS     +TNQT FHP++ELNKLKMIRA L++INKPAV TIQ               S DGD+
Subjt:  LGQRLSMASGV--LDISRTIAI---LFLLAGSTISPVHS----SATNQT-FHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDL

Query:  IDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYV
        IDCVLSH QPAFDHP L+GQ PLDPPERP+GHK PRT TESFQLWSM GE CPEGTVPIRRTTEE+MLRATSFQMFG+KVR+WVRRET+SDGHEHAVGYV
Subjt:  IDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYV

Query:  TGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN
        TG+HYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACL          PELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN
Subjt:  TGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNN

Query:  KIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRN
        KIAIGAAISPTSS +GGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRN
Subjt:  KIAIGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRN

Query:  LEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        L+VVDWDNSLVPLSNLVVLADHPNCYDI+GG+NT+WGNYFYYGGPGRNDRCP
Subjt:  LEVVDWDNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A6J1BTC9 uncharacterized protein LOC1110046472.1e-24593.91Show/hide
Query:  QLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQ
        +LGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQ               SADGDLIDCVLSHQQ
Subjt:  QLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQ

Query:  PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK
        PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK
Subjt:  PAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAK

Query:  ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS
        ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS
Subjt:  ASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAIS

Query:  PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS
        PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS
Subjt:  PTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNS

Query:  LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
Subjt:  LVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A6J1HCZ9 uncharacterized protein LOC1114624881.1e-21783.78Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQ               S DGDLIDCVLSH 
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ

Query:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA
        QPAFDHP LKGQ PLDPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE+MLRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGA
Subjt:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA

Query:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
        KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
Subjt:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI

Query:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN
        SPTSS DGGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHT TEMGSGHFAGEGF KASYFRNL+VVDWDN
Subjt:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN

Query:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        SLVPLSNLVVLADHPNCYDIQGG+NT+WGNYFYYGGPGRN RCP
Subjt:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

A0A6J1KBP1 uncharacterized protein LOC1114918482.7e-21683.33Show/hide
Query:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ
        +S   +IS+ I I    L LL  STI+P+HSS     ATNQTFHP++ELNKLKMIRARL+ INKPA+ TIQ               S DGDLIDCVLSH 
Subjt:  ASGVLDISRTIAI----LFLLAGSTISPVHSS-----ATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQ

Query:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA
        QPAFDHP LKGQ PLDPPERP GHK PRTVTESFQLWSM+GE CPEGTVPIRRTTEE++LRATSFQMFGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGA
Subjt:  QPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGA

Query:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
        KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQ           VSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI
Subjt:  KASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAI

Query:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN
        SPTSS DGGQFDISLLVWKDPKHGNWWLEFG GVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSG HT TEMGSGHFAGEGF KASYFRNL+VVDWDN
Subjt:  SPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDN

Query:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        SLVPLSNLVVLADHPNCYDIQGG+NT+WGNYFYYGGPGRN RCP
Subjt:  SLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)3.1e-18068.91Show/hide
Query:  SRTIAILFLLAGSTISPVHS---SATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQK
        S  + +  LL  S+ S V S   S  NQT  P +ELNKLK I   L +INKP+++TI                S DGD+IDCVL H QPAFDHP+L+GQK
Subjt:  SRTIAILFLLAGSTISPVHS---SATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQK

Query:  PLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD
        PLDPPERPRGH +     +SFQLW M+GETCPEGTVPIRRT EE++LRA S   FG+K+R + RR+T+S+GHEHAVGYV+G+ Y+GAKASINVWAP+V +
Subjt:  PLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDI
        QYEFSLSQ+W+ISGSFG+DLNTIEAGWQ           VSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN++IAIGAAISP+SS  GGQFDI
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDI

Query:  SLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLAD
        +LL+WKDPKHGNWWLEFG G+LVGYWPSFLFTHL++HA+MVQ+GGE+VNSSP G HT T+MGSGHFA EGF K+SYFRN++VVDWDN+LVP  NL VLAD
Subjt:  SLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLAD

Query:  HPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        HPNCYDIQGG N  WG+YFYYGGPG+N +CP
Subjt:  HPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G23340.1 Protein of Unknown Function (DUF239)1.0e-17566.43Show/hide
Query:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPP
        T  +L  L  S  SP +S++      P+ E+ K+K+IR +L++INKPA++TI                S+DGD IDCV SH QPAFDHP L+GQ+P+DPP
Subjt:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPP

Query:  ERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFS
        E P G+ Q     E+FQLWS+ GE+CPEGT+PIRRTTE++MLRA S + FGRK+RR VRR+++S+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFS
Subjt:  ERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFS

Query:  LSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVW
        LSQ+W+I+GSF  DLNTIEAGWQ           +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS  GGQFDISLL+W
Subjt:  LSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVW

Query:  KDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCY
        KDPKHG+WWL+FG G LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P+SNL VLADHPNCY
Subjt:  KDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCY

Query:  DIQGGVNTIWGNYFYYGGPGRNDRCP
        DI+GGVN +WGN+FYYGGPG+N +CP
Subjt:  DIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G23340.2 Protein of Unknown Function (DUF239)1.0e-17566.43Show/hide
Query:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPP
        T  +L  L  S  SP +S++      P+ E+ K+K+IR +L++INKPA++TI                S+DGD IDCV SH QPAFDHP L+GQ+P+DPP
Subjt:  TIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPP

Query:  ERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFS
        E P G+ Q     E+FQLWS+ GE+CPEGT+PIRRTTE++MLRA S + FGRK+RR VRR+++S+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFS
Subjt:  ERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFS

Query:  LSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVW
        LSQ+W+I+GSF  DLNTIEAGWQ           +SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS  GGQFDISLL+W
Subjt:  LSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDISLLVW

Query:  KDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCY
        KDPKHG+WWL+FG G LVGYWP  LFTHL++H  MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P+SNL VLADHPNCY
Subjt:  KDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADHPNCY

Query:  DIQGGVNTIWGNYFYYGGPGRNDRCP
        DI+GGVN +WGN+FYYGGPG+N +CP
Subjt:  DIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G70550.1 Protein of Unknown Function (DUF239)5.8e-17965.92Show/hide
Query:  QRLSMASGVLDISRTIAILFLLA------GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLS
        Q++S  S  +  S  + ++ LL        ST S  +S+A +QT  P+EEL KL +IR  L++INKPAV+TIQ               S+DGD IDCV +
Subjt:  QRLSMASGVLDISRTIAILFLLA------GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLS

Query:  HQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYF
        HQQPAFDHP L+GQKPLDPPE P+G+ +     E+ QLWS+ GE+CPEGT+PIRRTTE++MLRA+S Q FGRK+RR V+R++T++GHEHAVGYVTG  Y+
Subjt:  HQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYF

Query:  GAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGA
        GAKASINVW+PRV  QYEFSLSQ+WVI+GSF  DLNTIEAGWQ           +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGA
Subjt:  GAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGA

Query:  AISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDW
        AISP SS  GGQFDISLL+WKDPKHG+WWL+FG G LVGYWP+FLFTHL+ H +MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDW
Subjt:  AISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDW

Query:  DNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        DN+L+P SNL +LADHPNCYDI+GG N +WGNYFYYGGPG+N RCP
Subjt:  DNSLVPLSNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP

AT1G70550.2 Protein of Unknown Function (DUF239)7.6e-17967.91Show/hide
Query:  RTIAILFLLA---GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKP
        R I +L L++    ST S  +S+A +QT  P+EEL KL +IR  L++INKPAV+TIQ               S+DGD IDCV +HQQPAFDHP L+GQKP
Subjt:  RTIAILFLLA---GSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLITSTLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKP

Query:  LDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQ
        LDPPE P+G+ +     E+ QLWS+ GE+CPEGT+PIRRTTE++MLRA+S Q FGRK+RR V+R++T++GHEHAVGYVTG  Y+GAKASINVW+PRV  Q
Subjt:  LDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQ

Query:  YEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDIS
        YEFSLSQ+WVI+GSF  DLNTIEAGWQ           +SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISP SS  GGQFDIS
Subjt:  YEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSLDGGQFDIS

Query:  LLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADH
        LL+WKDPKHG+WWL+FG G LVGYWP+FLFTHL+ H +MVQFGGE+VN+ P G HT T+MGSGHFAGEGF KASYFRNL++VDWDN+L+P SNL +LADH
Subjt:  LLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPLSNLVVLADH

Query:  PNCYDIQGGVNTIWGNYFYYGGPGRNDRCP
        PNCYDI+GG N +WGNYFYYGGPG+N RCP
Subjt:  PNCYDIQGGVNTIWGNYFYYGGPGRNDRCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGGATTATCCTTTTGCTTTCTCACAACAATGCTCCAAATTTCGTTGCCTTGACAGTTCACAATTCACAATTCACATGCTGGGCTGGGCGCTGCATTTCTTTGATCAAACA
ACTCGGACAGCGCCTAAGCATGGCCTCCGGCGTGTTGGACATCTCCCGAACCATTGCCATTCTCTTTCTTCTTGCAGGCTCCACAATTAGCCCGGTGCACTCGTCTGCCA
CTAACCAAACCTTCCATCCCGAAGAGGAATTAAACAAGTTGAAGATGATAAGAGCTCGCTTGGAGGAGATCAACAAGCCTGCTGTCCGCACAATTCAGGTACTAATCACT
TCCACTTTAAAACTTGCAGCAGGTATATACTCTAGTGCTGACGGGGACCTAATAGATTGTGTTTTATCTCATCAGCAACCAGCATTTGACCATCCTAATTTGAAAGGACA
AAAACCGTTGGATCCGCCGGAGAGACCTCGAGGACATAAGCAGCCTAGGACGGTGACAGAAAGCTTCCAATTATGGAGCATGGATGGGGAAACTTGTCCAGAAGGAACAG
TCCCCATAAGAAGAACAACCGAAGAAGAAATGCTAAGAGCTACCTCTTTTCAGATGTTTGGAAGGAAAGTAAGAAGATGGGTTAGGAGAGAAACAACGAGCGACGGGCAT
GAGCACGCAGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGTATTAACGTGTGGGCACCTCGCGTCGCCGATCAGTATGAATTCAGTTTGTCGCAAAT
GTGGGTCATCTCCGGCTCTTTCGGCGACGATCTGAACACCATTGAAGCTGGTTGGCAGGCATGTCTCTGTCTCAACTCTTTTCTTTTTTTTGTTAGCCCAGAGCTGTACG
GAGACAATTACCCAAGATTCTTCACTTACTGGACTTCGGATGCATACCAAGCAACCGGATGCTACAATTTGCTGTGCTCGGGGTTTGTTCAGACAAATAACAAAATTGCA
ATTGGGGCTGCGATTTCTCCAACCTCATCGTTGGACGGCGGGCAATTTGACATCAGCCTGTTGGTGTGGAAGGACCCGAAGCATGGAAATTGGTGGCTGGAATTCGGAGG
GGGAGTTCTGGTGGGGTACTGGCCGTCGTTCTTGTTCACCCACCTACAGGACCACGCGACGATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCGAGCCCGTCGGGGTTCC
ACACGAGGACGGAGATGGGGAGCGGGCATTTCGCCGGCGAGGGATTCAGAAAGGCTTCGTATTTCCGGAATCTGGAAGTGGTGGATTGGGACAACAGCTTGGTTCCGTTG
TCAAACCTGGTGGTACTGGCGGATCATCCAAACTGCTACGACATTCAAGGCGGGGTCAACACCATTTGGGGTAACTACTTCTACTACGGCGGACCTGGTCGAAACGACAG
GTGTCCC
mRNA sequenceShow/hide mRNA sequence
AGGATTATCCTTTTGCTTTCTCACAACAATGCTCCAAATTTCGTTGCCTTGACAGTTCACAATTCACAATTCACATGCTGGGCTGGGCGCTGCATTTCTTTGATCAAACA
ACTCGGACAGCGCCTAAGCATGGCCTCCGGCGTGTTGGACATCTCCCGAACCATTGCCATTCTCTTTCTTCTTGCAGGCTCCACAATTAGCCCGGTGCACTCGTCTGCCA
CTAACCAAACCTTCCATCCCGAAGAGGAATTAAACAAGTTGAAGATGATAAGAGCTCGCTTGGAGGAGATCAACAAGCCTGCTGTCCGCACAATTCAGGTACTAATCACT
TCCACTTTAAAACTTGCAGCAGGTATATACTCTAGTGCTGACGGGGACCTAATAGATTGTGTTTTATCTCATCAGCAACCAGCATTTGACCATCCTAATTTGAAAGGACA
AAAACCGTTGGATCCGCCGGAGAGACCTCGAGGACATAAGCAGCCTAGGACGGTGACAGAAAGCTTCCAATTATGGAGCATGGATGGGGAAACTTGTCCAGAAGGAACAG
TCCCCATAAGAAGAACAACCGAAGAAGAAATGCTAAGAGCTACCTCTTTTCAGATGTTTGGAAGGAAAGTAAGAAGATGGGTTAGGAGAGAAACAACGAGCGACGGGCAT
GAGCACGCAGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGTATTAACGTGTGGGCACCTCGCGTCGCCGATCAGTATGAATTCAGTTTGTCGCAAAT
GTGGGTCATCTCCGGCTCTTTCGGCGACGATCTGAACACCATTGAAGCTGGTTGGCAGGCATGTCTCTGTCTCAACTCTTTTCTTTTTTTTGTTAGCCCAGAGCTGTACG
GAGACAATTACCCAAGATTCTTCACTTACTGGACTTCGGATGCATACCAAGCAACCGGATGCTACAATTTGCTGTGCTCGGGGTTTGTTCAGACAAATAACAAAATTGCA
ATTGGGGCTGCGATTTCTCCAACCTCATCGTTGGACGGCGGGCAATTTGACATCAGCCTGTTGGTGTGGAAGGACCCGAAGCATGGAAATTGGTGGCTGGAATTCGGAGG
GGGAGTTCTGGTGGGGTACTGGCCGTCGTTCTTGTTCACCCACCTACAGGACCACGCGACGATGGTGCAGTTCGGGGGAGAGGTGGTGAATTCGAGCCCGTCGGGGTTCC
ACACGAGGACGGAGATGGGGAGCGGGCATTTCGCCGGCGAGGGATTCAGAAAGGCTTCGTATTTCCGGAATCTGGAAGTGGTGGATTGGGACAACAGCTTGGTTCCGTTG
TCAAACCTGGTGGTACTGGCGGATCATCCAAACTGCTACGACATTCAAGGCGGGGTCAACACCATTTGGGGTAACTACTTCTACTACGGCGGACCTGGTCGAAACGACAG
GTGTCCC
Protein sequenceShow/hide protein sequence
RIILLLSHNNAPNFVALTVHNSQFTCWAGRCISLIKQLGQRLSMASGVLDISRTIAILFLLAGSTISPVHSSATNQTFHPEEELNKLKMIRARLEEINKPAVRTIQVLIT
STLKLAAGIYSSADGDLIDCVLSHQQPAFDHPNLKGQKPLDPPERPRGHKQPRTVTESFQLWSMDGETCPEGTVPIRRTTEEEMLRATSFQMFGRKVRRWVRRETTSDGH
EHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGWQACLCLNSFLFFVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIA
IGAAISPTSSLDGGQFDISLLVWKDPKHGNWWLEFGGGVLVGYWPSFLFTHLQDHATMVQFGGEVVNSSPSGFHTRTEMGSGHFAGEGFRKASYFRNLEVVDWDNSLVPL
SNLVVLADHPNCYDIQGGVNTIWGNYFYYGGPGRNDRCP