; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005258 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005258
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr6:12835097..12837068
RNA-Seq ExpressionLag0005258
SyntenyLag0005258
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131433.1 uncharacterized protein LOC111004647 [Momordica charantia]1.3e-22490.07Show/hide
Query:  MASASFNISPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPER
        MAS   +IS TIAI   L LL  S I+P+HS ATNQTFHP+EELNKLKMIRA LE INKPAV+TIQS DGD+IDCVLS QQPAFDHP LKGQKPLDPPER
Subjt:  MASASFNISPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPER

Query:  PQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS
        P+GHK PRTVTE+FQLWSMDGETCPEGTVPIRRT EE+MLRATSFQ FGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS
Subjt:  PQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS

Query:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFG
        QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQ+DISLLVWKDPKHGNWWLEFG
Subjt:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFG

Query:  SGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNY
         GVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHT TEMGSGHFAGE FRKASYFRNL+VVDWDNSLVPLSNL+VLADHPNCYDIQGG+N +WGNY
Subjt:  SGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNY

Query:  FYYGGPGRNDRCP
        FYYGGPGRNDRCP
Subjt:  FYYGGPGRNDRCP

XP_022961843.1 uncharacterized protein LOC111462488 [Cucurbita moschata]1.8e-22690.91Show/hide
Query:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL
        +SA FNIS  I I  LLLLLL+ S ITPLHS     +ATNQTFHPQ+ELNKLKMIRA L+NINKPA+ TIQSPDGD+IDCVLS  QPAFDHPKLKGQ PL
Subjt:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL

Query:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP+GHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EEDMLRATSFQ FGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQ+DISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN
        WLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGIN 
Subjt:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN

Query:  VWGNYFYYGGPGRNDRCP
        VWGNYFYYGGPGRN RCP
Subjt:  VWGNYFYYGGPGRNDRCP

XP_022996673.1 uncharacterized protein LOC111491848 [Cucurbita maxima]2.2e-22490.19Show/hide
Query:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL
        +SA FNIS  I I  LLLLLL+ S ITPLHS     +ATNQTFHPQ+ELNKLKMIRA L+ INKPA+ TIQSPDGD+IDCVLS  QPAFDHPKLKGQ PL
Subjt:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL

Query:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP+GHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EED+LRATSFQ FGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQ+DISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN
        WLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSG HTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGIN 
Subjt:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN

Query:  VWGNYFYYGGPGRNDRCP
        VWGNYFYYGGPGRN RCP
Subjt:  VWGNYFYYGGPGRNDRCP

XP_023546698.1 uncharacterized protein LOC111805725 [Cucurbita pepo subsp. pepo]2.3e-22690.67Show/hide
Query:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL
        +SA FNIS  I +  LLLLLL+ S ITPLHS     +ATNQTFHPQ+ELNKLKMIRA L+NINKPA+ TIQSPDGD+IDCVLS  QPAFDHPKLKGQ PL
Subjt:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL

Query:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP+GHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EEDMLRATSFQ FGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQ+DISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN
        WLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGIN 
Subjt:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN

Query:  VWGNYFYYGGPGRNDRCP
        VWGNYFYYGGPGRN RCP
Subjt:  VWGNYFYYGGPGRNDRCP

XP_038885684.1 uncharacterized protein LOC120075988 [Benincasa hispida]1.2e-22289.76Show/hide
Query:  MASA--SFNISPTIAIPLLLL-LLVASKITPLHS---YATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQK
        MASA    NIS TI I +LL  LL  S ITP+HS   +ATNQT FHPQEELNKL MIRAHL+ INKPA+ TIQSPDGD+IDCVLS QQPAFDHPKL+GQ 
Subjt:  MASA--SFNISPTIAIPLLLL-LLVASKITPLHS---YATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQK

Query:  PLDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD
        PLDPPERPQGHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EEDMLRATSFQ FG+KV RWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD
Subjt:  PLDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSF+GGQ+DISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGI
        NWWLEFGSGVLVGYWPSFLFTHL++HATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGI
Subjt:  NWWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGI

Query:  NNVWGNYFYYGGPGRNDRCP
        N VWGNYFYYGGPGRNDRCP
Subjt:  NNVWGNYFYYGGPGRNDRCP

TrEMBL top hitse value%identityAlignment
A0A0A0LKI2 Uncharacterized protein1.7e-22289.02Show/hide
Query:  MASA--SFNISPTIAIPLLLLLLVASKITPLHSY---ATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKP
        MASA    NI+PTI I +LL  L+AS ITP+HS     TNQT FHPQ+ELNKLKMIRAHL+ INKPA+ TIQSPDGDIIDCVLS  QPAFDHPKL+GQKP
Subjt:  MASA--SFNISPTIAIPLLLLLLVASKITPLHSY---ATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKP

Query:  LDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQ
        LDPPERPQGHKPPRT TE+FQLWS  GE CPEGTVPIRRT EED+LRATSFQ FGRKVR+WVRRETSSDGHEHAVGYVTG+HYFGAKASINVWAPRVADQ
Subjt:  LDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQ

Query:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGN
        YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSF+GGQ+DISLLVWKDPKHGN
Subjt:  YEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGN

Query:  WWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGIN
        WWLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSG HTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDI+GGIN
Subjt:  WWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGIN

Query:  NVWGNYFYYGGPGRNDRCP
         VWGNYFYYGGPGRNDRCP
Subjt:  NVWGNYFYYGGPGRNDRCP

A0A1S3BA82 LOW QUALITY PROTEIN: uncharacterized protein LOC1034877146.3e-22289.05Show/hide
Query:  MASA--SFNISPTIAIPLLLLLLVASKITPLHSY----ATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQK
        MASA    NI+PTI I  +L  L+AS ITP+HS     +TNQT FHPQ+ELNKLKMIRAHL+ INKPAV TIQSPDGDIIDCVLS  QPAFDHPKL+GQ 
Subjt:  MASA--SFNISPTIAIPLLLLLLVASKITPLHSY----ATNQT-FHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQK

Query:  PLDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD
        PLDPPERPQGHKPPRT TE+FQLWSM GE CPEGTVPIRRT EEDMLRATSFQ FG+KVR+WVRRETSSDGHEHAVGYVTG+HYFGAKASINVWAPRVAD
Subjt:  PLDPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVAD

Query:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHG
        QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSF+GGQ+DISLLVWKDPKHG
Subjt:  QYEFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHG

Query:  NWWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGI
        NWWLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDI+GGI
Subjt:  NWWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGI

Query:  NNVWGNYFYYGGPGRNDRCP
        N VWGNYFYYGGPGRNDRCP
Subjt:  NNVWGNYFYYGGPGRNDRCP

A0A6J1BTC9 uncharacterized protein LOC1110046476.1e-22590.07Show/hide
Query:  MASASFNISPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPER
        MAS   +IS TIAI   L LL  S I+P+HS ATNQTFHP+EELNKLKMIRA LE INKPAV+TIQS DGD+IDCVLS QQPAFDHP LKGQKPLDPPER
Subjt:  MASASFNISPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPER

Query:  PQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS
        P+GHK PRTVTE+FQLWSMDGETCPEGTVPIRRT EE+MLRATSFQ FGRKVRRWVRRET+SDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS
Subjt:  PQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLS

Query:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFG
        QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSS DGGQ+DISLLVWKDPKHGNWWLEFG
Subjt:  QMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFG

Query:  SGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNY
         GVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHT TEMGSGHFAGE FRKASYFRNL+VVDWDNSLVPLSNL+VLADHPNCYDIQGG+N +WGNY
Subjt:  SGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNY

Query:  FYYGGPGRNDRCP
        FYYGGPGRNDRCP
Subjt:  FYYGGPGRNDRCP

A0A6J1HCZ9 uncharacterized protein LOC1114624888.5e-22790.91Show/hide
Query:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL
        +SA FNIS  I I  LLLLLL+ S ITPLHS     +ATNQTFHPQ+ELNKLKMIRA L+NINKPA+ TIQSPDGD+IDCVLS  QPAFDHPKLKGQ PL
Subjt:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL

Query:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP+GHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EEDMLRATSFQ FGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQ+DISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN
        WLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGIN 
Subjt:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN

Query:  VWGNYFYYGGPGRNDRCP
        VWGNYFYYGGPGRN RCP
Subjt:  VWGNYFYYGGPGRNDRCP

A0A6J1KBP1 uncharacterized protein LOC1114918481.0e-22490.19Show/hide
Query:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL
        +SA FNIS  I I  LLLLLL+ S ITPLHS     +ATNQTFHPQ+ELNKLKMIRA L+ INKPA+ TIQSPDGD+IDCVLS  QPAFDHPKLKGQ PL
Subjt:  ASASFNISPTIAI-PLLLLLLVASKITPLHS-----YATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPL

Query:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
        DPPERP+GHKPPRTVTE+FQLWSM+GE CPEGTVPIRRT EED+LRATSFQ FGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY
Subjt:  DPPERPQGHKPPRTVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQY

Query:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW
        EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQ+DISLLVWKDPKHGNW
Subjt:  EFSLSQMWVISGSFGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNW

Query:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN
        WLEFGSGVLVGYWPSFLFTHLQ+HATMVQFGGEVVNSSPSG HTTTEMGSGHFAGE F KASYFRNLQVVDWDNSLVPLSNL+VLADHPNCYDIQGGIN 
Subjt:  WLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINN

Query:  VWGNYFYYGGPGRNDRCP
        VWGNYFYYGGPGRN RCP
Subjt:  VWGNYFYYGGPGRNDRCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)3.2e-18673.58Show/hide
Query:  SPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPR
        S  + + LLLL    S +   +    NQT  P +ELNKLK I  HL  INKP+++TI SPDGDIIDCVL   QPAFDHP L+GQKPLDPPERP+GH    
Subjt:  SPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPR

Query:  TVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGS
           ++FQLW M+GETCPEGTVPIRRT+EED+LRA S   FG+K+R + RR+TSS+GHEHAVGYV+G+ Y+GAKASINVWAP+V +QYEFSLSQ+W+ISGS
Subjt:  TVTENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGS

Query:  FGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYW
        FG+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN++IAIGAAISP+SS+ GGQ+DI+LL+WKDPKHGNWWLEFGSG+LVGYW
Subjt:  FGDDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYW

Query:  PSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGR
        PSFLFTHL+EHA+MVQ+GGE+VNSSP G HT+T+MGSGHFA E F K+SYFRN+QVVDWDN+LVP  NL VLADHPNCYDIQGG N  WG+YFYYGGPG+
Subjt:  PSFLFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGR

Query:  NDRCP
        N +CP
Subjt:  NDRCP

AT1G23340.1 Protein of Unknown Function (DUF239)2.0e-18071.61Show/hide
Query:  LLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVTENFQ
        +LLL L +S  +P +S +      PQ E+ K+K+IR  L+ INKPA++TI S DGD IDCV S  QPAFDHP L+GQ+P+DPPE P G+       ENFQ
Subjt:  LLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVTENFQ

Query:  LWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNT
        LWS+ GE+CPEGT+PIRRT E+DMLRA S ++FGRK+RR VRR++SS+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF  DLNT
Subjt:  LWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNT

Query:  IEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTH
        IEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS+ GGQ+DISLL+WKDPKHG+WWL+FGSG LVGYWP  LFTH
Subjt:  IEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTH

Query:  LQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDRCP
        L+EH  MVQFGGE+VN+ P G HT+T+MGSGHFAGE F KASYFRNLQ+VDWDN+L+P+SNL VLADHPNCYDI+GG+N VWGN+FYYGGPG+N +CP
Subjt:  LQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDRCP

AT1G23340.2 Protein of Unknown Function (DUF239)2.0e-18071.61Show/hide
Query:  LLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVTENFQ
        +LLL L +S  +P +S +      PQ E+ K+K+IR  L+ INKPA++TI S DGD IDCV S  QPAFDHP L+GQ+P+DPPE P G+       ENFQ
Subjt:  LLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVTENFQ

Query:  LWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNT
        LWS+ GE+CPEGT+PIRRT E+DMLRA S ++FGRK+RR VRR++SS+GHEHAVGYV+G  Y+GAKASINVW PRV  QYEFSLSQ+W+I+GSF  DLNT
Subjt:  LWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNT

Query:  IEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTH
        IEAGWQ+SPELYGD  PRFFTYWTSDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SS+ GGQ+DISLL+WKDPKHG+WWL+FGSG LVGYWP  LFTH
Subjt:  IEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTH

Query:  LQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDRCP
        L+EH  MVQFGGE+VN+ P G HT+T+MGSGHFAGE F KASYFRNLQ+VDWDN+L+P+SNL VLADHPNCYDI+GG+N VWGN+FYYGGPG+N +CP
Subjt:  LQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDRCP

AT1G70550.1 Protein of Unknown Function (DUF239)9.7e-18372.89Show/hide
Query:  LLLLLLVASKITPL----HSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVT
        +LLL LV+S  +      +S A +QT  PQEEL KL +IR  L+ INKPAV+TIQS DGD IDCV + QQPAFDHP L+GQKPLDPPE P+G+       
Subjt:  LLLLLLVASKITPL----HSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVT

Query:  ENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGD
        EN QLWS+ GE+CPEGT+PIRRT E+DMLRA+S Q+FGRK+RR V+R+++++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WVI+GSF  
Subjt:  ENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGD

Query:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSF
        DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISP SS+ GGQ+DISLL+WKDPKHG+WWL+FGSG LVGYWP+F
Subjt:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSF

Query:  LFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDR
        LFTHL++H +MVQFGGE+VN+ P G HTTT+MGSGHFAGE F KASYFRNLQ+VDWDN+L+P SNL +LADHPNCYDI+GG N VWGNYFYYGGPG+N R
Subjt:  LFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDR

Query:  CP
        CP
Subjt:  CP

AT1G70550.2 Protein of Unknown Function (DUF239)9.7e-18372.89Show/hide
Query:  LLLLLLVASKITPL----HSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVT
        +LLL LV+S  +      +S A +QT  PQEEL KL +IR  L+ INKPAV+TIQS DGD IDCV + QQPAFDHP L+GQKPLDPPE P+G+       
Subjt:  LLLLLLVASKITPL----HSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTVT

Query:  ENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGD
        EN QLWS+ GE+CPEGT+PIRRT E+DMLRA+S Q+FGRK+RR V+R+++++GHEHAVGYVTG  Y+GAKASINVW+PRV  QYEFSLSQ+WVI+GSF  
Subjt:  ENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGD

Query:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSF
        DLNTIEAGWQ+SPELYGD YPRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISP SS+ GGQ+DISLL+WKDPKHG+WWL+FGSG LVGYWP+F
Subjt:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSF

Query:  LFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDR
        LFTHL++H +MVQFGGE+VN+ P G HTTT+MGSGHFAGE F KASYFRNLQ+VDWDN+L+P SNL +LADHPNCYDI+GG N VWGNYFYYGGPG+N R
Subjt:  LFTHLQEHATMVQFGGEVVNSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDR

Query:  CP
        CP
Subjt:  CP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCGCCAGCTTCAACATCTCCCCAACCATTGCCATTCCTCTGCTGCTTTTGCTTCTTGTTGCTTCTAAAATAACTCCTCTGCACTCGTATGCCACGAACCAGAC
TTTCCATCCCCAGGAGGAATTAAACAAGTTGAAGATGATAAGGGCTCATTTGGAGAACATCAACAAGCCTGCTGTCCAGACAATTCAGAGTCCTGATGGGGACATCATAG
ATTGTGTTTTATCTGATCAGCAACCGGCATTTGACCATCCAAAGTTGAAAGGGCAAAAACCGTTGGATCCGCCAGAGAGGCCACAAGGACACAAGCCGCCTAGGACGGTG
ACAGAGAACTTCCAATTGTGGAGCATGGATGGGGAAACTTGCCCAGAAGGAACAGTTCCAATAAGAAGAACAAGGGAGGAAGACATGCTAAGAGCTACCTCTTTCCAGAA
GTTTGGAAGAAAAGTAAGAAGATGGGTTAGGAGAGAAACAAGCAGCGACGGACATGAGCACGCTGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGCA
TCAATGTGTGGGCACCTCGAGTCGCCGATCAGTACGAATTTAGCTTATCGCAAATGTGGGTCATCTCAGGCTCTTTCGGCGACGATCTCAACACCATTGAAGCCGGTTGG
CAGGTTAGTCCAGAACTGTACGGAGACAATTACCCAAGATTCTTTACTTACTGGACCTCGGATGCATACCAAGCAACCGGGTGTTACAATTTGCTTTGCTCGGGGTTCGT
TCAAACAAACAACAAAATTGCAATCGGAGCTGCAATTTCTCCAACGTCGTCGTTCGACGGCGGTCAATACGATATCAGCTTGTTGGTTTGGAAGGATCCGAAGCATGGAA
ATTGGTGGCTGGAATTCGGATCGGGAGTTTTGGTGGGATACTGGCCGTCGTTTCTGTTCACCCATTTACAAGAGCATGCAACAATGGTGCAATTCGGGGGAGAGGTGGTG
AATTCCAGTCCGTCGGGATTCCACACGACGACGGAGATGGGGAGTGGGCATTTCGCCGGCGAGGAATTCAGAAAAGCTTCGTATTTTCGGAACCTGCAAGTGGTGGATTG
GGACAACAGCTTAGTTCCATTGTCGAACCTCATCGTACTGGCGGATCATCCCAACTGCTACGACATTCAAGGTGGGATCAACAACGTTTGGGGCAACTACTTTTACTACG
GTGGACCTGGTCGAAACGACAGATGTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCCGCCAGCTTCAACATCTCCCCAACCATTGCCATTCCTCTGCTGCTTTTGCTTCTTGTTGCTTCTAAAATAACTCCTCTGCACTCGTATGCCACGAACCAGAC
TTTCCATCCCCAGGAGGAATTAAACAAGTTGAAGATGATAAGGGCTCATTTGGAGAACATCAACAAGCCTGCTGTCCAGACAATTCAGAGTCCTGATGGGGACATCATAG
ATTGTGTTTTATCTGATCAGCAACCGGCATTTGACCATCCAAAGTTGAAAGGGCAAAAACCGTTGGATCCGCCAGAGAGGCCACAAGGACACAAGCCGCCTAGGACGGTG
ACAGAGAACTTCCAATTGTGGAGCATGGATGGGGAAACTTGCCCAGAAGGAACAGTTCCAATAAGAAGAACAAGGGAGGAAGACATGCTAAGAGCTACCTCTTTCCAGAA
GTTTGGAAGAAAAGTAAGAAGATGGGTTAGGAGAGAAACAAGCAGCGACGGACATGAGCACGCTGTGGGGTATGTGACCGGCGATCACTACTTCGGAGCAAAGGCAAGCA
TCAATGTGTGGGCACCTCGAGTCGCCGATCAGTACGAATTTAGCTTATCGCAAATGTGGGTCATCTCAGGCTCTTTCGGCGACGATCTCAACACCATTGAAGCCGGTTGG
CAGGTTAGTCCAGAACTGTACGGAGACAATTACCCAAGATTCTTTACTTACTGGACCTCGGATGCATACCAAGCAACCGGGTGTTACAATTTGCTTTGCTCGGGGTTCGT
TCAAACAAACAACAAAATTGCAATCGGAGCTGCAATTTCTCCAACGTCGTCGTTCGACGGCGGTCAATACGATATCAGCTTGTTGGTTTGGAAGGATCCGAAGCATGGAA
ATTGGTGGCTGGAATTCGGATCGGGAGTTTTGGTGGGATACTGGCCGTCGTTTCTGTTCACCCATTTACAAGAGCATGCAACAATGGTGCAATTCGGGGGAGAGGTGGTG
AATTCCAGTCCGTCGGGATTCCACACGACGACGGAGATGGGGAGTGGGCATTTCGCCGGCGAGGAATTCAGAAAAGCTTCGTATTTTCGGAACCTGCAAGTGGTGGATTG
GGACAACAGCTTAGTTCCATTGTCGAACCTCATCGTACTGGCGGATCATCCCAACTGCTACGACATTCAAGGTGGGATCAACAACGTTTGGGGCAACTACTTTTACTACG
GTGGACCTGGTCGAAACGACAGATGTCCTTGA
Protein sequenceShow/hide protein sequence
MASASFNISPTIAIPLLLLLLVASKITPLHSYATNQTFHPQEELNKLKMIRAHLENINKPAVQTIQSPDGDIIDCVLSDQQPAFDHPKLKGQKPLDPPERPQGHKPPRTV
TENFQLWSMDGETCPEGTVPIRRTREEDMLRATSFQKFGRKVRRWVRRETSSDGHEHAVGYVTGDHYFGAKASINVWAPRVADQYEFSLSQMWVISGSFGDDLNTIEAGW
QVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPTSSFDGGQYDISLLVWKDPKHGNWWLEFGSGVLVGYWPSFLFTHLQEHATMVQFGGEVV
NSSPSGFHTTTEMGSGHFAGEEFRKASYFRNLQVVDWDNSLVPLSNLIVLADHPNCYDIQGGINNVWGNYFYYGGPGRNDRCP