; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027000 (gene) of Chayote v1 genome

Gene IDSed0027000
OrganismSechium edule (Chayote v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG08:37979656..37984125
RNA-Seq ExpressionSed0027000
SyntenySed0027000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]3.4e-22588.36Show/hide
Query:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ
        MASS SSSS  SCSCFVV LLVFTS +SVFSTSI+H+  LKNQT FHP KEL+KL HIRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG 
Subjt:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ

Query:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS
        SPLEPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+
Subjt:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS

Query:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH
         +YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY GKQFDIGLMVW DPKH
Subjt:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH

Query:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA
        GHWWLEYGSGMLVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ 
Subjt:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA

Query:  SNHVWGTYFYYGGPGRNVKCP
        SN+VWGTYFYYGGPGR V+CP
Subjt:  SNHVWGTYFYYGGPGRNVKCP

XP_022954586.1 uncharacterized protein LOC111456810 [Cucurbita moschata]6.4e-22487.89Show/hide
Query:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ
        MASS SSSS  SCSCFVV LLVFTS +SVFSTSI+H+  LKNQT FHP KEL+KL HIRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG 
Subjt:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ

Query:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS
        SPLEPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+
Subjt:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS

Query:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH
         +YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ N+RIAIGAAISP+SSY GKQFDIGLMVW DPKH
Subjt:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH

Query:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA
        GHWWLEYGSG LVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ 
Subjt:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA

Query:  SNHVWGTYFYYGGPGRNVKCP
        SN+VWGTYFYYGGPGR V+CP
Subjt:  SNHVWGTYFYYGGPGRNVKCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]8.3e-22488.07Show/hide
Query:  SSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSP
        +SS SSS  SCSCFVVFLLVFTS +SVFSTSI+H+  LKNQT FHP KEL+KL +IRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG SP
Subjt:  SSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSP

Query:  LEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAE
        LEPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+ +
Subjt:  LEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAE

Query:  YEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY GKQFDIGLMVW DPKHGH
Subjt:  YEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGH

Query:  WWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASN
        WWLEYGSGMLVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ SN
Subjt:  WWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASN

Query:  HVWGTYFYYGGPGRNVKCP
        +VWGTYFYYGGPGR V+CP
Subjt:  HVWGTYFYYGGPGRNVKCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]2.7e-22287.8Show/hide
Query:  SSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPL
        SSSSSS  SCSCFVV LLVFTS +S FSTSI+H+  LKNQT FHP KEL+KL +IRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG SPL
Subjt:  SSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPL

Query:  EPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEY
        EPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+ +Y
Subjt:  EPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEY

Query:  EFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHW
        EFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY GKQFDIGLMVW DPKHGHW
Subjt:  EFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHW

Query:  WLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNH
        WLEYGSG LVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ SN+
Subjt:  WLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNH

Query:  VWGTYFYYGGPGRNVKCP
        VWGTYFYYGGPGR V+CP
Subjt:  VWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]8.3e-22488.86Show/hide
Query:  MASSSSSSSFCSCSCFVVFLLVF-TSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKG
        MASSSSSSS CSCSCFVVFLLVF TSFSSVFS+SISH+   KNQTFFHP KEL+KL HIR YL KINKPPIKTI+SSDGD+IDCVLSHLQPAFDHPELKG
Subjt:  MASSSSSSSFCSCSCFVVFLLVF-TSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKG

Query:  QSPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV
         +PLEPPERP+GN S EE+AEN QLWS SG+FCPEGTIPIRRTTE+DIFRA+S +RFGRKPIRR RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV
Subjt:  QSPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV

Query:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK
        + +YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISPISSY GKQFDIGLMVW DPK
Subjt:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK

Query:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQ
        HGHWWLEYGSG+LVGYWP+FLFSHLRSHASMVQFGGEIVN RSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL LLADHS+CYDIRQ
Subjt:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQ

Query:  ASNHVWGTYFYYGGPGRNVKCP
        A+N+VWGTYFYYGGPGRNVKCP
Subjt:  ASNHVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein1.0e-21485.34Show/hide
Query:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ
        MASSSSSSS  SCSCFVV LLVFTSFSSV S+SISH+   KNQT FHP KEL+KL HIR YL KINKPPIK IQSSDGD+IDCVLSHLQPAFDHP+LKG 
Subjt:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ

Query:  SPLEPPERPKGN-KSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV
        SPLEPPERP+GN  S EE  EN QLWS SGEFCPEGTIPIRRTTEKDI+RA+S +R+GRKPI+  +RDS+G+GHEHAV++VNGEQYYGAKASLNIWAPRV
Subjt:  SPLEPPERPKGN-KSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV

Query:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK
        + +YEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISPISSY GKQFDIGLMVW DPK
Subjt:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK

Query:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIR
        HGHWWLEYGSG+LVGYWP+FLFSHLRSHASMVQFGGE+VN RS SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL++LADHS+CYDIR
Subjt:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIR

Query:  QASNHVWGTYFYYGGPGRNVKCP
        Q +N+VWGTYFYYGGPGRNVKCP
Subjt:  QASNHVWGTYFYYGGPGRNVKCP

A0A1S3AXP9 uncharacterized protein LOC1034837232.1e-21785.51Show/hide
Query:  MASSSSSSS-----FCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHP
        MASSSSSSS      CSCSCFVV LLVFTSFSSVFS+SISH+   KNQT FHP +EL+KL HIR YL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHP
Subjt:  MASSSSSSS-----FCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHP

Query:  ELKGQSPLEPPERPKG-NKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRR-TRRDSAGDGHEHAVMFVNGEQYYGAKASLN
        +LKG +PLEPPERP+G N S+EE  EN QLWS SGEFCPEGTIPIRRTTEKDI+RA+S +R+GRKPIRR  RRDS+G+GHEHAV++VNGEQYYGAKASLN
Subjt:  ELKGQSPLEPPERPKG-NKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRR-TRRDSAGDGHEHAVMFVNGEQYYGAKASLN

Query:  IWAPRVSAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLM
        IWAPRV+ +YEFS+SQIW+ISGSF NDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISPISSY GKQFDIGLM
Subjt:  IWAPRVSAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLM

Query:  VWMDPKHGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSE
        VW DPKHGHWWLEYGSG+LVGYWP+FLFSHLRSHASMVQFGGE+VN RSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLK+LADHS+
Subjt:  VWMDPKHGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSE

Query:  CYDIRQASNHVWGTYFYYGGPGRNVKCP
        CYDIRQ +N VWGTYFYYGGPGRNVKCP
Subjt:  CYDIRQASNHVWGTYFYYGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568103.1e-22487.89Show/hide
Query:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ
        MASS SSSS  SCSCFVV LLVFTS +SVFSTSI+H+  LKNQT FHP KEL+KL HIRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG 
Subjt:  MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQ

Query:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS
        SPLEPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+
Subjt:  SPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVS

Query:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH
         +YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ N+RIAIGAAISP+SSY GKQFDIGLMVW DPKH
Subjt:  AEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKH

Query:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA
        GHWWLEYGSG LVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ 
Subjt:  GHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQA

Query:  SNHVWGTYFYYGGPGRNVKCP
        SN+VWGTYFYYGGPGR V+CP
Subjt:  SNHVWGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902774.0e-22488.07Show/hide
Query:  SSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSP
        +SS SSS  SCSCFVVFLLVFTS +SVFSTSI+H+  LKNQT FHP KEL+KL +IRAYL KINKPPIKTIQSSDGD+IDCVLSHLQPAFDHPELKG SP
Subjt:  SSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSP

Query:  LEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAE
        LEPPERP+ NKSMEE+A+N QLWS SGEFCPEGTIPIRRTTEKDIFRANS++RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+ +
Subjt:  LEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAE

Query:  YEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGH
        YEFSLSQIW+ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY GKQFDIGLMVW DPKHGH
Subjt:  YEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGH

Query:  WWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASN
        WWLEYGSGMLVGYWP+FLFSHLRSHASMVQFGGE+VNRR+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ SN
Subjt:  WWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASN

Query:  HVWGTYFYYGGPGRNVKCP
        +VWGTYFYYGGPGR V+CP
Subjt:  HVWGTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950541.4e-21384.88Show/hide
Query:  SCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEPPERPKG
        S SCFVV LLVFTSFSSVF TSI+HK   KNQT+FHP+KEL +L HIRAYL KINKPP KTIQSSDGD+IDCVLSHLQPAFDHP LKG +PL PPERP+G
Subjt:  SCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEPPERPKG

Query:  NKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEFSLSQIW
        N S EE+AEN QLWS SG+FCPEGTIPIRRTTE+DI+RA+S +RFGRKPIR  RRDS+G+GHEHAV+FVNGEQYYGAKASLNIWAPRV+ + EFSLSQIW
Subjt:  NKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEFSLSQIW

Query:  LISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWLEYGSGM
        +ISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY GKQFD+G+MVW DPKHGHWWLEYGSG+
Subjt:  LISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWLEYGSGM

Query:  LVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVWGTYFYY
        LVGYWP+FLFSHLRSH SMVQFGGEIVN R SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNL +LADHS+CYDIRQ +N VWGTYFYY
Subjt:  LVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVWGTYFYY

Query:  GGPGRNVKCP
        GGPGRNVKCP
Subjt:  GGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G23340.1 Protein of Unknown Function (DUF239)3.8e-17469.47Show/hide
Query:  SSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEP
        SSSS C    F++ L +F+S++S  S S S    L+      P +E++K+  IR  L KINKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++P
Subjt:  SSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEP

Query:  PERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEF
        PE P G     E  EN QLWS+ GE CPEGTIPIRRTTE+D+ RANSV+RFGRK IRR RRDS+ +GHEHAV +V+G QYYGAKAS+N+W PRV ++YEF
Subjt:  PERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEF

Query:  SLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY G QFDI L++W DPKHGHWWL
Subjt:  SLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWL

Query:  EYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NLK+LADH  CYDIR   N VW
Subjt:  EYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)3.8e-17469.47Show/hide
Query:  SSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEP
        SSSS C    F++ L +F+S++S  S S S    L+      P +E++K+  IR  L KINKP IKTI SSDGD IDCV SH QPAFDHP L+GQ P++P
Subjt:  SSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEP

Query:  PERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEF
        PE P G     E  EN QLWS+ GE CPEGTIPIRRTTE+D+ RANSV+RFGRK IRR RRDS+ +GHEHAV +V+G QYYGAKAS+N+W PRV ++YEF
Subjt:  PERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEF

Query:  SLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWL
        SLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD NPRFFTYWT+DAY ATGCYNLLCSGFVQ NNRIAIGAAISP+SSY G QFDI L++W DPKHGHWWL
Subjt:  SLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWL

Query:  EYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVW
        ++GSG LVGYWP  LF+HLR H +MVQFGGEIVN R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NLK+LADH  CYDIR   N VW
Subjt:  EYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVW

Query:  GTYFYYGGPGRNVKCP
        G +FYYGGPG+N KCP
Subjt:  GTYFYYGGPGRNVKCP

AT1G70550.1 Protein of Unknown Function (DUF239)1.2e-17268.25Show/hide
Query:  SSSSSSFCS----CSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKG
        SS   SF S     S F+  +L+    SS FS++ S   +        P +EL+KL  IR  L KINKP +KTIQSSDGD IDCV +H QPAFDHP L+G
Subjt:  SSSSSSFCS----CSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKG

Query:  QSPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV
        Q PL+PPE PKG    +   EN QLWS+SGE CPEGTIPIRRTTE+D+ RA+SVQRFGRK IRR +RDS  +GHEHAV +V G QYYGAKAS+N+W+PRV
Subjt:  QSPLEPPERPKGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRV

Query:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK
        +++YEFSLSQIW+I+GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY  TGCYNLLCSGFVQ N RIAIGAAISP SSY G QFDI L++W DPK
Subjt:  SAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPK

Query:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQ
        HGHWWL++GSG LVGYWP+FLF+HL+ H SMVQFGGEIVN R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NLK+LADH  CYDIR 
Subjt:  HGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQ

Query:  ASNHVWGTYFYYGGPGRNVKCP
         +N VWG YFYYGGPG+N +CP
Subjt:  ASNHVWGTYFYYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.1e-17369.36Show/hide
Query:  SCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEPPERPKGNK
        S F+  +L+    SS FS++ S   +        P +EL+KL  IR  L KINKP +KTIQSSDGD IDCV +H QPAFDHP L+GQ PL+PPE PKG  
Subjt:  SCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEPPERPKGNK

Query:  SMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEFSLSQIWLI
          +   EN QLWS+SGE CPEGTIPIRRTTE+D+ RA+SVQRFGRK IRR +RDS  +GHEHAV +V G QYYGAKAS+N+W+PRV+++YEFSLSQIW+I
Subjt:  SMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEFSLSQIWLI

Query:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWLEYGSGMLV
        +GSF +DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAY  TGCYNLLCSGFVQ N RIAIGAAISP SSY G QFDI L++W DPKHGHWWL++GSG LV
Subjt:  SGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWLEYGSGMLV

Query:  GYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVWGTYFYYGG
        GYWP+FLF+HL+ H SMVQFGGEIVN R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NLK+LADH  CYDIR  +N VWG YFYYGG
Subjt:  GYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVWGTYFYYGG

Query:  PGRNVKCP
        PG+N +CP
Subjt:  PGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)4.8e-18574.06Show/hide
Query:  MASSSSSSSFCS--CSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELK
        MASSSSSSS  S   S F+  +L+ +    +   S  H   LKNQT F PN+E++KL  + AYL KINKP IKTI S DGD+I+CV SHLQPAFDHP+L+
Subjt:  MASSSSSSSFCS--CSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELK

Query:  GQSPLEPPERP-KGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAP
        GQ PL+ P RP KGN++  E + N QLWS+SGE CP G+IPIR+TT+ D+ RANSV+RFGRK  R  RRDS+G GHEHAV+FVNGEQYYGAKAS+N+WAP
Subjt:  GQSPLEPPERP-KGNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAP

Query:  RVSAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMD
        RV+  YEFSLSQIWLISGSFG+DLNTIEAGWQVSPELYGDN PRFFTYWTTDAY ATGCYNLLCSGFVQ NN+IAIGAAISP SSY G+QFDIGLM+W D
Subjt:  RVSAEYEFSLSQIWLISGSFGNDLNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMD

Query:  PKHGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDI
        PKHGHWWLE G+G+LVGYWP+FLFSHLRSHASMVQFGGE+VN RSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NL +LADH  CYDI
Subjt:  PKHGHWWLEYGSGMLVGYWPSFLFSHLRSHASMVQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDI

Query:  RQASNHVWGTYFYYGGPGRNVKCP
        RQ  N+VWGTYFYYGGPGRN +CP
Subjt:  RQASNHVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCTTCTTCTTCTTTCTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGGTTTTTACTTCTTTCTCCTCTGTTTTTTCAACTTCAATTTCTCATAA
AAAAGCACTAAAAAACCAAACTTTTTTCCACCCCAACAAAGAGCTGAGGAAACTAAACCACATCAGAGCTTATTTGCACAAAATCAACAAGCCTCCAATCAAGACAATTC
AGAGCTCAGATGGTGATATCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACAATCCCCATTGGAACCGCCTGAGAGGCCAAAA
GGGAACAAATCCATGGAAGAAATGGCAGAGAACTTGCAATTATGGTCAGTTTCAGGCGAATTTTGCCCTGAAGGTACAATTCCCATTAGAAGAACAACAGAGAAAGACAT
TTTCAGAGCAAATTCTGTTCAAAGATTTGGAAGAAAACCAATTAGACGTACGAGGAGAGATTCTGCTGGCGATGGCCATGAGCATGCAGTGATGTTTGTAAATGGAGAAC
AATATTATGGAGCAAAGGCCAGCTTAAACATATGGGCACCACGAGTAAGTGCTGAATACGAGTTCAGCTTATCTCAAATATGGCTCATTTCAGGCTCATTTGGTAATGAT
TTGAACACAATTGAAGCTGGATGGCAGGTTAGTCCTGAACTTTATGGCGACAACAATCCTAGGTTCTTTACGTACTGGACGACTGATGCTTATCATGCTACTGGATGTTA
TAATCTACTCTGCTCTGGCTTCGTTCAAATTAATAACAGGATTGCCATTGGAGCTGCAATTTCGCCTATTTCTTCTTATGGTGGGAAGCAATTTGACATTGGTTTGATGG
TTTGGATGGATCCGAAACACGGGCACTGGTGGCTCGAGTACGGGTCGGGTATGCTGGTCGGGTACTGGCCGTCATTTCTATTCAGCCATTTGAGGAGCCATGCAAGCATG
GTGCAATTTGGAGGGGAAATAGTGAACAGAAGATCTTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCAGAAGAGGGATTTGGTAAAGCTTCTTATTT
TAGGAACCTGCAAGTGGTTGATTGGGACAACAATTTGCTTCCCCTCACAAATCTTAAGCTCTTGGCTGACCATTCTGAGTGCTATGATATTAGACAAGCCAGTAATCATG
TTTGGGGCACTTATTTTTACTATGGAGGCCCTGGTAGGAATGTTAAATGCCCTTGA
mRNA sequenceShow/hide mRNA sequence
GTTATGAAAAAGTACATGTCAAGTTAAAATGGGTCTATTTTAAAACCCTCACACCTGAAACATTTTTAGAAAGGCAAACCCTTTGAAAAAGAAGAAAGAAAAAGAGGATG
AGATTATTTTATATTTTGTGTTGGGAATGGTAAGGCAAAGAAGAAGAAAACAAAACCAAAGTTTGTGTTTTTATCAGAATGGTTTTTTTTTTTGTAAGGGCTTGAGGAAC
TGAGATTTGTTTTATTTTGTTTTGTTTTCTATATTTTTCTTCACCACACAACCAGACAACCTCAGTACTTACAAATCACAAATCATTACAGTGTACAACAAACGACTTCG
CCGAAACATTCCAACCAAATGGCTTCTTCTTCTTCTTCTTCTTCTTTCTGTTCTTGTTCTTGTTTTGTTGTTTTCCTTCTGGTTTTTACTTCTTTCTCCTCTGTTTTTTC
AACTTCAATTTCTCATAAAAAAGCACTAAAAAACCAAACTTTTTTCCACCCCAACAAAGAGCTGAGGAAACTAAACCACATCAGAGCTTATTTGCACAAAATCAACAAGC
CTCCAATCAAGACAATTCAGAGCTCAGATGGTGATATCATAGACTGTGTTCTTTCTCATCTCCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGACAATCCCCATTGGAA
CCGCCTGAGAGGCCAAAAGGGAACAAATCCATGGAAGAAATGGCAGAGAACTTGCAATTATGGTCAGTTTCAGGCGAATTTTGCCCTGAAGGTACAATTCCCATTAGAAG
AACAACAGAGAAAGACATTTTCAGAGCAAATTCTGTTCAAAGATTTGGAAGAAAACCAATTAGACGTACGAGGAGAGATTCTGCTGGCGATGGCCATGAGCATGCAGTGA
TGTTTGTAAATGGAGAACAATATTATGGAGCAAAGGCCAGCTTAAACATATGGGCACCACGAGTAAGTGCTGAATACGAGTTCAGCTTATCTCAAATATGGCTCATTTCA
GGCTCATTTGGTAATGATTTGAACACAATTGAAGCTGGATGGCAGGTTAGTCCTGAACTTTATGGCGACAACAATCCTAGGTTCTTTACGTACTGGACGACTGATGCTTA
TCATGCTACTGGATGTTATAATCTACTCTGCTCTGGCTTCGTTCAAATTAATAACAGGATTGCCATTGGAGCTGCAATTTCGCCTATTTCTTCTTATGGTGGGAAGCAAT
TTGACATTGGTTTGATGGTTTGGATGGATCCGAAACACGGGCACTGGTGGCTCGAGTACGGGTCGGGTATGCTGGTCGGGTACTGGCCGTCATTTCTATTCAGCCATTTG
AGGAGCCATGCAAGCATGGTGCAATTTGGAGGGGAAATAGTGAACAGAAGATCTTCAGGGTTCCACACAGCCACACAAATGGGGAGTGGCCATTTTGCAGAAGAGGGATT
TGGTAAAGCTTCTTATTTTAGGAACCTGCAAGTGGTTGATTGGGACAACAATTTGCTTCCCCTCACAAATCTTAAGCTCTTGGCTGACCATTCTGAGTGCTATGATATTA
GACAAGCCAGTAATCATGTTTGGGGCACTTATTTTTACTATGGAGGCCCTGGTAGGAATGTTAAATGCCCTTGATCTATTTAATTGTTTTCTTATTATTAATAATATTCT
TCAAAAATAGGATACACTGTAATTTATAGTTTCTTTTTTTTTTTTTTTTTGGTCCTAGGCCTTGAGGTTTGATTTGGGTGTGTATTTTTATTTGTATTTTAATATTAATT
Protein sequenceShow/hide protein sequence
MASSSSSSSFCSCSCFVVFLLVFTSFSSVFSTSISHKKALKNQTFFHPNKELRKLNHIRAYLHKINKPPIKTIQSSDGDIIDCVLSHLQPAFDHPELKGQSPLEPPERPK
GNKSMEEMAENLQLWSVSGEFCPEGTIPIRRTTEKDIFRANSVQRFGRKPIRRTRRDSAGDGHEHAVMFVNGEQYYGAKASLNIWAPRVSAEYEFSLSQIWLISGSFGND
LNTIEAGWQVSPELYGDNNPRFFTYWTTDAYHATGCYNLLCSGFVQINNRIAIGAAISPISSYGGKQFDIGLMVWMDPKHGHWWLEYGSGMLVGYWPSFLFSHLRSHASM
VQFGGEIVNRRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLKLLADHSECYDIRQASNHVWGTYFYYGGPGRNVKCP