; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002811 (gene) of Chayote v1 genome

Gene IDSed0002811
OrganismSechium edule (Chayote v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG12:34193561..34198343
RNA-Seq ExpressionSed0002811
SyntenySed0002811
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN63561.1 hypothetical protein Csa_013107 [Cucumis sativus]1.7e-22691.38Show/hide
Query:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAFVA FL     F+LSAPP  AE+F+PAA E+QKLN VK YLKN+NKP +K IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPL+PP+RP+GYNSSA
Subjt:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DSVA+ FQLWRQTGESCP+GTVPIRRTTEQDILRASSVQRFG+KPLKS+RRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_008464544.1 PREDICTED: uncharacterized protein LOC103502395 [Cucumis melo]1.9e-22590.89Show/hide
Query:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAFVA FL     F+LSAPP  AE+F+PA  E+QKLN VK YLKN+NKP +K IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPL+PP+RP+GYNSSA
Subjt:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DSVA+ FQLWRQTGESCP+GTVPIRRTTEQDILRASSVQRFG+KPLKS+RRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_022135449.1 uncharacterized protein LOC111007405 [Momordica charantia]7.9e-22491.09Show/hide
Query:  MLLAAFVALFL---FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADS
        MLLAAFVALFL    VLSAPP AAE+F+PAAAEFQKLN  + YL+N+NKP+VKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQ PL+PP+RPKGYNSSADS
Subjt:  MLLAAFVALFL---FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADS

Query:  VAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSF
        +A+  QLWRQTGE CP+GTVPIRRTTEQDILRASS QRFG+KPLKSVRRDST SGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQ+W+ISGSF
Subjt:  VAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSF

Query:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP
        N+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS  NGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP
Subjt:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP

Query:  AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRN
        AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPGRN
Subjt:  AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRN

Query:  VHCP
        VHCP
Subjt:  VHCP

XP_022988380.1 uncharacterized protein LOC111485640 isoform X1 [Cucurbita maxima]2.7e-22491.79Show/hide
Query:  MLLAAFVALFLFV-LSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA
        MLLAAFVALFL    SAPP AAE+F P A E+QKL G+K YLKN+NKP VKTIQSPDGDLIDCVLSHLQPAFDH+KLKGQLPL+PP+RPKG+NSSA SVA
Subjt:  MLLAAFVALFLFV-LSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA

Query:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN
        + FQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFG+KP+KSVRRDS+ SGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISGSFNN
Subjt:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN

Query:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF
        DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGYWPAF
Subjt:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF

Query:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH
        LFSHLGSHASMIQFGGEIVNTRSTG HTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH
Subjt:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH

Query:  CP
        CP
Subjt:  CP

XP_038880687.1 uncharacterized protein LOC120072305 [Benincasa hispida]2.2e-22691.87Show/hide
Query:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAF+ALFL     F+LSAPP AAE+FRPAA E+QKLNGV+ YLKNVNKP VK IQSPDGDLIDCVLSHLQPAFDHHKLKGQ P +PP+RPKGYNSSA
Subjt:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DSVA+ FQLWRQTGESCP+GTVPIRRTTEQDILRASSVQRFG+KPLKS+RRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLAD PNCYDIRQGKNK WG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

TrEMBL top hitse value%identityAlignment
A0A0A0LU89 Uncharacterized protein8.3e-22791.38Show/hide
Query:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAFVA FL     F+LSAPP  AE+F+PAA E+QKLN VK YLKN+NKP +K IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPL+PP+RP+GYNSSA
Subjt:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DSVA+ FQLWRQTGESCP+GTVPIRRTTEQDILRASSVQRFG+KPLKS+RRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A1S3CN98 uncharacterized protein LOC1035023959.1e-22690.89Show/hide
Query:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAFVA FL     F+LSAPP  AE+F+PA  E+QKLN VK YLKN+NKP +K IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPL+PP+RP+GYNSSA
Subjt:  MLLAAFVALFL-----FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DSVA+ FQLWRQTGESCP+GTVPIRRTTEQDILRASSVQRFG+KPLKS+RRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A6J1C1G7 uncharacterized protein LOC1110074053.8e-22491.09Show/hide
Query:  MLLAAFVALFL---FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADS
        MLLAAFVALFL    VLSAPP AAE+F+PAAAEFQKLN  + YL+N+NKP+VKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQ PL+PP+RPKGYNSSADS
Subjt:  MLLAAFVALFL---FVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADS

Query:  VAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSF
        +A+  QLWRQTGE CP+GTVPIRRTTEQDILRASS QRFG+KPLKSVRRDST SGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQ+W+ISGSF
Subjt:  VAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSF

Query:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP
        N+DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS  NGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP
Subjt:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWP

Query:  AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRN
        AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WG YFYYGGPGRN
Subjt:  AFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRN

Query:  VHCP
        VHCP
Subjt:  VHCP

A0A6J1EFF9 uncharacterized protein LOC1114328583.6e-22289.9Show/hide
Query:  MLLAAFVALFLF-----VLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA
        MLLAAFVALFL      VL AP  AAE FRP AAEFQKL+GV+ +LKN+NKPSVK IQSPDGDLIDCVLSHLQPAFDH+KLK QLPL+PPDRPKGYNSSA
Subjt:  MLLAAFVALFLF-----VLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSA

Query:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG
        DS+A  FQLWRQ+GESCP+GT+PIRRTTEQDILRASSVQRFGKKPLKSVRRDS+GSGHEHAVVFVNGEQYYGAKANINVWAPHV+D+YEFSLSQIW+ISG
Subjt:  DSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWT+DAY+ TGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFD+GLMIWKDPRHGNWWLEFGQG+LVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG
        WPA+LFSHLGSHASMIQFGGE+VNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNL LLADHPNCYDIRQGKNK WGTYFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A6J1JJE3 uncharacterized protein LOC111485640 isoform X11.3e-22491.79Show/hide
Query:  MLLAAFVALFLFV-LSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA
        MLLAAFVALFL    SAPP AAE+F P A E+QKL G+K YLKN+NKP VKTIQSPDGDLIDCVLSHLQPAFDH+KLKGQLPL+PP+RPKG+NSSA SVA
Subjt:  MLLAAFVALFLFV-LSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA

Query:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN
        + FQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFG+KP+KSVRRDS+ SGHEHAVVFVNGEQYYGAKANINVWAPHV+DQYEFSLSQIW+ISGSFNN
Subjt:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN

Query:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF
        DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSY NGRQFDVGLMIWKDPRHGNWWLE GQGLLVGYWPAF
Subjt:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF

Query:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH
        LFSHLGSHASMIQFGGEIVNTRSTG HTSTQMGSGHFAEEGYGKASYFRNLQ+IDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH
Subjt:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH

Query:  CP
        CP
Subjt:  CP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)5.4e-17067.91Show/hide
Query:  FVALFLFVLSAPPFAAEHFRP------AAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA
        F++L L   S     +E+  P         E  KL  +  +L+ +NKPS+KTI SPDGD+IDCVL H QPAFDH  L+GQ PL+PP+RP+G+N       
Subjt:  FVALFLFVLSAPPFAAEHFRP------AAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVA

Query:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN
        K FQLW   GE+CP+GTVPIRRT E+DILRA+SV  FGKK L+  RRD++ +GHEHAV +V+GE+YYGAKA+INVWAP V +QYEFSLSQIW+ISGSF N
Subjt:  KIFQLWRQTGESCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNN

Query:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF
        DLNTIEAGWQVSPELYGDNYPRFFTYWT+DAYQATGCYNLLCSGFVQTN++IAIGAAISP S   G QFD+ L+IWKDP+HGNWWLEFG G+LVGYWP+F
Subjt:  DLNTIEAGWQVSPELYGDNYPRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAF

Query:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH
        LF+HL  HASM+Q+GGEIVN+   G HTSTQMGSGHFAEEG+ K+SYFRN+QV+DWDN+L+P  NL +LADHPNCYDI+ G N+ WG+YFYYGGPG+N  
Subjt:  LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVH

Query:  CP
        CP
Subjt:  CP

AT1G23340.1 Protein of Unknown Function (DUF239)1.1e-16570.16Show/hide
Query:  EFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILR
        E QK+  ++  L+ +NKP++KTI S DGD IDCV SH QPAFDH  L+GQ P++PP+ P GY+   +S  + FQLW   GESCP+GT+PIRRTTEQD+LR
Subjt:  EFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPIRRTTEQDILR

Query:  ASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSD
        A+SV+RFG+K ++ VRRDS+ +GHEHAV +V+G QYYGAKA+INVW P V  QYEFSLSQIW+I+GSF  DLNTIEAGWQ+SPELYGD  PRFFTYWTSD
Subjt:  ASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTSD

Query:  AYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTST
        AYQATGCYNLLCSGFVQTNN+IAIGAAISP S   G QFD+ L+IWKDP+HG+WWL+FG G LVGYWP  LF+HL  H +M+QFGGEIVNTR  G HTST
Subjt:  AYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTST

Query:  QMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP
        QMGSGHFA EG+GKASYFRNLQ++DWDN+L+P+SNL +LADHPNCYDIR G N+ WG +FYYGGPG+N  CP
Subjt:  QMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP

AT1G70550.1 Protein of Unknown Function (DUF239)1.4e-17070.94Show/hide
Query:  AAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPI
        AA+       E QKL  ++  L  +NKP+VKTIQS DGD IDCV +H QPAFDH  L+GQ PL+PP+ PKGY S  D   +  QLW  +GESCP+GT+PI
Subjt:  AAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPI

Query:  RRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNY
        RRTTEQD+LRASSVQRFG+K ++ V+RDST +GHEHAV +V G QYYGAKA+INVW+P VT QYEFSLSQIW+I+GSF +DLNTIEAGWQ+SPELYGD Y
Subjt:  RRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNY

Query:  PRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVN
        PRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS   G QFD+ L+IWKDP+HG+WWL+FG G LVGYWPAFLF+HL  H SM+QFGGEIVN
Subjt:  PRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVN

Query:  TRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP
         R  G HT+TQMGSGHFA EG+GKASYFRNLQ++DWDN+L+P SNL +LADHPNCYDIR G N+ WG YFYYGGPG+N  CP
Subjt:  TRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.4e-17070.94Show/hide
Query:  AAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPI
        AA+       E QKL  ++  L  +NKP+VKTIQS DGD IDCV +H QPAFDH  L+GQ PL+PP+ PKGY S  D   +  QLW  +GESCP+GT+PI
Subjt:  AAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPI

Query:  RRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNY
        RRTTEQD+LRASSVQRFG+K ++ V+RDST +GHEHAV +V G QYYGAKA+INVW+P VT QYEFSLSQIW+I+GSF +DLNTIEAGWQ+SPELYGD Y
Subjt:  RRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNY

Query:  PRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVN
        PRFFTYWTSDAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS   G QFD+ L+IWKDP+HG+WWL+FG G LVGYWPAFLF+HL  H SM+QFGGEIVN
Subjt:  PRFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVN

Query:  TRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP
         R  G HT+TQMGSGHFA EG+GKASYFRNLQ++DWDN+L+P SNL +LADHPNCYDIR G N+ WG YFYYGGPG+N  CP
Subjt:  TRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP

AT5G50150.1 Protein of Unknown Function (DUF239)8.9e-18176.19Show/hide
Query:  FRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPIRRTT
        FRP   E QKL  V+ YL  +NKPS+KTI SPDGD+I+CV SHLQPAFDH +L+GQ PL+ P RP   N +        QLW  +GESCP G++PIR+TT
Subjt:  FRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGESCPKGTVPIRRTT

Query:  EQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFF
        + D+LRA+SV+RFG+K  + +RRDS+G GHEHAVVFVNGEQYYGAKA+INVWAP VTD YEFSLSQIWLISGSF +DLNTIEAGWQVSPELYGDNYPRFF
Subjt:  EQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFF

Query:  TYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTRST
        TYWT+DAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS  NGRQFD+GLMIWKDP+HG+WWLE G GLLVGYWPAFLFSHL SHASM+QFGGE+VN+RS+
Subjt:  TYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTRST

Query:  GFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP
        G HT TQMGSGHFA+EG+ KA+YFRNLQV+DWDN+LLP+ NLH+LADHP CYDIRQGKN  WGTYFYYGGPGRN  CP
Subjt:  GFHTSTQMGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTAGCGGCTTTTGTTGCTTTATTTCTGTTTGTTTTGTCGGCGCCGCCCTTTGCGGCGGAGCATTTCCGACCCGCCGCCGCAGAGTTTCAGAAACTGAACGGCGT
TAAAGACTACTTGAAGAACGTCAATAAACCCTCCGTCAAAACAATTCAGAGCCCAGATGGGGATTTAATCGATTGTGTTCTTTCTCATCTTCAACCTGCTTTTGATCATC
ATAAACTCAAAGGGCAGCTTCCATTGGAACCACCAGATAGGCCAAAAGGTTACAACTCTTCTGCTGATTCAGTTGCAAAGATCTTTCAGCTATGGAGGCAGACTGGTGAA
TCATGTCCTAAAGGAACTGTTCCAATTAGAAGAACTACAGAACAAGACATTTTAAGGGCAAGCTCTGTTCAAAGATTTGGGAAAAAGCCATTAAAATCTGTTAGAAGGGA
CTCAACAGGCAGTGGCCATGAGCATGCTGTTGTGTTTGTTAATGGAGAACAATACTATGGAGCAAAGGCAAATATAAATGTTTGGGCACCTCATGTGACTGATCAATATG
AGTTCAGTTTGTCTCAAATCTGGCTTATTTCTGGATCATTCAATAATGATCTGAATACCATTGAAGCTGGTTGGCAGGTTAGTCCTGAGTTATATGGAGATAATTATCCA
AGGTTCTTCACTTATTGGACGTCAGATGCATACCAAGCAACAGGCTGCTACAACTTACTGTGCTCTGGTTTTGTCCAAACTAATAACAAGATTGCCATTGGAGCTGCAAT
ATCTCCAAGATCTTATTTAAATGGCAGACAATTTGATGTGGGCTTAATGATTTGGAAGGATCCGAGGCACGGAAACTGGTGGCTGGAATTTGGGCAGGGTCTGTTAGTAG
GGTACTGGCCAGCATTTTTGTTCAGTCACTTGGGAAGCCATGCCAGCATGATCCAATTTGGTGGAGAAATAGTAAACACAAGATCAACAGGGTTCCACACATCAACACAA
ATGGGAAGTGGGCATTTTGCTGAAGAAGGTTATGGAAAAGCTTCTTATTTCAGAAATCTTCAAGTTATTGATTGGGACAACAGTTTGCTACCTGTCTCAAACCTTCATCT
ATTGGCTGATCATCCAAATTGTTATGATATAAGACAAGGTAAAAACAAGTTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGTAGAAATGTCCACTGTCCGTAA
mRNA sequenceShow/hide mRNA sequence
ATTCAGGCCGTGTTTGTCTGGTTGATTCCCAGAAATGAAATCATAATTCATAACCAAATCAAAGCAAAACCCACCTCAATTTCATCCATCTTCTTCACAAAATCTCTCCA
AATTCGTCAAATCAAATGCCCATTTCTTCACATTTTTATGTCCCACTCCTGCAAAAAAGTAAAAGCAAAGAACCCAATTCCTTTTTTTTAGTTTCTCGAGCCTGTTGACC
AGTTCTTGTCTATAAAAAAAGAATTGGTTTCTCTCGAATTTCTCTGTCTGTCAGAAAAATCGCCATTAACGAAGCTAAGATGCTTTTCTTCGAAGGGAGGGAAGATTTAT
GTTAAAGAAAGCTGAAAATGGGGGATTTTGTGTTAAAAAATTCATGTGGGTTTTTGTTTCTGAGAAGAATGGTTCGTTCTATTTTGTAAGAACATGCAGAGAATTCGAAG
CTCAGTGATAATTGATTTTCATCTTTGTTCTAATACGACAATCCAAATACGAACAAACAACCAAAACGACAGACCACAACATTCTTTGGATTTGGAATTGGAATTGGCCG
GTCTCGATGTTATTAGCGGCTTTTGTTGCTTTATTTCTGTTTGTTTTGTCGGCGCCGCCCTTTGCGGCGGAGCATTTCCGACCCGCCGCCGCAGAGTTTCAGAAACTGAA
CGGCGTTAAAGACTACTTGAAGAACGTCAATAAACCCTCCGTCAAAACAATTCAGAGCCCAGATGGGGATTTAATCGATTGTGTTCTTTCTCATCTTCAACCTGCTTTTG
ATCATCATAAACTCAAAGGGCAGCTTCCATTGGAACCACCAGATAGGCCAAAAGGTTACAACTCTTCTGCTGATTCAGTTGCAAAGATCTTTCAGCTATGGAGGCAGACT
GGTGAATCATGTCCTAAAGGAACTGTTCCAATTAGAAGAACTACAGAACAAGACATTTTAAGGGCAAGCTCTGTTCAAAGATTTGGGAAAAAGCCATTAAAATCTGTTAG
AAGGGACTCAACAGGCAGTGGCCATGAGCATGCTGTTGTGTTTGTTAATGGAGAACAATACTATGGAGCAAAGGCAAATATAAATGTTTGGGCACCTCATGTGACTGATC
AATATGAGTTCAGTTTGTCTCAAATCTGGCTTATTTCTGGATCATTCAATAATGATCTGAATACCATTGAAGCTGGTTGGCAGGTTAGTCCTGAGTTATATGGAGATAAT
TATCCAAGGTTCTTCACTTATTGGACGTCAGATGCATACCAAGCAACAGGCTGCTACAACTTACTGTGCTCTGGTTTTGTCCAAACTAATAACAAGATTGCCATTGGAGC
TGCAATATCTCCAAGATCTTATTTAAATGGCAGACAATTTGATGTGGGCTTAATGATTTGGAAGGATCCGAGGCACGGAAACTGGTGGCTGGAATTTGGGCAGGGTCTGT
TAGTAGGGTACTGGCCAGCATTTTTGTTCAGTCACTTGGGAAGCCATGCCAGCATGATCCAATTTGGTGGAGAAATAGTAAACACAAGATCAACAGGGTTCCACACATCA
ACACAAATGGGAAGTGGGCATTTTGCTGAAGAAGGTTATGGAAAAGCTTCTTATTTCAGAAATCTTCAAGTTATTGATTGGGACAACAGTTTGCTACCTGTCTCAAACCT
TCATCTATTGGCTGATCATCCAAATTGTTATGATATAAGACAAGGTAAAAACAAGTTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGTAGAAATGTCCACTGTCCGT
AAGCAGATCATACTGGCATTTTCAATAATTTGTTTTTGATATGTGCGAGTGTATTCTAGCCAATTTGTGTGCATCTCGTTTCTTTTCATAAGAGATACAATTTGATCTTA
CGGCAGTTGGATATCAAACAAGCCCATAGTAGACTAGGTAGCAGTTTAGATTGAATTCATAACCTCGTGATTCCAATGCCCTATTTGGCCACATGATTACTTTTCTTTTT
CTTTTATTTTTTTGGGATTTTATTTTTCTTACCCACTTTCTCTATAGGCCTTAGAATGATTTGGGTTGCTTTTATTCTTATTATAATGATATTTTTTGGTTTTCTTTTTT
CCCATGTGGCCCCATTTTGGGATATTGTAAATTTTGACGATAATAGGATGTCT
Protein sequenceShow/hide protein sequence
MLLAAFVALFLFVLSAPPFAAEHFRPAAAEFQKLNGVKDYLKNVNKPSVKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLEPPDRPKGYNSSADSVAKIFQLWRQTGE
SCPKGTVPIRRTTEQDILRASSVQRFGKKPLKSVRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVTDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYP
RFFTYWTSDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYLNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQ
MGSGHFAEEGYGKASYFRNLQVIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKFWGTYFYYGGPGRNVHCP