; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005527 (gene) of Snake gourd v1 genome

Gene IDTan0005527
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG09:72738862..72742837
RNA-Seq ExpressionTan0005527
SyntenyTan0005527
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN63561.1 hypothetical protein Csa_013107 [Cucumis sativus]1.6e-23294.58Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVA FLLASTS +LSAPPI AENF+PAA E+QKL+ V  YL NINKPPIK IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPP+RP+GYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DSVAE FQLWR+TGESCPEGTVPIRRTTEQDILRASSVQ FGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_008464544.1 PREDICTED: uncharacterized protein LOC103502395 [Cucumis melo]1.8e-23194.09Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVA FLLASTS +LSAPPI AENF+PA  E+QKL+ V  YL NINKPPIK IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPP+RP+GYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DSVAE FQLWR+TGESCPEGTVPIRRTTEQDILRASSVQ FGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_022135449.1 uncharacterized protein LOC111007405 [Momordica charantia]1.6e-22792.61Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVALFL  +T+SVLSAPPIAAENF+PAAAEFQKL+    YL NINKP +KTIQSPDGDLIDCVLSHLQPAFDHHKLKGQ PLDPP+RPKGYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DS+AE  QLWR+TGE CPEGTVPIRRTTEQDILRASS Q FGRKPLKS+RRDST SGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ+W+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS YNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_023529935.1 uncharacterized protein LOC111792632 isoform X1 [Cucurbita pepo subsp. pepo]5.4e-22892.12Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVALFL+ STSSVL AP IAAE+FRPAAAEFQKLSGV  +L NINKP +KTIQSPDGDLIDCVLSHLQPAFDH+KLK QLPLDPPDRPKGYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DS+A  FQLWR++GESCPEGT+PIRRTTEQDILRASSVQ FGRKPLKS+RRDS+GSGHEHAVVFVNGEQYYGAKANINVWAPHVSD+YEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAY+ TGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFD+GLMIWKDPRHGNWWLEFGQG+LVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPA+LFSHLGSHASMIQFGGE+VNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNL LLADHPNCYDIRQGKNKLWGTYFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

XP_038880687.1 uncharacterized protein LOC120072305 [Benincasa hispida]6.1e-23294.09Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAF+ALFLL STS +LSAPPIAAENFRPAA E+QKL+GV  YL N+NKPP+K IQSPDGDLIDCVLSHLQPAFDHHKLKGQ P DPP+RPKGYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DSVAE FQLWR+TGESCPEGTVPIRRTTEQDILRASSVQ FGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLAD PNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

TrEMBL top hitse value%identityAlignment
A0A0A0LU89 Uncharacterized protein7.8e-23394.58Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVA FLLASTS +LSAPPI AENF+PAA E+QKL+ V  YL NINKPPIK IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPP+RP+GYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DSVAE FQLWR+TGESCPEGTVPIRRTTEQDILRASSVQ FGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A1S3CN98 uncharacterized protein LOC1035023958.6e-23294.09Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVA FLLASTS +LSAPPI AENF+PA  E+QKL+ V  YL NINKPPIK IQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPP+RP+GYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DSVAE FQLWR+TGESCPEGTVPIRRTTEQDILRASSVQ FGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A6J1C1G7 uncharacterized protein LOC1110074057.6e-22892.61Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVALFL  +T+SVLSAPPIAAENF+PAAAEFQKL+    YL NINKP +KTIQSPDGDLIDCVLSHLQPAFDHHKLKGQ PLDPP+RPKGYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DS+AE  QLWR+TGE CPEGTVPIRRTTEQDILRASS Q FGRKPLKS+RRDST SGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ+W+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS YNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWG YFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A6J1EFF9 uncharacterized protein LOC1114328588.4e-22791.38Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVALFL+ STSSVL AP IAAE+FRP AAEFQKLSGV  +L NINKP +K IQSPDGDLIDCVLSHLQPAFDH+KLK QLPLDPPDRPKGYNSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
        DS+A  FQLWR++GESCPEGT+PIRRTTEQDILRASSVQ FG+KPLKS+RRDS+GSGHEHAVVFVNGEQYYGAKANINVWAPHVSD+YEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAY+ TGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFD+GLMIWKDPRHGNWWLEFGQG+LVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPA+LFSHLGSHASMIQFGGE+VNT+STGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNL LLADHPNCYDIRQGKNKLWGTYFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

A0A6J1JJE3 uncharacterized protein LOC111485640 isoform X12.9e-22792.36Show/hide
Query:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA
        MLLAAFVALFLLAST    SAPPIAAENF P A E+QKL+G+  YL NINKPP+KTIQSPDGDLIDCVLSHLQPAFDH+KLKGQLPLDPP+RPKG+NSSA
Subjt:  MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSA

Query:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG
         SVAE FQLWR+TGESCP+GTVPIRRTTEQDILRASSVQ FGRKP+KS+RRDS+ SGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW+ISG
Subjt:  DSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISG

Query:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY
        SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLE GQGLLVGY
Subjt:  SFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGY

Query:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG
        WPAFLFSHLGSHASMIQFGGEIVNT+STG HTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNK WGTYFYYGGPG
Subjt:  WPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPG

Query:  RNVHCP
        RNVHCP
Subjt:  RNVHCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)1.5e-17268.98Show/hide
Query:  FVALFLLASTSSVLSAPPIAAEN--FRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSADSV
        F++L LL+S+ S + +  ++  N   RP   E  KL  +N +L  INKP IKTI SPDGD+IDCVL H QPAFDH  L+GQ PLDPP+RP+G+N      
Subjt:  FVALFLLASTSSVLSAPPIAAEN--FRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSADSV

Query:  AEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFN
         + FQLW   GE+CPEGTVPIRRT E+DILRA+SV +FG+K L+  RRD++ +GHEHAV +V+GE+YYGAKA+INVWAP V +QYEFSLSQIW+ISGSF 
Subjt:  AEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFN

Query:  NDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPA
        NDLNTIEAGWQVSPELYGDNYPRFFTYWT DAYQATGCYNLLCSGFVQTN++IAIGAAISP S Y G QFD+ L+IWKDP+HGNWWLEFG G+LVGYWP+
Subjt:  NDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPA

Query:  FLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPGRNV
        FLF+HL  HASM+Q+GGEIVN+   G HTSTQMGSGHFAEEG+ K+SYFRN+Q++DWDN+L+P  NL +LADHPNCYDI+ G N+ WG+YFYYGGPG+N 
Subjt:  FLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPGRNV

Query:  HCP
         CP
Subjt:  HCP

AT1G23340.1 Protein of Unknown Function (DUF239)3.7e-16667.25Show/hide
Query:  FVALFLLASTSSVLSAPPIAAENFRP--AAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSADSV
        F    LL S  S  ++P  +     P     E QK+  +   L  INKP IKTI S DGD IDCV SH QPAFDH  L+GQ P+DPP+ P GY+   +S 
Subjt:  FVALFLLASTSSVLSAPPIAAENFRP--AAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSADSV

Query:  AEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFN
         E FQLW   GESCPEGT+PIRRTTEQD+LRA+SV+ FGRK ++ +RRDS+ +GHEHAV +V+G QYYGAKA+INVW P V  QYEFSLSQIW+I+GSF 
Subjt:  AEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFN

Query:  NDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPA
         DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP S Y G QFD+ L+IWKDP+HG+WWL+FG G LVGYWP 
Subjt:  NDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPA

Query:  FLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPGRNV
         LF+HL  H +M+QFGGEIVNT+  G HTSTQMGSGHFA EG+GKASYFRNLQ++DWDN+L+P+SNL +LADHPNCYDIR G N++WG +FYYGGPG+N 
Subjt:  FLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPGRNV

Query:  HCP
         CP
Subjt:  HCP

AT1G70550.1 Protein of Unknown Function (DUF239)8.4e-17167.72Show/hide
Query:  MLLAAFVALFLL------ASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPK
        M  ++F+ L LL      + +S+  S+   AA+       E QKL+ +   L  INKP +KTIQS DGD IDCV +H QPAFDH  L+GQ PLDPP+ PK
Subjt:  MLLAAFVALFLL------ASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPK

Query:  GYNSSADSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ
        GY S  D   E  QLW  +GESCPEGT+PIRRTTEQD+LRASSVQ FGRK ++ ++RDST +GHEHAV +V G QYYGAKA+INVW+P V+ QYEFSLSQ
Subjt:  GYNSSADSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQ
        IW+I+GSF +DLNTIEAGWQ+SPELYGD YPRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS Y G QFD+ L+IWKDP+HG+WWL+FG 
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYF
        G LVGYWPAFLF+HL  H SM+QFGGEIVN +  G HT+TQMGSGHFA EG+GKASYFRNLQI+DWDN+L+P SNL +LADHPNCYDIR G N++WG YF
Subjt:  GLLVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPG+N  CP
Subjt:  YYGGPGRNVHCP

AT1G70550.2 Protein of Unknown Function (DUF239)8.4e-17167.72Show/hide
Query:  MLLAAFVALFLL------ASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPK
        M  ++F+ L LL      + +S+  S+   AA+       E QKL+ +   L  INKP +KTIQS DGD IDCV +H QPAFDH  L+GQ PLDPP+ PK
Subjt:  MLLAAFVALFLL------ASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPK

Query:  GYNSSADSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ
        GY S  D   E  QLW  +GESCPEGT+PIRRTTEQD+LRASSVQ FGRK ++ ++RDST +GHEHAV +V G QYYGAKA+INVW+P V+ QYEFSLSQ
Subjt:  GYNSSADSVAEIFQLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQ
        IW+I+GSF +DLNTIEAGWQ+SPELYGD YPRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS Y G QFD+ L+IWKDP+HG+WWL+FG 
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYF
        G LVGYWPAFLF+HL  H SM+QFGGEIVN +  G HT+TQMGSGHFA EG+GKASYFRNLQI+DWDN+L+P SNL +LADHPNCYDIR G N++WG YF
Subjt:  GLLVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPG+N  CP
Subjt:  YYGGPGRNVHCP

AT5G50150.1 Protein of Unknown Function (DUF239)9.0e-18173.17Show/hide
Query:  LLAAFVALFLLASTS-SVLSAPPIAAEN---FRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYN
        + + F+ L LL S    +L    I  +N   FRP   E QKL  V  YL+ INKP IKTI SPDGD+I+CV SHLQPAFDH +L+GQ PLD P RP   N
Subjt:  LLAAFVALFLLASTS-SVLSAPPIAAEN---FRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYN

Query:  SSADSVAEIF-QLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW
         +  +  E F QLW  +GESCP G++PIR+TT+ D+LRA+SV+ FGRK  + IRRDS+G GHEHAVVFVNGEQYYGAKA+INVWAP V+D YEFSLSQIW
Subjt:  SSADSVAEIF-QLWRKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIW

Query:  LISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGL
        LISGSF +DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS YNGRQFD+GLMIWKDP+HG+WWLE G GL
Subjt:  LISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGL

Query:  LVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYY
        LVGYWPAFLFSHL SHASM+QFGGE+VN++S+G HT TQMGSGHFA+EG+ KA+YFRNLQ++DWDN+LLP+ NLH+LADHP CYDIRQGKN +WGTYFYY
Subjt:  LVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYY

Query:  GGPGRNVHCP
        GGPGRN  CP
Subjt:  GGPGRNVHCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTAGCGGCTTTTGTTGCTTTATTTCTCCTCGCTTCGACTTCCTCTGTTCTGTCGGCACCACCCATTGCGGCGGAGAATTTCAGACCCGCCGCCGCAGAGTTCCA
GAAATTGAGCGGCGTTAATGACTACTTGACGAACATCAATAAACCCCCCATCAAAACAATTCAGAGCCCAGATGGGGATTTAATTGATTGCGTGCTTTCTCATCTTCAAC
CTGCCTTTGATCATCATAAGCTCAAAGGGCAACTTCCATTGGATCCCCCAGATAGGCCAAAAGGTTACAACTCCTCTGCCGATTCAGTTGCAGAGATCTTCCAGCTATGG
CGGAAAACTGGTGAATCGTGCCCTGAAGGAACTGTTCCCATTAGAAGAACTACAGAACAAGACATTTTAAGAGCAAGCTCTGTCCAAACATTTGGGAGAAAGCCATTAAA
ATCTATCAGAAGGGACTCAACAGGCAGTGGCCATGAGCATGCTGTTGTGTTTGTTAATGGAGAACAATACTATGGAGCAAAAGCAAACATAAATGTTTGGGCACCCCATG
TGAGTGATCAATATGAGTTCAGCTTGTCTCAAATCTGGCTTATTTCTGGATCATTTAATAATGATCTGAACACCATTGAAGCTGGTTGGCAGGTTAGTCCTGAGTTGTAT
GGAGATAATTATCCACGATTCTTCACTTATTGGACGACAGATGCATACCAAGCAACAGGCTGCTACAACTTACTGTGCTCTGGTTTTGTCCAAACTAATAACAAGATTGC
CATAGGAGCTGCAATATCTCCAAGATCTTATTACAATGGCAGACAATTTGATGTGGGTTTAATGATTTGGAAGGATCCGAGGCACGGGAACTGGTGGCTGGAATTTGGGC
AGGGTCTGTTAGTAGGGTACTGGCCAGCATTTTTGTTCAGTCACTTGGGAAGCCATGCCAGCATGATCCAATTTGGAGGAGAAATAGTAAACACAAAATCAACAGGGTTT
CACACATCAACACAGATGGGAAGTGGTCATTTTGCAGAAGAAGGCTATGGAAAAGCTTCTTATTTCAGAAATCTGCAAATAATTGATTGGGACAACAGTTTGCTTCCTGT
CTCAAATCTTCATCTATTGGCTGATCATCCAAATTGCTATGACATAAGACAAGGAAAAAACAAGCTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGTAGAAATGTAC
ACTGTCCATGA
mRNA sequenceShow/hide mRNA sequence
GTCTGGGTTAATTGCATAGATGTCTGTATGAATATGATAGCGAAGAGAGAGAAAATTCTGCTGCTTTTAGAAAAAGTAAAAAAAGGAAAAGAAAGTGTACTGCAAGCAAC
TGTATCAGAGTGGAACCAGTTTTGCAGGTGAAATAACAGAACATTAAAAAAAAAAAAAGATTCAGGCCATGTTTGTCTGGTTGATTCCCGTCTCTGTTTCAACCATCGCC
AGAATTCTCTAAAAGTGAAAACCCGAAAATGAAATCATAATCCAATCAAAACAAAACCCACCTCAATTTCTTTCATCTTCTTCACAATCTCCCTCAATTCATCAAATCAA
ATGGCCATTTCTTCTTCTTTCTTTTACAATCCCCAACTTTCTCTCTCCCATCCCTGCAAAAAAGAAAAAGAAAAGAACCCATATTTTCTTTTTTTAGTTTCTCGAGCCTT
TTGACCAGTTTCCTATTCATAAAATCGACAACCCGAGTCATCTGGGTCCTTTCTTAAAAAAAGAACAGGTTCTCTCGAATTTCTCTGTCTGTCAGAAAAATCGCCATTAA
CGAAGCTAAGATGCTCTTCTTCGAAGGGAGAAAAGATTTATATTAAAGAAGCTGAAAATGGGGGACGTTGTGTTAAAAAATTCATGTGGGTTTTTGTTTCTGAGGGGAAT
GGTTCGTTCTGTTTTGTAAGGACATGCAGAGAATTCGAAGCAGAGGAGGAAGAAGCCAAGTGGTTTAGAAGCTCTGTGGTTTTTTTTTTCTTTTCTTTTTCATCTTTGCT
CTAATACGACAACCCCAATACGTACAAACGCACGTATTCCTCTACCAAAACGACACACTAAAACATTCCTTGGAATTGGAACTGGCCGTTTTCTACTCTTTCTTCCTCTC
AAATGTTATTAGCGGCTTTTGTTGCTTTATTTCTCCTCGCTTCGACTTCCTCTGTTCTGTCGGCACCACCCATTGCGGCGGAGAATTTCAGACCCGCCGCCGCAGAGTTC
CAGAAATTGAGCGGCGTTAATGACTACTTGACGAACATCAATAAACCCCCCATCAAAACAATTCAGAGCCCAGATGGGGATTTAATTGATTGCGTGCTTTCTCATCTTCA
ACCTGCCTTTGATCATCATAAGCTCAAAGGGCAACTTCCATTGGATCCCCCAGATAGGCCAAAAGGTTACAACTCCTCTGCCGATTCAGTTGCAGAGATCTTCCAGCTAT
GGCGGAAAACTGGTGAATCGTGCCCTGAAGGAACTGTTCCCATTAGAAGAACTACAGAACAAGACATTTTAAGAGCAAGCTCTGTCCAAACATTTGGGAGAAAGCCATTA
AAATCTATCAGAAGGGACTCAACAGGCAGTGGCCATGAGCATGCTGTTGTGTTTGTTAATGGAGAACAATACTATGGAGCAAAAGCAAACATAAATGTTTGGGCACCCCA
TGTGAGTGATCAATATGAGTTCAGCTTGTCTCAAATCTGGCTTATTTCTGGATCATTTAATAATGATCTGAACACCATTGAAGCTGGTTGGCAGGTTAGTCCTGAGTTGT
ATGGAGATAATTATCCACGATTCTTCACTTATTGGACGACAGATGCATACCAAGCAACAGGCTGCTACAACTTACTGTGCTCTGGTTTTGTCCAAACTAATAACAAGATT
GCCATAGGAGCTGCAATATCTCCAAGATCTTATTACAATGGCAGACAATTTGATGTGGGTTTAATGATTTGGAAGGATCCGAGGCACGGGAACTGGTGGCTGGAATTTGG
GCAGGGTCTGTTAGTAGGGTACTGGCCAGCATTTTTGTTCAGTCACTTGGGAAGCCATGCCAGCATGATCCAATTTGGAGGAGAAATAGTAAACACAAAATCAACAGGGT
TTCACACATCAACACAGATGGGAAGTGGTCATTTTGCAGAAGAAGGCTATGGAAAAGCTTCTTATTTCAGAAATCTGCAAATAATTGATTGGGACAACAGTTTGCTTCCT
GTCTCAAATCTTCATCTATTGGCTGATCATCCAAATTGCTATGACATAAGACAAGGAAAAAACAAGCTTTGGGGCACTTATTTTTACTATGGAGGTCCTGGTAGAAATGT
ACACTGTCCATGAGCAGATCATAATGGCATAGAAGAAGTTTTTCTTTTCTTTTTTCTTTTTTAATTTTTCTTTTTTCCTTTTATTTTTTTGGGTATTTTATTTCTTACCC
ACTTTCTCTATAGGCCTTAGAATGATTTGGGTTGCTTTTATTATTAAATTATATATTTTTTGGTTTCTTTCTTTTTTTTTTTTTTCTATGTAGGCCCATTTTGGGATATT
GTAAAGTTTGGAGATAATAGGATGTATTCAAAATTATGAGGTGATGGGGT
Protein sequenceShow/hide protein sequence
MLLAAFVALFLLASTSSVLSAPPIAAENFRPAAAEFQKLSGVNDYLTNINKPPIKTIQSPDGDLIDCVLSHLQPAFDHHKLKGQLPLDPPDRPKGYNSSADSVAEIFQLW
RKTGESCPEGTVPIRRTTEQDILRASSVQTFGRKPLKSIRRDSTGSGHEHAVVFVNGEQYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLNTIEAGWQVSPELY
GDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMIWKDPRHGNWWLEFGQGLLVGYWPAFLFSHLGSHASMIQFGGEIVNTKSTGF
HTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHPNCYDIRQGKNKLWGTYFYYGGPGRNVHCP