; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G017580 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G017580
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCmo_Chr01:13072565..13074431
RNA-Seq ExpressionCmoCh01G017580
SyntenyCmoCh01G017580
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608337.1 hypothetical protein SDJN03_01679, partial [Cucurbita argyrosperma subsp. sororia]3.6e-21988.33Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAFAIA       AAILQSHAAIPDINNSQQLSAQI NKLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHPALKNHTI ME +WG
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        V W+MS E NE FQVWQRSGSCPNGTIPIRR+RE DLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGA+GQ+NVWNPKVDLP+DFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSE+FES+EAGWMVN RLYGDTKTRLSVHWTV S    Y+S G    T       GFVQTNPKVVLGA+IDPLSTRGGQQFIITVGIFQDP+SSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWL MQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRIVDYSLQLKYPVRVGTW DEYSCYSVDNYRS
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        TIPTEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

KAG6608391.1 hypothetical protein SDJN03_01733, partial [Cucurbita argyrosperma subsp. sororia]7.8e-22289.05Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAFAIA       AAILQSHAAIPDINNSQQLSAQI NKLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHPALKNHTI MEP+WG
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        VDW+MS E NEAFQVWQRSGSCPNGTIPIRR+REQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGA+GQ+NVWNPKVDLP+DFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSE+FES+EAGWMVN RLYGDTKTRLSVHWTV S    Y+S G    T       GFVQTNPKVVLGA+IDPLSTRGGQQFIITVGIFQDP+SSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWL MQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRI+DYSLQLKYPVRVGTW DEYSCYSVDNYRS
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        TIPTEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

KAG7037688.1 hypothetical protein SDJN02_01318 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-22188.81Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAFAIA       AAILQSHAAIPDINNSQQLSAQI NKLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHPALKNHTI MEP+WG
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        VDW+MS E NEAFQVWQRSGSCPNGTIPIRR+REQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGA+GQ+NVWNPK+DLP+DFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSE+FES+EAGWMVN RLYGDTKTRLSVHWTV S    Y+S G    T       GFVQTNPKVVLGA+IDPLSTRGGQQFIITVGIFQDP+SSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWL MQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRI+DYSLQLKYPVRVGTW DEYSCYSVDNYRS
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        TIPTEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

XP_022941158.1 uncharacterized protein LOC111446539 [Cucurbita moschata]1.6e-22792.14Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAFAIA       AAILQ+ AAIP++N SQQLS QIH KLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTI MEPDWG
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        VDW+MSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTV S    YQS G    T       GFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        TIPTEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

XP_023524233.1 uncharacterized protein LOC111788198 [Cucurbita pepo subsp. pepo]6.0e-21487.14Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAF IA       AAILQSHAAIP INNSQQLSAQIHNKLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHP LKNHTI MEP+ G
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        V W+MS E NEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPY SSKLGKEVNRSTAILYTAGFNYIGASGQ+NVWNPKVDL +DFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSE+FESVEAGWMVN RLYGDT+TRLSVHWT  S    Y+S G    T       GFVQTNPKVVLGAVIDPLSTRGGQQF ITVGIFQDP+SSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWL +QG PVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVG+W DEYSCYSVDNYR 
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        T+ TEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

TrEMBL top hitse value%identityAlignment
A0A0A0L0M3 Uncharacterized protein1.8e-16366.75Show/hide
Query:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH
        +LAMV   +  AI+  +A  ++ + ++           QI NKLKLLNKP++ TIY++DGDI++CVD+YKQPAFDHP LKNHTI M+PD  +D +MS+  
Subjt:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH

Query:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL
        NE+       FQ WQ+SGSCP GTIPIRRV  +DLLRANSL  FGKKFPY  SKLG+E NRSTAIL T G NYIGASG +NVWNPKVDLP+DFTAS++WL
Subjt:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL

Query:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW
        KNGPSE+FESVEAGWMVNP+LYGD KTRLS++WTV S    Y++ G    T       GFVQTNP V +GAVI+PLS+  GQQ+ I++GIFQDP S NWW
Subjt:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW

Query:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI
        LK QG PVGYWP TLFGYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA   Y+YASFV++PRIVDYSLQLKYP RVGTW DE SCYSVDNY+ + 
Subjt:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI

Query:  PTEPVFFYGGPGRSRDCH
         TEPVF++GGPG SRDCH
Subjt:  PTEPVFFYGGPGRSRDCH

A0A1S4DZ87 uncharacterized protein LOC1034938972.1e-16467.7Show/hide
Query:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH
        +L MVA  +  AI+  +A  ++      D++N      QI NKLKLLNKP++ TIY++DGD+I CVDIYKQPAFDHP LKNHTI M+PD  +D +MS   
Subjt:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH

Query:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL
        N++       FQ+WQ+SGSCP GTIPIRRVR +DLLRANS+  FGKKFPY +SKLG+E NRSTAIL T G NYIGASG +NVWNPKVDLP+DFTAS+IWL
Subjt:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL

Query:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW
        KNGPSE+FESVEAGWMVNP+LYGD KTR S++WTV S    Y+S G    T       GFVQTNP V +GAVIDPLS+  GQQ+ I +GIFQDP+S NWW
Subjt:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW

Query:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI
        LK Q QPVGYWPPTLFGYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA   Y+YASFV+QPRIVDYS+QLKYP +VGTW DE SCYSVDNY+ T 
Subjt:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI

Query:  PTEPVFFYGGPGRSRDCH
         +EPVF++GGPG SRDCH
Subjt:  PTEPVFFYGGPGRSRDCH

A0A5A7V8M6 Uncharacterized protein2.1e-16467.7Show/hide
Query:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH
        +L MVA  +  AI+  +A  ++      D++N      QI NKLKLLNKP++ TIY++DGD+I CVDIYKQPAFDHP LKNHTI M+PD  +D +MS   
Subjt:  LLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEH

Query:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL
        N++       FQ+WQ+SGSCP GTIPIRRVR +DLLRANS+  FGKKFPY +SKLG+E NRSTAIL T G NYIGASG +NVWNPKVDLP+DFTAS+IWL
Subjt:  NEA-------FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWL

Query:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW
        KNGPSE+FESVEAGWMVNP+LYGD KTR S++WTV S    Y+S G    T       GFVQTNP V +GAVIDPLS+  GQQ+ I +GIFQDP+S NWW
Subjt:  KNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWW

Query:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI
        LK Q QPVGYWPPTLFGYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA   Y+YASFV+QPRIVDYS+QLKYP +VGTW DE SCYSVDNY+ T 
Subjt:  LKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTI

Query:  PTEPVFFYGGPGRSRDCH
         +EPVF++GGPG SRDCH
Subjt:  PTEPVFFYGGPGRSRDCH

A0A6J1FRA8 uncharacterized protein LOC1114465397.8e-22892.14Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG
        MRNTYHEGKLLAMVAFAIA       AAILQ+ AAIP++N SQQLS QIH KLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTI MEPDWG
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWG

Query:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
        VDW+MSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI
Subjt:  VDWRMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRI

Query:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
        WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTV S    YQS G    T       GFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN
Subjt:  WLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSN

Query:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
        WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS
Subjt:  WWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRS

Query:  TIPTEPVFFYGGPGRSRDCH
        TIPTEPVFFYGGPGRSRDCH
Subjt:  TIPTEPVFFYGGPGRSRDCH

A0A6J1FT01 uncharacterized protein LOC1114466011.0e-19079.41Show/hide
Query:  MVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNEA
        MVA AIA       AAILQ+HAAIP++NNSQQLS QI  KLKLLNKPALHTIY++DGDIIDCVDIYKQPAFDHPALKNHTI MEPDWGVDW+MS E NE 
Subjt:  MVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNEA

Query:  FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFES
        FQVWQRSG CP GTIPIRRVREQDLLR NSLDSFGK F Y SSKLG EVNRST+ILYTAG+NYIGASGQ+NVWNPKVDLP+DFTASRIWLKNGPSE FES
Subjt:  FQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFES

Query:  VEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGY
        VEAGWMVN RLYGDTKTR SVHWTV S    Y+S G    T       GFVQTNPK+VLGAVIDP+STRGGQQFII+VG+FQDP+S NWWL +QG PVGY
Subjt:  VEAGWMVNPRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGY

Query:  WPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGG
        WPPTLFGYLR+SATLVEWGGEVFSS++KKVPHT T MGSGDYAG HY++AS+V  PRIVD SLQLKYP RVGTW +E  CYS DNY+ T  TEPVFFYGG
Subjt:  WPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGG

Query:  PGRSRDCH
        PGRSRDCH
Subjt:  PGRSRDCH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)9.2e-8039.64Show/hide
Query:  QIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDW----------RMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDL
        ++   L  LNKPA+ +I + DGD+IDCV I KQPAFDHP LK+H I M+P++  +           + + +     Q+W R G C  GTIP+RR +E D+
Subjt:  QIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDW----------RMSTEHNEAFQVWQRSGSCPNGTIPIRRVREQDL

Query:  LRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLYGDTKT
        LRA+S+  +GKK     P   S     +N+S    AI Y  G  Y GA   +NVW PK+   ++F+ S+IWL  G   +   S+EAGW V+P LYGD  T
Subjt:  LRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLYGDTKT

Query:  RLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRNSATLV
        RL  +WT  +    YQ+ G         L  GF+Q N  + +GA I P+S     Q+ I++ I++DP+  +WW++   G  +GYWP  LF YL  SA+++
Subjt:  RLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRNSATLV

Query:  EWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        EWGGEV +S      HT T MGSG +  E +  AS+ R  ++VD S  LK P  +GT+ ++ +CY V    S       F+YGGPG+++ C
Subjt:  EWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT2G44210.1 Protein of Unknown Function (DUF239)9.2e-8039.54Show/hide
Query:  QIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMS----------TEHNEAFQVWQRSGSCPNGTIPIRRVREQDL
        +I   LK LNKPAL +I + DGD+IDCV I  QPAF HP L NHT+ M P    +   S           + N   Q+W  +G CP  TIPIRR R QDL
Subjt:  QIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMS----------TEHNEAFQVWQRSGSCPNGTIPIRRVREQDL

Query:  LRANSLDSFG----KKFPYESSKLGKEV----NRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLYGDTK
         RA+S++++G    K  P   S     V        AI+Y     + GA  ++NVW P V++P++F+ ++IW+  G  +    S+EAGW V+P+LYGD +
Subjt:  LRANSLDSFG----KKFPYESSKLGKEV----NRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLYGDTK

Query:  TRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRNSATL
        TRL  +WT  +    YQ  G         L  GFVQ N ++ +G  I PLS  G  Q+ IT+ I++DP+  +WWL+  +   +GYWP +LF YL  SA++
Subjt:  TRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRNSATL

Query:  VEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        +EWGGEV +S  ++  HT T MGSG +A E +  AS+ +  ++VD S +L+ P  +  + D+ +CY+V +          F+YGGPGR+ +C
Subjt:  VEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT3G13510.1 Protein of Unknown Function (DUF239)9.2e-8040.66Show/hide
Query:  SQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDW---GV--DWRMSTE----HNEAFQVWQRSGSCPNGTIPIRRV
        S +   ++   L  LNKP + TI + DGDIIDC+ I KQPAFDHP LK+H I M P +   G+  D ++S E         Q+W R G C  GTIP+RR 
Subjt:  SQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDW---GV--DWRMSTE----HNEAFQVWQRSGSCPNGTIPIRRV

Query:  REQDLLRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLY
        RE D+LRA+S+  +GKK     P   S     +N++    AI Y  G  Y GA   LNVW PK+   ++F+ S+IWL  G   +   S+EAGW V+P LY
Subjt:  REQDLLRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGP-SERFESVEAGWMVNPRLY

Query:  GDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRN
        GD  TRL  +WT  +    YQ+ G         L  GF+Q N  + +GA I P+S     Q+ I++ I++DP+  +WW++   G  +GYWP  LF YL  
Subjt:  GDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKM-QGQPVGYWPPTLFGYLRN

Query:  SATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        SA+++EWGGEV +S   +  HT T MGSG +  E +  AS+ R  ++VD S  LK P  +GT+ ++ +CY V    S       F+YGGPG++++C
Subjt:  SATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT5G25950.1 Protein of Unknown Function (DUF239)5.0e-10246.5Show/hide
Query:  SAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNEA------FQVWQRSGSCPNGTIPIRRVREQDLLR
        S  I  KLK LNKPAL TI ++DGDIIDC+DIYKQ AFDHPALKNH I M+P      + +T  N         Q+W +SG CP GTIP+RRV  +D+ R
Subjt:  SAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNEA------FQVWQRSGSCPNGTIPIRRVREQDLLR

Query:  ANSLDSFGKKFPYESSKLGKEVN-------------------RSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFESVEAGWMVN
        A+S   FG+K P++ S L   +                    RS A +   GFN++GA   +N+WNP     +D++ ++IWL  G SE FESVE GWMVN
Subjt:  ANSLDSFGKKFPYESSKLGKEVN-------------------RSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFESVEAGWMVN

Query:  PRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGYWPPTLFGY
        P ++GD++TRL + WT         + G +K      L  GFVQT+ K  LGA ++P+S+    Q+ ITV IF DP S NWWL  +   +GYWP TLF Y
Subjt:  PRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGYWPPTLFGY

Query:  LRNSATLVEWGGEVFSSN-IKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        L++SAT V+WGGEV S N + K PHT T MGSG +A   +  A F    RI DYS+QLKYP  +  + DEY+CYS   +R T  +EP F++GGPGR+  C
Subjt:  LRNSATLVEWGGEVFSSN-IKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT5G25960.1 Protein of Unknown Function (DUF239)1.2e-8743.68Show/hide
Query:  SAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNE------AFQVWQRSGSCPNGTIPIRRVREQDLLR
        S  I  KLK LNKP+L TI ++DGDIIDC+DIYKQ AFDHPAL+NH I M+P      + +T  N         Q+W +SG+CP GTIP           
Subjt:  SAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHNE------AFQVWQRSGSCPNGTIPIRRVREQDLLR

Query:  ANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRS
                                  A+L   G+N+IGA   +NVWNP     SD+++++IWL  G S+ FES+EAGW VNP ++GD++TRL  +WT   
Subjt:  ANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFESVEAGWMVNPRLYGDTKTRLSVHWTVRS

Query:  RVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNI
                G SK      L  GFVQT  K  LGA I+P+ST   +Q  IT     D  S NWWL      +GYWP TLF YL++SAT V+ GGEV S N+
Subjt:  RVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNI

Query:  KKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
         K PHT T MGSG +A   +  A +    RI DYSLQ+KYP  +  + DEY CYS   +R T  +EP F++GGPG++  C
Subjt:  KKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAACACATATCATGAAGGGAAGCTCTTGGCAATGGTGGCTTTCGCCATTGCTGCTGCCATTCTCCAATCTCATGCCGCCATTCTCCAATCTCATGCCGCCATTCC
AGACATCAATAATTCTCAACAATTATCTGCGCAGATTCACAACAAATTAAAGCTTCTCAATAAGCCTGCCCTCCACACCATCTACACTAAAGATGGAGATATCATCGATT
GTGTTGACATTTACAAGCAGCCTGCTTTTGACCATCCCGCTCTAAAGAACCACACCATTCTGATGGAACCCGATTGGGGCGTCGATTGGAGGATGTCGACCGAGCATAAC
GAGGCCTTTCAGGTATGGCAAAGAAGTGGGAGTTGTCCCAATGGAACCATTCCAATTCGCAGAGTTCGTGAACAAGACTTATTAAGAGCCAATTCTCTTGATAGCTTTGG
AAAGAAATTTCCTTATGAAAGCTCCAAACTCGGGAAAGAAGTCAATCGTTCGACGGCGATCCTGTATACGGCCGGCTTCAATTACATTGGCGCTTCAGGACAGCTTAATG
TTTGGAACCCTAAAGTTGATTTGCCGAGTGATTTCACAGCTTCAAGAATTTGGTTGAAAAATGGGCCTTCGGAAAGATTTGAAAGCGTAGAAGCCGGCTGGATGGTTAAT
CCAAGGTTGTATGGAGATACGAAAACTCGTCTTAGTGTACATTGGACAGTAAGATCGAGGGTTTTCATTTATCAGAGCGTGGGTTTGTCCAAACGAACCCGAAAGTGGTG
CTTGGTGCATGGGTTTGTCCAAACGAACCCGAAAGTGGTGCTTGGTGCAGTTATTGACCCATTGTCGACCAGAGGTGGACAACAGTTCATTATCACGGTCGGTATCTTTC
AGGATCCTCAGTCAAGCAACTGGTGGCTGAAAATGCAAGGGCAACCAGTGGGGTATTGGCCGCCGACGCTATTTGGATACTTGCGCAACAGCGCGACACTGGTGGAATGG
GGCGGGGAGGTGTTTAGCTCAAACATAAAGAAAGTGCCACACACGGGGACGGGCATGGGGAGCGGAGACTATGCAGGGGAGCATTACAAGTACGCTAGCTTCGTGAGGCA
GCCAAGGATCGTGGACTATTCGCTACAGTTGAAGTATCCGGTGAGAGTTGGAACTTGGGTTGATGAGTATTCTTGCTACTCTGTTGATAATTATCGAAGTACAATCCCAA
CTGAACCTGTTTTCTTCTATGGCGGTCCTGGACGCAGCCGTGACTGCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAACACATATCATGAAGGGAAGCTCTTGGCAATGGTGGCTTTCGCCATTGCTGCTGCCATTCTCCAATCTCATGCCGCCATTCTCCAATCTCATGCCGCCATTCC
AGACATCAATAATTCTCAACAATTATCTGCGCAGATTCACAACAAATTAAAGCTTCTCAATAAGCCTGCCCTCCACACCATCTACACTAAAGATGGAGATATCATCGATT
GTGTTGACATTTACAAGCAGCCTGCTTTTGACCATCCCGCTCTAAAGAACCACACCATTCTGATGGAACCCGATTGGGGCGTCGATTGGAGGATGTCGACCGAGCATAAC
GAGGCCTTTCAGGTATGGCAAAGAAGTGGGAGTTGTCCCAATGGAACCATTCCAATTCGCAGAGTTCGTGAACAAGACTTATTAAGAGCCAATTCTCTTGATAGCTTTGG
AAAGAAATTTCCTTATGAAAGCTCCAAACTCGGGAAAGAAGTCAATCGTTCGACGGCGATCCTGTATACGGCCGGCTTCAATTACATTGGCGCTTCAGGACAGCTTAATG
TTTGGAACCCTAAAGTTGATTTGCCGAGTGATTTCACAGCTTCAAGAATTTGGTTGAAAAATGGGCCTTCGGAAAGATTTGAAAGCGTAGAAGCCGGCTGGATGGTTAAT
CCAAGGTTGTATGGAGATACGAAAACTCGTCTTAGTGTACATTGGACAGTAAGATCGAGGGTTTTCATTTATCAGAGCGTGGGTTTGTCCAAACGAACCCGAAAGTGGTG
CTTGGTGCATGGGTTTGTCCAAACGAACCCGAAAGTGGTGCTTGGTGCAGTTATTGACCCATTGTCGACCAGAGGTGGACAACAGTTCATTATCACGGTCGGTATCTTTC
AGGATCCTCAGTCAAGCAACTGGTGGCTGAAAATGCAAGGGCAACCAGTGGGGTATTGGCCGCCGACGCTATTTGGATACTTGCGCAACAGCGCGACACTGGTGGAATGG
GGCGGGGAGGTGTTTAGCTCAAACATAAAGAAAGTGCCACACACGGGGACGGGCATGGGGAGCGGAGACTATGCAGGGGAGCATTACAAGTACGCTAGCTTCGTGAGGCA
GCCAAGGATCGTGGACTATTCGCTACAGTTGAAGTATCCGGTGAGAGTTGGAACTTGGGTTGATGAGTATTCTTGCTACTCTGTTGATAATTATCGAAGTACAATCCCAA
CTGAACCTGTTTTCTTCTATGGCGGTCCTGGACGCAGCCGTGACTGCCATTGA
Protein sequenceShow/hide protein sequence
MRNTYHEGKLLAMVAFAIAAAILQSHAAILQSHAAIPDINNSQQLSAQIHNKLKLLNKPALHTIYTKDGDIIDCVDIYKQPAFDHPALKNHTILMEPDWGVDWRMSTEHN
EAFQVWQRSGSCPNGTIPIRRVREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGASGQLNVWNPKVDLPSDFTASRIWLKNGPSERFESVEAGWMVN
PRLYGDTKTRLSVHWTVRSRVFIYQSVGLSKRTRKWCLVHGFVQTNPKVVLGAVIDPLSTRGGQQFIITVGIFQDPQSSNWWLKMQGQPVGYWPPTLFGYLRNSATLVEW
GGEVFSSNIKKVPHTGTGMGSGDYAGEHYKYASFVRQPRIVDYSLQLKYPVRVGTWVDEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDCH