; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg10719 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg10719
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCarg_Chr01:11794819..11796809
RNA-Seq ExpressionCarg10719
SyntenyCarg10719
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608337.1 hypothetical protein SDJN03_01679, partial [Cucurbita argyrosperma subsp. sororia]2.1e-24098.27Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQME NWGV WKMSV
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        EQNE FQVWQRSGSCPNGTIPIRRIRE DLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPK+DLPNDFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKS GCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRI+DYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

KAG6608391.1 hypothetical protein SDJN03_01733, partial [Cucurbita argyrosperma subsp. sororia]1.1e-24499.75Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPK+DLPNDFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

KAG7037688.1 hypothetical protein SDJN02_01318 [Cucurbita argyrosperma subsp. argyrosperma]4.8e-245100Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

XP_022941158.1 uncharacterized protein LOC111446539 [Cucurbita moschata]2.3e-23193.07Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAFAIAAAILQ+ AAIP++N SQQLS QI  KLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHPALKNHTIQMEP+WGVDWKMS 
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        E NEAFQVWQRSGSCPNGTIPIRR+REQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGA+GQ+NVWNPK+DLP+DFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        E+FES+EAGWMVN RLYGDTKTRLSVHWTVDSY+SKGCFDLTCSGFVQTNPKVVLGA+IDPLSTRGGQQFIITVGIFQDP+SSNWWL MQGQPVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRI+DYSLQLKYPVRVGTW DEYSCYSVDNYRSTIPTEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

XP_023524233.1 uncharacterized protein LOC111788198 [Cucurbita pepo subsp. pepo]2.6e-23093.81Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAF IAAAILQSHAAIP INNSQQLSAQI NKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHP LKNHTIQMEPN GV WKMSV
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        EQNEAFQVWQRSGSCPNGTIPIRR+REQDLLRANSLDSFGKKFPY SSKLGKEVNRSTAILYTAGFNYIGA+GQINVWNPK+DL NDFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        EKFES+EAGWMVNRRLYGDT+TRLSVHWT DSYKSKGCFDLTCSGFVQTNPKVVLGA+IDPLSTRGGQQF ITVGIFQDPKSSNWWLN+QG PVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRI+DYSLQLKYPVRVG+WADEYSCYSVDNYR T+ TEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

TrEMBL top hitse value%identityAlignment
A0A0A0L0M3 Uncharacterized protein3.3e-17571.64Show/hide
Query:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----
        +LAMV   +  AI+  +A   ++N S     QI+NKLKLLNKP++ TIYS+DGDI++CVD+YKQPAFDHP LKNHTIQM+P+  +D KMS  QNE+    
Subjt:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----

Query:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK
           FQ WQ+SGSCP GTIPIRR+  +DLLRANSL  FGKKFPY  SKLG+E NRSTAIL T G NYIGA+G INVWNPK+DLPNDFTAS++WLKNGPSEK
Subjt:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK

Query:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF
        FES+EAGWMVN +LYGD KTRLS++WTVDSYK+ GCFDLTCSGFVQTNP V +GA+I+PLS+  GQQ+ I++GIFQDP S NWWL  QG PVGYWP TLF
Subjt:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF

Query:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD
        GYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA G Y+YAS+V++PRI+DYSLQLKYP RVGTWADE SCYSVDNY+ +  TEPVF++GGPG SRD
Subjt:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD

Query:  CH
        CH
Subjt:  CH

A0A1S4DZ87 uncharacterized protein LOC1034938971.3e-17672.39Show/hide
Query:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----
        +L MVA  +  AI+  +A   +++ S  L  QI+NKLKLLNKP++ TIYS+DGD+I CVDIYKQPAFDHP LKNHTIQM+P+  +D KMS  QN++    
Subjt:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----

Query:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK
           FQ+WQ+SGSCP GTIPIRR+R +DLLRANS+  FGKKFPY +SKLG+E NRSTAIL T G NYIGA+G INVWNPK+DLPNDFTAS+IWLKNGPSEK
Subjt:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK

Query:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF
        FES+EAGWMVN +LYGD KTR S++WTVDSYKS GCFDLTCSGFVQTNP V +GA+IDPLS+  GQQ+ I +GIFQDPKS NWWL  Q QPVGYWPPTLF
Subjt:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF

Query:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD
        GYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA G Y+YAS+V+QPRI+DYS+QLKYP +VGTWADE SCYSVDNY+ T  +EPVF++GGPG SRD
Subjt:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD

Query:  CH
        CH
Subjt:  CH

A0A5A7V8M6 Uncharacterized protein1.3e-17672.39Show/hide
Query:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----
        +L MVA  +  AI+  +A   +++ S  L  QI+NKLKLLNKP++ TIYS+DGD+I CVDIYKQPAFDHP LKNHTIQM+P+  +D KMS  QN++    
Subjt:  LLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA----

Query:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK
           FQ+WQ+SGSCP GTIPIRR+R +DLLRANS+  FGKKFPY +SKLG+E NRSTAIL T G NYIGA+G INVWNPK+DLPNDFTAS+IWLKNGPSEK
Subjt:  ---FQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEK

Query:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF
        FES+EAGWMVN +LYGD KTR S++WTVDSYKS GCFDLTCSGFVQTNP V +GA+IDPLS+  GQQ+ I +GIFQDPKS NWWL  Q QPVGYWPPTLF
Subjt:  FESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLF

Query:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD
        GYL +SATLVEWGGEVFSSNIK VPHTGTGMGSGDYA G Y+YAS+V+QPRI+DYS+QLKYP +VGTWADE SCYSVDNY+ T  +EPVF++GGPG SRD
Subjt:  GYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRD

Query:  CH
        CH
Subjt:  CH

A0A6J1FRA8 uncharacterized protein LOC1114465391.1e-23193.07Show/hide
Query:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV
        MRNTYHEGKLLAMVAFAIAAAILQ+ AAIP++N SQQLS QI  KLKLLNKPALHTIY+KDGDIIDCVDIYKQPAFDHPALKNHTIQMEP+WGVDWKMS 
Subjt:  MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSV

Query:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS
        E NEAFQVWQRSGSCPNGTIPIRR+REQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGA+GQ+NVWNPK+DLP+DFTASRIWLKNGPS
Subjt:  EQNEAFQVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPS

Query:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT
        E+FES+EAGWMVN RLYGDTKTRLSVHWTVDSY+SKGCFDLTCSGFVQTNPKVVLGA+IDPLSTRGGQQFIITVGIFQDP+SSNWWL MQGQPVGYWPPT
Subjt:  EKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPT

Query:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS
        LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAG HYKYAS+VRQPRI+DYSLQLKYPVRVGTW DEYSCYSVDNYRSTIPTEPVFFYGGPGRS
Subjt:  LFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRS

Query:  RDCH
        RDCH
Subjt:  RDCH

A0A6J1FT01 uncharacterized protein LOC1114466017.3e-20785.71Show/hide
Query:  MVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEAFQVWQRS
        MVA AIAAAILQ+HAAIP++NNSQQLS QI+ KLKLLNKPALHTIYS+DGDIIDCVDIYKQPAFDHPALKNHTIQMEP+WGVDWKMSVEQNE FQVWQRS
Subjt:  MVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEAFQVWQRS

Query:  GSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMV
        G CP GTIPIRR+REQDLLR NSLDSFGK F Y SSKLG EVNRST+ILYTAG+NYIGA+GQINVWNPK+DLPNDFTASRIWLKNGPSE FES+EAGWMV
Subjt:  GSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMV

Query:  NRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLV
        NRRLYGDTKTR SVHWTVDSYKS GCFDLTCSGFVQTNPK+VLGA+IDP+STRGGQQFII+VG+FQDP+S NWWLN+QG PVGYWPPTLFGYLR+SATLV
Subjt:  NRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLV

Query:  EWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDCH
        EWGGEVFSS++KKVPHT T MGSGDYAG HY++ASYV  PRI+D SLQLKYP RVGTWA+E  CYS DNY+ T  TEPVFFYGGPGRSRDCH
Subjt:  EWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDCH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)3.6e-8943.19Show/hide
Query:  QIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDW----------KMSVEQNEAFQVWQRSGSCPNGTIPIRRIREQDL
        ++K  L  LNKPA+ +I S DGD+IDCV I KQPAFDHP LK+H IQM+PN+  +           K + ++    Q+W R G C  GTIP+RR +E D+
Subjt:  QIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDW----------KMSVEQNEAFQVWQRSGSCPNGTIPIRRIREQDL

Query:  LRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGP-SEKFESIEAGWMVNRRLYGDTKT
        LRA+S+  +GKK     P   S     +N+S    AI Y  G  Y GA   INVW PK+   N+F+ S+IWL  G   +   SIEAGW V+  LYGD  T
Subjt:  LRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGP-SEKFESIEAGWMVNRRLYGDTKT

Query:  RLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-QGQPVGYWPPTLFGYLRNSATLVEWGGEVFSS
        RL  +WT D+Y++ GC++L CSGF+Q N  + +GA I P+S     Q+ I++ I++DPK  +WW+    G  +GYWP  LF YL  SA+++EWGGEV +S
Subjt:  RLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-QGQPVGYWPPTLFGYLRNSATLVEWGGEVFSS

Query:  NIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
              HT T MGSG +    +  ASY R  +++D S  LK P  +GT+ ++ +CY V    S       F+YGGPG+++ C
Subjt:  NIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT2G44210.1 Protein of Unknown Function (DUF239)2.0e-8740.44Show/hide
Query:  LAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWG---------VDWKMSVE
        L M    +A +++       D+        +I+  LK LNKPAL +I S DGD+IDCV I  QPAF HP L NHT+QM P+           V  K   +
Subjt:  LAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWG---------VDWKMSVE

Query:  QNEAF-QVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFG----KKFPYESSKLGKEV----NRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASR
        Q+ A  Q+W  +G CP  TIPIRR R QDL RA+S++++G    K  P   S     V        AI+Y     + GA  +INVW P +++PN+F+ ++
Subjt:  QNEAF-QVWQRSGSCPNGTIPIRRIREQDLLRANSLDSFG----KKFPYESSKLGKEV----NRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASR

Query:  IWLKNGP-SEKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-Q
        IW+  G  +    SIEAGW V+ +LYGD +TRL  +WT D+Y+  GC++L CSGFVQ N ++ +G  I PLS  G  Q+ IT+ I++DPK  +WWL   +
Subjt:  IWLKNGP-SEKFESIEAGWMVNRRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-Q

Query:  GQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEP
           +GYWP +LF YL  SA+++EWGGEV +S  ++  HT T MGSG +A   +  ASY +  +++D S +L+ P  +  + D+ +CY+V +         
Subjt:  GQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEP

Query:  VFFYGGPGRSRDC
         F+YGGPGR+ +C
Subjt:  VFFYGGPGRSRDC

AT3G13510.1 Protein of Unknown Function (DUF239)2.3e-8843.41Show/hide
Query:  SQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNW---GV--DWKMSVE----QNEAFQVWQRSGSCPNGTIPIRRI
        S +   ++K  L  LNKP + TI S DGDIIDC+ I KQPAFDHP LK+H IQM P++   G+  D K+S E    +    Q+W R G C  GTIP+RR 
Subjt:  SQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNW---GV--DWKMSVE----QNEAFQVWQRSGSCPNGTIPIRRI

Query:  REQDLLRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGP-SEKFESIEAGWMVNRRLY
        RE D+LRA+S+  +GKK     P   S     +N++    AI Y  G  Y GA   +NVW PK+   N+F+ S+IWL  G   +   SIEAGW V+  LY
Subjt:  REQDLLRANSLDSFGKK----FPYESSKLGKEVNRS---TAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGP-SEKFESIEAGWMVNRRLY

Query:  GDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-QGQPVGYWPPTLFGYLRNSATLVEWGG
        GD  TRL  +WT D+Y++ GC++L CSGF+Q N  + +GA I P+S     Q+ I++ I++DPK  +WW+    G  +GYWP  LF YL  SA+++EWGG
Subjt:  GDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNM-QGQPVGYWPPTLFGYLRNSATLVEWGG

Query:  EVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        EV +S   +  HT T MGSG +    +  ASY R  +++D S  LK P  +GT+ ++ +CY V    S       F+YGGPG++++C
Subjt:  EVFSSNIKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT5G25950.1 Protein of Unknown Function (DUF239)5.3e-10948.34Show/hide
Query:  SAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA------FQVWQRSGSCPNGTIPIRRIREQDLLR
        S  I  KLK LNKPAL TI S+DGDIIDC+DIYKQ AFDHPALKNH IQM+P+     K +   N         Q+W +SG CP GTIP+RR+  +D+ R
Subjt:  SAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEA------FQVWQRSGSCPNGTIPIRRIREQDLLR

Query:  ANSLDSFGKKFPYESSKLGKEVN-------------------RSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMVN
        A+S   FG+K P++ S L   +                    RS A +   GFN++GA   IN+WNP      D++ ++IWL  G SE FES+E GWMVN
Subjt:  ANSLDSFGKKFPYESSKLGKEVN-------------------RSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMVN

Query:  RRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLVE
          ++GD++TRL + WT D Y   GC +L C+GFVQT+ K  LGA ++P+S+    Q+ ITV IF DP S NWWL  +   +GYWP TLF YL++SAT V+
Subjt:  RRLYGDTKTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLVE

Query:  WGGEVFSSN-IKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        WGGEV S N + K PHT T MGSG +A   +  A +    RI DYS+QLKYP  +  +ADEY+CYS   +R T  +EP F++GGPGR+  C
Subjt:  WGGEVFSSN-IKKVPHTGTGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC

AT5G25960.1 Protein of Unknown Function (DUF239)7.5e-9546.36Show/hide
Query:  SAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNE------AFQVWQRSGSCPNGTIPIRRIREQDLLR
        S  I  KLK LNKP+L TI S+DGDIIDC+DIYKQ AFDHPAL+NH IQM+P+     K +   N         Q+W +SG+CP GTIP           
Subjt:  SAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNE------AFQVWQRSGSCPNGTIPIRRIREQDLLR

Query:  ANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMVNRRLYGDTKTRLSVHWTVDS
                                  A+L   G+N+IGA   INVWNP     +D+++++IWL  G S+ FESIEAGW VN  ++GD++TRL  +WT D 
Subjt:  ANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMVNRRLYGDTKTRLSVHWTVDS

Query:  YKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTG
        Y   GC +L C+GFVQT  K  LGA I+P+ST   +Q  IT     D  S NWWL      +GYWP TLF YL++SAT V+ GGEV S N+ K PHT T 
Subjt:  YKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTGTG

Query:  MGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC
        MGSG +A   +  A Y    RI DYSLQ+KYP  +  +ADEY CYS   +R T  +EP F++GGPG++  C
Subjt:  MGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAACACATATCATGAAGGGAAGCTCTTGGCAATGGTGGCTTTCGCCATTGCTGCTGCCATTCTCCAATCTCATGCCGCCATTCCAGACATCAATAATTCTCAACA
ATTATCTGCGCAGATTAAGAACAAATTAAAGCTTCTCAATAAGCCTGCCCTCCACACCATCTACAGTAAAGATGGAGATATCATCGATTGCGTTGACATTTACAAGCAGC
CTGCTTTTGACCATCCGGCTCTAAAGAATCACACCATTCAGATGGAACCCAATTGGGGCGTCGATTGGAAGATGTCGGTTGAGCAAAACGAGGCGTTTCAAGTATGGCAA
AGAAGTGGGAGTTGTCCCAATGGAACCATTCCAATTCGCAGAATTCGTGAACAAGACTTGTTAAGAGCTAATTCTCTTGATAGCTTTGGAAAGAAATTTCCTTATGAAAG
CTCCAAACTCGGGAAAGAAGTCAATCGTTCTACGGCCATCCTGTATACGGCAGGGTTCAATTACATTGGCGCTACAGGACAGATTAATGTTTGGAACCCTAAACTTGATT
TGCCTAATGATTTCACAGCTTCAAGAATTTGGTTGAAAAATGGGCCTTCTGAAAAATTTGAAAGCATAGAAGCAGGCTGGATGGTTAATCGAAGGTTATATGGAGATACA
AAAACTCGTCTTAGCGTACATTGGACAGTGGACTCCTACAAATCGAAAGGGTGCTTTGATTTGACTTGCAGTGGGTTTGTCCAAACGAACCCGAAAGTGGTGCTTGGTGC
AATCATTGACCCATTGTCGACCAGAGGTGGACAACAGTTCATTATCACCGTTGGTATCTTTCAGGATCCTAAGTCAAGCAACTGGTGGCTGAATATGCAAGGGCAACCAG
TGGGATATTGGCCGCCGACGCTATTTGGATACTTACGCAACAGCGCAACACTGGTGGAATGGGGCGGGGAGGTGTTTAGCTCAAACATAAAGAAAGTGCCACACACGGGG
ACGGGCATGGGGAGCGGAGATTATGCAGGTGGGCATTACAAGTACGCTAGCTACGTGAGGCAGCCAAGGATCCTGGACTATTCGCTACAGTTGAAGTATCCGGTGAGAGT
TGGAACTTGGGCTGATGAGTATTCTTGCTACTCTGTTGATAATTATCGAAGTACAATCCCAACTGAACCTGTTTTCTTCTATGGCGGTCCTGGACGCAGCCGTGACTGCC
ATTGA
mRNA sequenceShow/hide mRNA sequence
CAAGAAAACAACTAAAAATCAACTTTTGTATTGGGTTCCTTCGGAGAAAATGAGGAACACATATCATGAAGGGAAGCTCTTGGCAATGGTGGCTTTCGCCATTGCTGCTG
CCATTCTCCAATCTCATGCCGCCATTCCAGACATCAATAATTCTCAACAATTATCTGCGCAGATTAAGAACAAATTAAAGCTTCTCAATAAGCCTGCCCTCCACACCATC
TACAGTAAAGATGGAGATATCATCGATTGCGTTGACATTTACAAGCAGCCTGCTTTTGACCATCCGGCTCTAAAGAATCACACCATTCAGATGGAACCCAATTGGGGCGT
CGATTGGAAGATGTCGGTTGAGCAAAACGAGGCGTTTCAAGTATGGCAAAGAAGTGGGAGTTGTCCCAATGGAACCATTCCAATTCGCAGAATTCGTGAACAAGACTTGT
TAAGAGCTAATTCTCTTGATAGCTTTGGAAAGAAATTTCCTTATGAAAGCTCCAAACTCGGGAAAGAAGTCAATCGTTCTACGGCCATCCTGTATACGGCAGGGTTCAAT
TACATTGGCGCTACAGGACAGATTAATGTTTGGAACCCTAAACTTGATTTGCCTAATGATTTCACAGCTTCAAGAATTTGGTTGAAAAATGGGCCTTCTGAAAAATTTGA
AAGCATAGAAGCAGGCTGGATGGTTAATCGAAGGTTATATGGAGATACAAAAACTCGTCTTAGCGTACATTGGACAGTGGACTCCTACAAATCGAAAGGGTGCTTTGATT
TGACTTGCAGTGGGTTTGTCCAAACGAACCCGAAAGTGGTGCTTGGTGCAATCATTGACCCATTGTCGACCAGAGGTGGACAACAGTTCATTATCACCGTTGGTATCTTT
CAGGATCCTAAGTCAAGCAACTGGTGGCTGAATATGCAAGGGCAACCAGTGGGATATTGGCCGCCGACGCTATTTGGATACTTACGCAACAGCGCAACACTGGTGGAATG
GGGCGGGGAGGTGTTTAGCTCAAACATAAAGAAAGTGCCACACACGGGGACGGGCATGGGGAGCGGAGATTATGCAGGTGGGCATTACAAGTACGCTAGCTACGTGAGGC
AGCCAAGGATCCTGGACTATTCGCTACAGTTGAAGTATCCGGTGAGAGTTGGAACTTGGGCTGATGAGTATTCTTGCTACTCTGTTGATAATTATCGAAGTACAATCCCA
ACTGAACCTGTTTTCTTCTATGGCGGTCCTGGACGCAGCCGTGACTGCCATTGATATCATCTAAATCATTTAATAGCGTGGAAAAAGTTGTCCCTTCAATGTGTTTCCTT
AATATATCCTCTATCCTTTTTGCAACTTACTTTAAAAATTGTCTAATACATCTCTAAATTTTCGTTATATGATCTTTTTTTTTAATATAAACTTGATAACTTGTGGACTA
AACATGTAATTTAACCTAAAAATTATCAA
Protein sequenceShow/hide protein sequence
MRNTYHEGKLLAMVAFAIAAAILQSHAAIPDINNSQQLSAQIKNKLKLLNKPALHTIYSKDGDIIDCVDIYKQPAFDHPALKNHTIQMEPNWGVDWKMSVEQNEAFQVWQ
RSGSCPNGTIPIRRIREQDLLRANSLDSFGKKFPYESSKLGKEVNRSTAILYTAGFNYIGATGQINVWNPKLDLPNDFTASRIWLKNGPSEKFESIEAGWMVNRRLYGDT
KTRLSVHWTVDSYKSKGCFDLTCSGFVQTNPKVVLGAIIDPLSTRGGQQFIITVGIFQDPKSSNWWLNMQGQPVGYWPPTLFGYLRNSATLVEWGGEVFSSNIKKVPHTG
TGMGSGDYAGGHYKYASYVRQPRILDYSLQLKYPVRVGTWADEYSCYSVDNYRSTIPTEPVFFYGGPGRSRDCH