; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026293 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026293
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCLTH domain-containing protein
Genome locationtig00153031:3701092..3706914
RNA-Seq ExpressionSgr026293
SyntenySgr026293
Gene Ontology termsGO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsIPR006595 - CTLH, C-terminal LisH motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595041.1 hypothetical protein SDJN03_11594, partial [Cucurbita argyrosperma subsp. sororia]6.2e-30092.24Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSS+SYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G+LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG  ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDSIHVGNSRTL  NKN GIVERSKRKRWRGR DDR LHDVSYSGCSK ELST TVASTTM+KEQQNLEKH+P +STGKEDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

KAG7027064.1 hypothetical protein SDJN02_11073 [Cucurbita argyrosperma subsp. argyrosperma]9.0e-29992.07Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPS SPSSLSS+SYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G+LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG  ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDSIHVGNSRTL  NKN GIVERSKRKRWRGR DDR LHDVSYSGCSK ELST TVASTTM+KEQQNLEKH+P +STGKEDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

XP_022132833.1 uncharacterized protein LOC111005585 isoform X1 [Momordica charantia]1.6e-30092.06Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIR IRRSL+AG ID AI LLRLHAPFILDDHRLLFRL KQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS
        VREYCIYRGIVDSGRG L GMQNLSSSSK+NQSE EYCSSRNCSFEVD+ATSKLSDGEISV NSRVDSSPENIADVTSSQGTDIELRYAFEPT+NREDCS
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS

Query:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA
        TSDSIHVGNSRTL VNKNRGIVERSKRKRWRGR DDRGLHDVSYSGCSKQELST TVAS T++K+QQNLEK LP EST KEDKYEIVLGIRE+ASKRLAA
Subjt:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA

Query:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        EVVEEINALDPNFF+QNPI LFQLKQVEF KLVS+GDYSS LRVACTHLGPLAANDPSLLKQLKETLLALLLPNED+LGKGFPI+ALANSLQ
Subjt:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

XP_022963234.1 uncharacterized protein LOC111463509 [Cucurbita moschata]9.6e-30192.24Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIE LRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G+LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYAT KLSDGEISVSNSRVDSSPENIADVTSSQG  ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDSIHVGNSRTL  NKNRGIVERSKRKRWRGR DDR LHDVSYSGCSK ELST TVASTTM+KE+QNLEKH+P +STGKEDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

XP_023003547.1 uncharacterized protein LOC111497115 [Cucurbita maxima]1.5e-29891.74Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GTDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ   ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GTDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDS+HVGNSRTL  NKNRGIVERSKRKRWRGR DDR LHDVSYSGCSK ELS  TVAS  M+KEQQNLEKH+P +STG+EDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

TrEMBL top hitse value%identityAlignment
A0A6J1BXE9 uncharacterized protein LOC111005585 isoform X18.0e-30192.06Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIR IRRSL+AG ID AI LLRLHAPFILDDHRLLFRL KQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS
        VREYCIYRGIVDSGRG L GMQNLSSSSK+NQSE EYCSSRNCSFEVD+ATSKLSDGEISV NSRVDSSPENIADVTSSQGTDIELRYAFEPT+NREDCS
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS

Query:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA
        TSDSIHVGNSRTL VNKNRGIVERSKRKRWRGR DDRGLHDVSYSGCSKQELST TVAS T++K+QQNLEK LP EST KEDKYEIVLGIRE+ASKRLAA
Subjt:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA

Query:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        EVVEEINALDPNFF+QNPI LFQLKQVEF KLVS+GDYSS LRVACTHLGPLAANDPSLLKQLKETLLALLLPNED+LGKGFPI+ALANSLQ
Subjt:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

A0A6J1GF09 uncharacterized protein LOC111453581 isoform X21.7e-29589.77Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPS----SPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQ
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS    SPSPSPSPSSLSSSSYHSRLIIRQIRR L++GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPS----SPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQ

Query:  VTLIMLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPV
                    KFIELLRKGT EDR  AI+C+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW E RRFDIAGLMSSVLRAHMQAYDPV
Subjt:  VTLIMLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPV

Query:  FSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSV
        FSMTLRYLISIHKGFCFREGV SPISDLTERLLLDERDPPATP+ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSV
Subjt:  FSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSV

Query:  LDELVREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNR
        LDELVREYCIYRGIVDSGRG+LSGMQN SSS KV+QSE EYCSSRNCS EVDYATSKLSDGEISV+NSRVDSSPENIADVTSSQGT+ +LRY+ EPTSNR
Subjt:  LDELVREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNR

Query:  EDCSTSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASK
        EDCSTSDSIHVGNSRTL VNKNRGIVERSKRKRWRGR DDR L D+SYSGCSKQE+STTTV STTM+KEQQNLEKHLP ESTGK+DKYEIVLGIRELASK
Subjt:  EDCSTSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASK

Query:  RLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        RLAAEVVEEINA+DP FF QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDILGKGFPI++LANSLQ
Subjt:  RLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

A0A6J1HJH4 uncharacterized protein LOC1114635094.7e-30192.24Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIE LRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G+LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYAT KLSDGEISVSNSRVDSSPENIADVTSSQG  ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-TDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDSIHVGNSRTL  NKNRGIVERSKRKRWRGR DDR LHDVSYSGCSK ELST TVASTTM+KE+QNLEKH+P +STGKEDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

A0A6J1ISM6 uncharacterized protein LOC111478049 isoform X11.9e-29490.2Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIRQIRRSL++GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGT EDR  AIQC+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW E RRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCFREGV SPISDLTERLLLDERDPPATP ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS
        VREYCIYRGIVDSGRG+LSGMQN SSS KV+QSE EYCSSRNCS EVDYATSKLSDGEISV+NSRVDSSPENIADVTSSQGT+ +LRY+ EP SNREDCS
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCS

Query:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA
        TSD IHVGNSRTL VNKNRGIVERSKRKRWRGR DDR L D+SYSGCSKQE+STTTVASTTM+KEQQNLEKHLP ESTGK+DKYEIVLGIRELASKRLAA
Subjt:  TSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAA

Query:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        EVVEEINA+DP FF QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDI GKGFPI+ALANSLQ
Subjt:  EVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

A0A6J1KMW1 uncharacterized protein LOC1114971157.5e-29991.74Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI
        MDSIPLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSL+ GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ    
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLI

Query:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
                KFIELLRKGTAEDRD AIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Subjt:  MLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMT

Query:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
        LRYLISIHKGFCF EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL
Subjt:  LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDEL

Query:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GTDIELRYAFEPTSNREDC
        VREYCIYRGIVDSG G LSGMQNLSSSSKVNQSE EYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ   ++ELRYA EPTSNREDC
Subjt:  VREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GTDIELRYAFEPTSNREDC

Query:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA
        STSDS+HVGNSRTL  NKNRGIVERSKRKRWRGR DDR LHDVSYSGCSK ELS  TVAS  M+KEQQNLEKH+P +STG+EDKYEIVLGIRELASKRLA
Subjt:  STSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLA

Query:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        AEVVEEINALDPNFFVQNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP++ALANSLQ
Subjt:  AEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G66810.1 CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595)1.8e-18360.54Show/hide
Query:  MDSIPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQK
        MDS P+NWEALDALIIDF  SENL+ED+ ++     SP SSPS S SP S+SSSSYHSRLIIR+IR S+++GDI+ AID+LR HAPF+LDDHR+LFRLQK
Subjt:  MDSIPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQK

Query:  QVTLIMLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDP
        Q            KFIELLRKGT E   +AI CLRT +APCALDAYPEAYEEFKHVLLA IYDKD+QTSPV  EW+E+RR+++AGLMSSVLRA +QAYDP
Subjt:  QVTLIMLRLLLHNKFIELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDP

Query:  VFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLS
        VFSMTLRYLISIHKGFCF +G+SS +SDLT RLLL+ERD PATP ES+YE PPFDEVDIQALAHAVELTRQGA+DS++F KGDLF AFQNELCRM+LD+S
Subjt:  VFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLS

Query:  VLDELVREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSN
        VLDELV+EYCIYRGIVD      S MQ ++  +K NQSE     SR+CS E+D  TS+ SD E   + S +D S     +++  +G D+  RY  EPTS 
Subjt:  VLDELVREYCIYRGIVDSGRGSLSGMQNLSSSSKVNQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSN

Query:  REDCSTSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRG-LHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELA
         EDCSTS S    N+R LL  ++    E +KRKRW GR  +   L  +S+   +  E  T  +                       EDKYEI L ++EL 
Subjt:  REDCSTSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRG-LHDVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELA

Query:  SKRLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ
        S+ +AAE   EI+ +DP+FF QNP  LF LKQVEFLKLVS+GD++ AL+VAC HLGPLAAND SLLK LKETLL LL P+    GK  P++ LAN+LQ
Subjt:  SKRLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCATCCCCTTGAACTGGGAAGCTCTCGATGCCCTAATTATCGATTTCGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCATCCTCTCCACCTTCTTCTCC
TTCTCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCTTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTTTACAGGCCGGTGACATTGACTGCGCCATCG
ATCTCCTCCGCCTTCACGCGCCCTTTATTCTTGACGATCATAGACTTCTATTCCGGTTGCAGAAGCAGGTTACTCTAATTATGCTCCGTCTCTTGCTGCATAACAAATTT
ATCGAGCTTTTGAGGAAAGGGACTGCAGAGGATCGTGATTCGGCCATTCAATGCCTCCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAAGCATATGAGGA
GTTCAAGCATGTTCTTCTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCAGTGACATATGAGTGGTCTGAAAGGAGGAGGTTTGATATTGCCGGATTAATGTCCT
CAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTTTGAGATACTTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCC
ATATCTGATCTCACTGAGAGGTTGCTTCTAGATGAACGTGATCCACCTGCAACACCCAAGGAGAGTCTGTATGAAGCACCTCCATTTGACGAGGTGGACATTCAAGCTCT
TGCACATGCTGTAGAGCTTACAAGACAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACC
TTTCTGTACTTGATGAGCTTGTTCGTGAATATTGCATCTATAGAGGAATTGTGGATTCTGGTCGTGGATCTCTCTCCGGGATGCAGAATCTCTCTAGTTCATCAAAAGTC
AATCAATCAGAGCCGGAGTATTGCTCATCAAGGAATTGTTCTTTTGAAGTGGACTATGCAACCAGTAAACTTTCGGATGGTGAAATTTCTGTTAGCAATTCCCGTGTGGA
TAGTTCACCTGAAAATATTGCTGATGTGACCAGTTCACAAGGTACTGACATTGAATTACGATATGCATTCGAGCCAACATCCAATCGAGAGGATTGTAGCACAAGTGATT
CAATTCATGTGGGAAATTCAAGAACATTACTAGTAAACAAGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGATGGAGAGGAAGACAGGATGATAGAGGACTTCAC
GATGTGTCTTACAGTGGATGCAGTAAACAAGAACTTAGCACGACAACGGTGGCCAGTACAACCATGGCTAAGGAACAACAGAACCTTGAAAAACATTTACCATCGGAATC
TACTGGCAAGGAGGATAAATATGAAATTGTCTTAGGCATTAGAGAACTGGCAAGTAAAAGGTTGGCTGCAGAGGTTGTGGAAGAAATTAACGCCCTGGATCCAAACTTTT
TTGTGCAAAATCCTATTTTCCTATTCCAACTTAAGCAGGTTGAATTTTTAAAGCTAGTTAGCTCTGGTGATTATTCTAGTGCTTTGAGGGTTGCATGCACTCACTTAGGC
CCATTAGCAGCTAATGATCCATCCTTGTTGAAGCAATTAAAGGAGACTTTATTGGCTTTGCTCCTGCCCAATGAAGATATACTTGGGAAAGGCTTCCCTATACATGCTCT
TGCTAATTCTCTTCAGAGGGGCAGAGAGAATGACTATATTCCTACACTGCAGACAGGTGTTGGAGTATTACTTCTTAGTAGAAATCTGAACACTTCTTTGGTAGCTGGTA
CTGGAGTGGATGTTTGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTCCATCCCCTTGAACTGGGAAGCTCTCGATGCCCTAATTATCGATTTCGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCATCCTCTCCACCTTCTTCTCC
TTCTCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCTTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTTTACAGGCCGGTGACATTGACTGCGCCATCG
ATCTCCTCCGCCTTCACGCGCCCTTTATTCTTGACGATCATAGACTTCTATTCCGGTTGCAGAAGCAGGTTACTCTAATTATGCTCCGTCTCTTGCTGCATAACAAATTT
ATCGAGCTTTTGAGGAAAGGGACTGCAGAGGATCGTGATTCGGCCATTCAATGCCTCCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAAGCATATGAGGA
GTTCAAGCATGTTCTTCTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCAGTGACATATGAGTGGTCTGAAAGGAGGAGGTTTGATATTGCCGGATTAATGTCCT
CAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTTTGAGATACTTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCC
ATATCTGATCTCACTGAGAGGTTGCTTCTAGATGAACGTGATCCACCTGCAACACCCAAGGAGAGTCTGTATGAAGCACCTCCATTTGACGAGGTGGACATTCAAGCTCT
TGCACATGCTGTAGAGCTTACAAGACAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACC
TTTCTGTACTTGATGAGCTTGTTCGTGAATATTGCATCTATAGAGGAATTGTGGATTCTGGTCGTGGATCTCTCTCCGGGATGCAGAATCTCTCTAGTTCATCAAAAGTC
AATCAATCAGAGCCGGAGTATTGCTCATCAAGGAATTGTTCTTTTGAAGTGGACTATGCAACCAGTAAACTTTCGGATGGTGAAATTTCTGTTAGCAATTCCCGTGTGGA
TAGTTCACCTGAAAATATTGCTGATGTGACCAGTTCACAAGGTACTGACATTGAATTACGATATGCATTCGAGCCAACATCCAATCGAGAGGATTGTAGCACAAGTGATT
CAATTCATGTGGGAAATTCAAGAACATTACTAGTAAACAAGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGATGGAGAGGAAGACAGGATGATAGAGGACTTCAC
GATGTGTCTTACAGTGGATGCAGTAAACAAGAACTTAGCACGACAACGGTGGCCAGTACAACCATGGCTAAGGAACAACAGAACCTTGAAAAACATTTACCATCGGAATC
TACTGGCAAGGAGGATAAATATGAAATTGTCTTAGGCATTAGAGAACTGGCAAGTAAAAGGTTGGCTGCAGAGGTTGTGGAAGAAATTAACGCCCTGGATCCAAACTTTT
TTGTGCAAAATCCTATTTTCCTATTCCAACTTAAGCAGGTTGAATTTTTAAAGCTAGTTAGCTCTGGTGATTATTCTAGTGCTTTGAGGGTTGCATGCACTCACTTAGGC
CCATTAGCAGCTAATGATCCATCCTTGTTGAAGCAATTAAAGGAGACTTTATTGGCTTTGCTCCTGCCCAATGAAGATATACTTGGGAAAGGCTTCCCTATACATGCTCT
TGCTAATTCTCTTCAGAGGGGCAGAGAGAATGACTATATTCCTACACTGCAGACAGGTGTTGGAGTATTACTTCTTAGTAGAAATCTGAACACTTCTTTGGTAGCTGGTA
CTGGAGTGGATGTTTGCTAG
Protein sequenceShow/hide protein sequence
MDSIPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLQAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQVTLIMLRLLLHNKF
IELLRKGTAEDRDSAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREGVSSP
ISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGSLSGMQNLSSSSKV
NQSEPEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYAFEPTSNREDCSTSDSIHVGNSRTLLVNKNRGIVERSKRKRWRGRQDDRGLH
DVSYSGCSKQELSTTTVASTTMAKEQQNLEKHLPSESTGKEDKYEIVLGIRELASKRLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLG
PLAANDPSLLKQLKETLLALLLPNEDILGKGFPIHALANSLQRGRENDYIPTLQTGVGVLLLSRNLNTSLVAGTGVDVC