CuGenDBv2

Pay0000189 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0000189
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: CLTH domain-containing protein
Genome location: Contig00140_ERROPOS1480835:141117..146915
RNA-Seq Expression: Pay0000189
Synteny: Pay0000189
Gene Ontology terms:
GO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domains: IPR006595 - CTLH, C-terminal LisH motif
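
The genome location string above packs a sequence identifier and a coordinate range into one field. As a minimal sketch of how it could be split for downstream use, the helper below (the name `parse_location` is hypothetical, not part of CuGenDB) assumes the `seqid:start..end` layout seen on this page and assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Assumed layout: "<seqid>:<start>..<end>", as in the Genome location field.
LOCATION_RE = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_location(location):
    """Split a location string into (seqid, start, end, span_length).

    span_length assumes 1-based inclusive coordinates (an assumption,
    not something the source page confirms).
    """
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return match.group("seqid"), start, end, end - start + 1


seqid, start, end, span = parse_location(
    "Contig00140_ERROPOS1480835:141117..146915"
)
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the gene span comes out at 5799 bp.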


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
TYK12895.1 CLTH domain-containing protein [Cucumis melo var. makuwa] | e value: 7.3e-279 | % identity: 83.33
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDS+HV NSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

XP_008440269.1 PREDICTED: uncharacterized protein LOC103484770 isoform X1 [Cucumis melo] | e value: 1.1e-279 | % identity: 83.65
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

XP_008440270.1 PREDICTED: uncharacterized protein LOC103484770 isoform X2 [Cucumis melo] | e value: 8.7e-280 | % identity: 83.65
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

XP_011657859.1 uncharacterized protein LOC101218546 isoform X1 [Cucumis sativus] | e value: 4.5e-276 | % identity: 82.39
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRG+LSGMQNLSSS KANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDS+HVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELS TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSAL+VACTHLGPLA NDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

XP_011657860.1 uncharacterized protein LOC101218546 isoform X2 [Cucumis sativus] | e value: 3.4e-276 | % identity: 82.39
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRG+LSGMQNLSSS KANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDS+HVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELS TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSAL+VACTHLGPLA NDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KGB9 Uncharacterized protein | e value: 2.2e-276 | % identity: 82.39
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRG+LSGMQNLSSS KANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDS+HVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELS TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSAL+VACTHLGPLA NDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

A0A1S3B0B0 uncharacterized protein LOC103484770 isoform X3 | e value: 1.2e-274 | % identity: 82.7
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVD      SGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

A0A1S3B1B9 uncharacterized protein LOC103484770 isoform X1 | e value: 5.5e-280 | % identity: 83.65
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

A0A1S3B1G5 uncharacterized protein LOC103484770 isoform X2 | e value: 4.2e-280 | % identity: 83.65
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

A0A5D3CMW8 CLTH domain-containing protein | e value: 3.6e-279 | % identity: 83.33
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN
        H   FR  + KFIELLRKGTPEDRDLAIQCLRTALAPCALDAYP          + + FK VL  +  ++          + + R+F          R +
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQK------VAAFFKERKF----------RLN

Query:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR
            +   ++T     SIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE                                 NELCR
Subjt:  NDWRNKEVAVT----SSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE---------------------------------NELCR

Query:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
        MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS
Subjt:  MKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTDIELRYAS

Query:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
        EPTSNREDCSTSDS+HV NSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS
Subjt:  EPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELAS

Query:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
        KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV
Subjt:  KRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALANSLQVAV

Query:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
        GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN
Subjt:  GRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDEN

Query:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
        AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Subjt:  AILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT5G66810.1 CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595) | e value: 3.4e-133 | % identity: 48.15
Query:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQKVAAFFKERKFRLNNDWRNKE---------
        H   FR  + KFIELLRKGT E    AI CLRT +APCALDAYP          + + FK VL     ++       ++   + N+W  K          
Subjt:  HPSFFR--RPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQKVAAFFKERKFRLNNDWRNKE---------

Query:  -----------------VAVTSSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE--------------------------------
                         +    SIHKGFCF +G+SS +SDLT RLLL+ERD PATP ES+YE PPFDE                                
Subjt:  -----------------VAVTSSIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDE--------------------------------

Query:  -NELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSE-QEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTD
         NELCRM+LD+SVLDELV+EYCIYRGIVD      S MQ ++  +K NQSE     SR+CS E+D  TS+ SD E   + S +D S     +++  +G D
Subjt:  -NELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSE-QEYCSRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQGTD

Query:  IELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTE-LHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIV
        +  RY SEPTS  EDCSTS S    N+R L   ++    E +KRKRW GR  + + L  +S++            S+   N     P+     EDKYEI 
Subjt:  IELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTE-LHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTGKEDKYEIV

Query:  LGIRELASKRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINAL
        L ++EL S+  AAE   EI+ +DP+FF+QNP LLF LKQVEFLKLVS+GD++ AL+VAC HLGPLA ND SLLK LKETLL LL P     GK  P+N L
Subjt:  LGIRELASKRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINAL

Query:  ANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQV-TKSLGARTSEDGSSP---
        AN+LQV+VG RLGIEEP+LMK+++ATLH+H+EWFKLQMCKDRF  LLKID LKEVN  L+       KS  DS ++ SSQV T S    TSEDG S    
Subjt:  ANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQV-TKSLGARTSEDGSSP---

Query:  --TQASSRDAC-DENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIF
          TQ   R+A  +E+AILKVMEFLA+PR+DAI LL+QYNG+AE VIQQ+F
Subjt:  --TQASSRDAC-DENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIF
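
In the alignment blocks above, the unlabelled middle line marks each column: a letter where query and subject carry the same residue, `+` for a conservative substitution, and a space otherwise. The sketch below counts those marks for a single block. The function name `midline_stats` is hypothetical. The page does not say how its % identity column is derived, so per-block counts from this sketch are not guaranteed to reproduce the reported values.

```python
def midline_stats(query, midline):
    """Count identical and positive columns in one alignment block.

    query   -- residues from a 'Query:' line, without the label
    midline -- the match line beneath it, without its leading padding
    Returns (identical, positive, columns).
    """
    # Trailing spaces on the match line are often trimmed; restore them.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("match line is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # '+' marks a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# First 12 columns of the first GenBank alignment block on this page.
identical, positive, columns = midline_stats("HPSFFR--RPKF", "H   FR  + KF")
print(identical, positive, columns)
print(round(100 * identical / columns, 2))
```

Summing the three counts over every block of a hit and dividing identities by total columns gives one common definition of percent identity; other tools exclude gap columns from the denominator.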


Sequences
CDS sequence
ATGCACCCTTCATTCTTTCGACGACCAAAATTTATTGAGCTTTTGAGGAAAGGGACTCCAGAGGATCGAGATTTGGCCATTCAATGCCTTCGGACTGCACTCGCTCCTTG
TGCTCTTGATGCATACCCGGTAACCATCCTACACATTCGTGGTTTGTTTTCGATTGATGCTTTTAAGAAAGTATTATCGGGTTATAGGATCGAGCAAAAAGTTGCTGCCT
TTTTTAAAGAAAGAAAATTTCGGTTGAATAATGATTGGCGTAATAAAGAAGTAGCTGTAACTTCAAGCATACATAAGGGTTTTTGCTTTCGTGAAGGTGTGTCGTCTCCC
ATATCAGATCTCACCGAGAGGTTGCTTCTAGATGAACGCGATCCACCTGCCACACCTAAGGAGAGTCTGTACGAAGCTCCTCCATTTGATGAGAATGAGTTGTGCCGGAT
GAAATTGGACCTTTCTGTTCTCGATGAGCTCGTTCGTGAATATTGCATCTACAGAGGAATTGTGGATTCAGGTCGAGGAGCCCTCTCTGGGATGCAGAATCTCTCTAGTT
CATCGAAAGCTAATCAATCTGAGCAGGAGTATTGTTCTAGGAATTGTTCTTTTGAAGTTGACTACACAACCAGTAAACTTTCGGATGGTGAAATTTCTGTTAGCAATTCC
CGTGTGGATAGTTCTCCTGAAAATACTGCTGATGTGACCAGTTCACAAGGTACTGATATTGAACTTAGATATGCATCGGAGCCAACGTCCAATCGAGAAGATTGTAGCAC
TAGTGATTCAGTTCATGTGGGAAATTCAAGAATGTTACAAGTGAACAAGAATCGAGGGATTGTAGAGAGGAGCAAGAGAAAGAGATGGAGAGGAAGACTTGATGATACAG
AACTTCATGATGTGTCTTACAGTGGGTGCAGTAAACAAGAACTTAGCGCTACAACCATGTCCAAGGAACAACAGAACCTTGAAAAACATATACCAGTAGAGTCTACTGGC
AAGGAGGATAAATATGAAATTGTCTTGGGCATTAGAGAACTGGCAAGTAAAAGGTTTGCTGCAGAGGTTGTGGAAGAAATTAATGCCGTGGATCCGAACTTTTTTTCACA
AAATCCTATTCTCCTATTCCAACTTAAGCAGGTTGAATTTTTGAAGCTGGTTAGTTCTGGCGATTATTCCAGTGCATTGAGGGTCGCATGCACTCACTTGGGCCCATTAG
CCACTAATGATCCTTCCTTGTTGAAGCAATTAAAGGAGACTTTGTTGGCTTTGCTCCTGCCCAAGGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAAT
TCTCTCCAGGTTGCTGTTGGAAGGAGACTTGGTATTGAAGAGCCACAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAAATGTG
CAAGGATCGGTTTGAAGGTCTTTTGAAGATTGATTTGTTGAAGGAAGTTAATCCACCTTTGCTTTCTACTACCGCTGGGCTACTGAAATCAAATTCAGATAGTTGCAGCC
ACGGTTCTTCCCAAGTCACAAAATCTCTGGGTGCAAGAACCTCAGAAGATGGTAGCAGTCCCACACAAGCATCATCTAGAGATGCATGTGACGAAAATGCAATACTTAAA
GTCATGGAGTTTCTTGCCTTGCCCAGGGCTGATGCAATCCATCTTCTTGCGCAGTATAATGGAAATGCAGAAATGGTGATACAGCAAATATTTGCATGA
mRNA sequence
ATGCACCCTTCATTCTTTCGACGACCAAAATTTATTGAGCTTTTGAGGAAAGGGACTCCAGAGGATCGAGATTTGGCCATTCAATGCCTTCGGACTGCACTCGCTCCTTG
TGCTCTTGATGCATACCCGGTAACCATCCTACACATTCGTGGTTTGTTTTCGATTGATGCTTTTAAGAAAGTATTATCGGGTTATAGGATCGAGCAAAAAGTTGCTGCCT
TTTTTAAAGAAAGAAAATTTCGGTTGAATAATGATTGGCGTAATAAAGAAGTAGCTGTAACTTCAAGCATACATAAGGGTTTTTGCTTTCGTGAAGGTGTGTCGTCTCCC
ATATCAGATCTCACCGAGAGGTTGCTTCTAGATGAACGCGATCCACCTGCCACACCTAAGGAGAGTCTGTACGAAGCTCCTCCATTTGATGAGAATGAGTTGTGCCGGAT
GAAATTGGACCTTTCTGTTCTCGATGAGCTCGTTCGTGAATATTGCATCTACAGAGGAATTGTGGATTCAGGTCGAGGAGCCCTCTCTGGGATGCAGAATCTCTCTAGTT
CATCGAAAGCTAATCAATCTGAGCAGGAGTATTGTTCTAGGAATTGTTCTTTTGAAGTTGACTACACAACCAGTAAACTTTCGGATGGTGAAATTTCTGTTAGCAATTCC
CGTGTGGATAGTTCTCCTGAAAATACTGCTGATGTGACCAGTTCACAAGGTACTGATATTGAACTTAGATATGCATCGGAGCCAACGTCCAATCGAGAAGATTGTAGCAC
TAGTGATTCAGTTCATGTGGGAAATTCAAGAATGTTACAAGTGAACAAGAATCGAGGGATTGTAGAGAGGAGCAAGAGAAAGAGATGGAGAGGAAGACTTGATGATACAG
AACTTCATGATGTGTCTTACAGTGGGTGCAGTAAACAAGAACTTAGCGCTACAACCATGTCCAAGGAACAACAGAACCTTGAAAAACATATACCAGTAGAGTCTACTGGC
AAGGAGGATAAATATGAAATTGTCTTGGGCATTAGAGAACTGGCAAGTAAAAGGTTTGCTGCAGAGGTTGTGGAAGAAATTAATGCCGTGGATCCGAACTTTTTTTCACA
AAATCCTATTCTCCTATTCCAACTTAAGCAGGTTGAATTTTTGAAGCTGGTTAGTTCTGGCGATTATTCCAGTGCATTGAGGGTCGCATGCACTCACTTGGGCCCATTAG
CCACTAATGATCCTTCCTTGTTGAAGCAATTAAAGGAGACTTTGTTGGCTTTGCTCCTGCCCAAGGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAAT
TCTCTCCAGGTTGCTGTTGGAAGGAGACTTGGTATTGAAGAGCCACAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAAATGTG
CAAGGATCGGTTTGAAGGTCTTTTGAAGATTGATTTGTTGAAGGAAGTTAATCCACCTTTGCTTTCTACTACCGCTGGGCTACTGAAATCAAATTCAGATAGTTGCAGCC
ACGGTTCTTCCCAAGTCACAAAATCTCTGGGTGCAAGAACCTCAGAAGATGGTAGCAGTCCCACACAAGCATCATCTAGAGATGCATGTGACGAAAATGCAATACTTAAA
GTCATGGAGTTTCTTGCCTTGCCCAGGGCTGATGCAATCCATCTTCTTGCGCAGTATAATGGAAATGCAGAAATGGTGATACAGCAAATATTTGCATGA
Protein sequence
MHPSFFRRPKFIELLRKGTPEDRDLAIQCLRTALAPCALDAYPVTILHIRGLFSIDAFKKVLSGYRIEQKVAAFFKERKFRLNNDWRNKEVAVTSSIHKGFCFREGVSSP
ISDLTERLLLDERDPPATPKESLYEAPPFDENELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKANQSEQEYCSRNCSFEVDYTTSKLSDGEISVSNS
RVDSSPENTADVTSSQGTDIELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELHDVSYSGCSKQELSATTMSKEQQNLEKHIPVESTG
KEDKYEIVLGIRELASKRFAAEVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLLKQLKETLLALLLPKEDILGKGFPINALAN
SLQVAVGRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDGSSPTQASSRDACDENAILK
VMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
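
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the page does not name a translation table). The names `CODON_TABLE` and `translate` are illustrative and not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are ignored, so the wrapped lines of the
    CDS can be pasted in as one string. Unknown codons become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 4 codons of the CDS listed on this page.
print(translate("ATGCACCCTTCATTCTTTCGACGACCAAAATTTATTGAG"))
print(translate("ATATTTGCATGA"))
```

Passing the full CDS and comparing the result with the listed protein sequence is a quick consistency check on a downloaded record.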