; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012308 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012308
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCLTH domain-containing protein
Genome locationscaffold1:8879944..8890118
RNA-Seq ExpressionSpg012308
SyntenySpg012308
Gene Ontology termsGO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsIPR006595 - CTLH, C-terminal LisH motif
IPR024964 - CTLH/CRA C-terminal to LisH motif domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595041.1 hypothetical protein SDJN03_11594, partial [Cucurbita argyrosperma subsp. sororia]1.1e-30394.63Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPSPSPSSLSS+SYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
        A+ GMQN SSSSKVNQSELEYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG NE+ELRYASEPTSNREDCSTSDSIHVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKN GIVERSKRKRWRGRHDDREL+DVS SGCSK ELST T+ASTTMSKEQQNLEKH+P++STGKEDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

KAG7027064.1 hypothetical protein SDJN02_11073 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-30294.45Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPS SPSSLSS+SYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
        A+ GMQN SSSSKVNQSELEYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG NE+ELRYASEPTSNREDCSTSDSIHVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKN GIVERSKRKRWRGRHDDREL+DVS SGCSK ELST T+ASTTMSKEQQNLEKH+P++STGKEDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

XP_022963234.1 uncharacterized protein LOC111463509 [Cucurbita moschata]1.8e-30494.63Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+E LRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
        A+ GMQN SSSSKVNQSELEYCSSRN SFEVDYAT KLSDGEISVSNSRVDSSPENIADVTSSQG NE+ELRYASEPTSNREDCSTSDSIHVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKNRGIVERSKRKRWRGRHDDREL+DVS SGCSK ELST T+ASTTMSKE+QNLEKH+P++STGKEDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

XP_023003547.1 uncharacterized protein LOC111497115 [Cucurbita maxima]3.7e-30293.93Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
         + GMQN SSSSKVNQSELEYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ  NE+ELRYASEPTSNREDCSTSDS+HVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKNRGIVERSKRKRWRGRHDDREL+DVS SGCSK ELS  T+AS  MSKEQQNLEKH+P++STG+EDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

XP_023544365.1 uncharacterized protein LOC111803970 isoform X1 [Cucurbita pepo subsp. pepo]3.7e-30293.58Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLE+GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GT EDR LAI+C+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCE RRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        V SPISDLTERLLLDERDPPATPKESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN
        A+ GMQN SSS KV+QSELEYCSSRNCS EVDYATSKLSDGEISV+NSRVDSSPENIADVTSSQG E +LRY+  PTSNREDCSTSDSIHVGNSRTLQVN
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN

Query:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ
        KNRGIVERSKRKRWRGRHDDREL D+S SGCSKQE+STTT+ASTTMS EQQNLEKHLPLESTGK+DKYEI+LGIRELASKRLAAEVVEEINA+DP FF Q
Subjt:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ

Query:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        NPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDILGKGFPINALANSLQ
Subjt:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

TrEMBL top hitse value%identityAlignment
A0A6J1BXE9 uncharacterized protein LOC111005585 isoform X13.9e-30293.75Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIR IRRSLEAG ID AI LLRLHAPFILDDHRLLFRL KQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN
         +PGMQN SSSSK+NQSELEYCSSRNCSFEVD+ATSKLSDGEISV NSRVDSSPENIADVTSSQG +IELRYA EPT+NREDCSTSDSIHVGNSRTLQVN
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN

Query:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ
        KNRGIVERSKRKRWRGRHDDR L+DVS SGCSKQELST T+AS T+SK+QQNLEK LPLEST KEDKYEI+LGIRE+ASKRLAAEVVEEINALDPNFF+Q
Subjt:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ

Query:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        NPI LFQLKQVEF KLVS+GDYSS LRVACTHLGPLAANDPSLLKQLKETLLALLLPNED+LGKGFPINALANSLQ
Subjt:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

A0A6J1GF09 uncharacterized protein LOC111453581 isoform X24.8e-30092.27Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPS------SPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF
        STPLNWEALDALIIDFARSENLIEDSFSSSPPS      SPSPSPSSLSSSSYHSRLIIRQIRR LE+GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPS------SPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF

Query:  VELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKG
        +ELLRKGT EDR LAI+C+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCE RRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKG
Subjt:  VELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKG

Query:  FCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGI
        FCFREGV SPISDLTERLLLDERDPPATP+ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGI
Subjt:  FCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGI

Query:  VDSGRGAVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNS
        VDSGRGA+ GMQN SSS KV+QSELEYCSSRNCS EVDYATSKLSDGEISV+NSRVDSSPENIADVTSSQG E +LRY+ EPTSNREDCSTSDSIHVGNS
Subjt:  VDSGRGAVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNS

Query:  RTLQVNKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALD
        RTLQVNKNRGIVERSKRKRWRGRHDDREL D+S SGCSKQE+STTT+ STTMSKEQQNLEKHLPLESTGK+DKYEI+LGIRELASKRLAAEVVEEINA+D
Subjt:  RTLQVNKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALD

Query:  PNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        P FF QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDILGKGFPIN+LANSLQ
Subjt:  PNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

A0A6J1HJH4 uncharacterized protein LOC1114635098.5e-30594.63Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+E LRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
        A+ GMQN SSSSKVNQSELEYCSSRN SFEVDYAT KLSDGEISVSNSRVDSSPENIADVTSSQG NE+ELRYASEPTSNREDCSTSDSIHVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQG-NEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKNRGIVERSKRKRWRGRHDDREL+DVS SGCSK ELST T+ASTTMSKE+QNLEKH+P++STGKEDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

A0A6J1ISM6 uncharacterized protein LOC111478049 isoform X18.8e-30293.4Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLE+GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GT EDR LAIQC+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCE RRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        V SPISDLTERLLLDERDPPATP ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN
        A+ GMQN SSS KV+QSELEYCSSRNCS EVDYATSKLSDGEISV+NSRVDSSPENIADVTSSQG E +LRY+ EP SNREDCSTSD IHVGNSRTLQVN
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVN

Query:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ
        KNRGIVERSKRKRWRGRHDDREL D+S SGCSKQE+STTT+ASTTMSKEQQNLEKHLPLESTGK+DKYEI+LGIRELASKRLAAEVVEEINA+DP FF Q
Subjt:  KNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQ

Query:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        NPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDI GKGFPINALANSLQ
Subjt:  NPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

A0A6J1KMW1 uncharacterized protein LOC1114971151.8e-30293.93Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK
        S PLNWEALDALIIDFARSENLIEDSFSSSPP SPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF+ELLRK
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRK

Query:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG
        GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCF EG
Subjt:  GTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREG

Query:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG
        VSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG G
Subjt:  VSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG

Query:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV
         + GMQN SSSSKVNQSELEYCSSRN SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ  NE+ELRYASEPTSNREDCSTSDS+HVGNSRTLQ 
Subjt:  AVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQV

Query:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV
        NKNRGIVERSKRKRWRGRHDDREL+DVS SGCSK ELS  T+AS  MSKEQQNLEKH+P++STG+EDKYEI+LGIRELASKRLAAEVVEEINALDPNFFV
Subjt:  NKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFV

Query:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ
        QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALANSLQ
Subjt:  QNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQ

SwissProt top hitse value%identityAlignment
Q54X16 Glucose-induced degradation protein 8 homolog1.0e-0431.62Show/hide
Query:  IRRSLEAGDIDCAIDLLRLHAPFILDDH-RLLFRLQKQKFVELLRKG-TAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYE
        IR +++ GD++  I+++    P ILD + +L F LQ+QK +EL+RKG TAE    A++  +  LAP   +   +  EE +  +   +++ D   SP++  
Subjt:  IRRSLEAGDIDCAIDLLRLHAPFILDDH-RLLFRLQKQKFVELLRKG-TAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYE

Query:  WCERRRFDIAG-LMSSVLRAHMQAYDPVFSMTLRYL
            +R   AG L S++L +  Q  DP     L+ L
Subjt:  WCERRRFDIAG-LMSSVLRAHMQAYDPVFSMTLRYL

Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers2.8e-1033.33Show/hide
Query:  KFCKWSLS-PSGCRLCLRMDENRDHLFIHCKFAQQVWGWFARKVGLHFCLPQKVED---WLLEGLTAWNLGHKAKIIIGCAFRATLWHIWLERNARAFEN
        +F  W +  PS C LC  +DE R H+F  C F+ +VW +F          P+  +D   WL           K   I+  A++A+++HIW ERN R   N
Subjt:  KFCKWSLS-PSGCRLCLRMDENRDHLFIHCKFAQQVWGWFARKVGLHFCLPQKVED---WLLEGLTAWNLGHKAKIIIGCAFRATLWHIWLERNARAFEN

Query:  KA
        K+
Subjt:  KA

AT3G25270.1 Ribonuclease H-like superfamily protein7.0e-0928.79Show/hide
Query:  IWKGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWSLSPSGCRLCLRMDENRDHLFIHCKFAQQVWGWFARKVGL---HFCLPQKVEDWLLEGLTAWNLG
        IWK  +  K+K FLW L   +L T + L+++  +   +   C  C + DE   HLF  C +AQQVW    R  G+            +  +E L +  L 
Subjt:  IWKGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWSLSPSGCRLCLRMDENRDHLFIHCKFAQQVWGWFARKVGL---HFCLPQKVEDWLLEGLTAWNLG

Query:  HKAKIIIGCAFRATLWHIWLERNARAFENKAL
        ++   +   A    LW +W  RN   F+ K++
Subjt:  HKAKIIIGCAFRATLWHIWLERNARAFENKAL

AT5G66810.1 CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595)1.5e-18461.64Show/hide
Query:  STPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPS-SLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF
        STP+NWEALDALIIDF  SENL+ED+ ++     SP SSPS S S S+SSSSYHSRLIIR+IR S+E+GDI+ AID+LR HAPF+LDDHR+LFRLQKQKF
Subjt:  STPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPS-SLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKF

Query:  VELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKG
        +ELLRKGT E    AI CLRT +APCALDAYPEAYEEFKHVLLA IYDKD+QTSPV  EW E+RR+++AGLMSSVLRA +QAYDPVFSMTLRYLISIHKG
Subjt:  VELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKG

Query:  FCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGI
        FCF +G+SS +SDLT RLLL+ERD PATP ES+YE PPFDEVDIQALAHAVELTRQGA+DS++F KGDLF AFQNELCRM+LD+SVLDELV+EYCIYRGI
Subjt:  FCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGI

Query:  VDSGRGAVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNS
        VDS       MQ  +  +K NQSE+    SR+CS E+D  TS+ SD E   + S +D S     +++  +G ++  RY SEPTS  EDCSTS S    N+
Subjt:  VDSGRGAVPGMQNPSSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNS

Query:  RTLQVNKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALD
        R L   ++    E +KRKRW GR    E++ +     +  E  T  +                       EDKYEI L ++EL S+ +AAE   EI+ +D
Subjt:  RTLQVNKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELSTTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALD

Query:  PNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQES
        P+FF QNP  LF LKQVEFLKLVS+GD++ AL+VAC HLGPLAAND SLLK LKETLL LL P+    GK  P+N LAN+LQ S
Subjt:  PNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCGTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATTATCGATTTCGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCTTCTTCTCCACCTTCTTC
TCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCCTCCTACCACTCCAGGCTGATCATCCGCCAGATCAGGCGCTCTTTGGAGGCCGGTGATATTGATTGCGCCATCGATC
TCCTCCGCCTTCATGCACCCTTCATTCTCGACGATCATAGGCTTCTATTCCGCTTGCAGAAGCAGAAATTTGTCGAGCTCTTGAGGAAAGGGACTGCAGAGGATCGTGAT
TTGGCCATTCAATGTCTCCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAAGCGTATGAGGAGTTCAAGCATGTTCTTCTTGCCTTCATTTACGACAAGGA
TAATCAAACATCCCCAGTGACATATGAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGATTAATGTCCTCTGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCT
TCTCTATGACTTTGAGATACCTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCCATATCAGATCTCACTGAGAGGTTGCTTCTAGATGAACGT
GATCCACCTGCCACACCCAAGGAGAGTCTGTACGAAGCACCTCCATTTGATGAGGTGGACATTCAAGCTCTTGCACATGCTGTAGAGCTAACAAGACAGGGGGCAATTGA
TAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCT
ACAGAGGAATTGTGGATTCTGGTCGTGGAGCTGTCCCTGGGATGCAGAATCCCTCCAGTTCATCAAAAGTTAATCAATCAGAACTGGAGTATTGTTCATCAAGGAATTGT
TCTTTTGAAGTGGACTATGCAACCAGTAAACTTTCAGATGGTGAAATTTCTGTCAGCAATTCCCGCGTGGATAGTTCTCCTGAGAATATTGCTGATGTGACCAGTTCACA
AGGTAATGAGATTGAATTACGATATGCATCGGAGCCAACGTCCAATCGAGAAGATTGTAGCACCAGTGATTCAATTCATGTGGGAAATTCAAGAACATTACAAGTAAACA
AGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGATGGAGAGGAAGACATGATGATAGAGAACTTAATGATGTGTCTAACAGTGGATGCAGTAAACAAGAACTTAGC
ACTACAACAATGGCCAGTACAACCATGTCTAAGGAGCAACAGAACCTTGAAAAACATTTACCATTAGAATCTACTGGCAAGGAGGATAAATATGAGATTATCTTGGGCAT
TAGAGAACTGGCAAGTAAAAGGTTGGCTGCAGAGGTTGTGGAAGAAATTAACGCTCTGGATCCAAACTTTTTTGTACAAAATCCTATTTTCCTATTCCAACTGAAACAGG
TTGAATTTTTGAAGCTGGTCAGTTCCGGTGATTATTCTAGTGCTTTGAGGGTCGCATGCACTCACCTAGGCCCATTAGCCGCTAATGATCCTTCCTTGTTGAAGCAATTA
AAAGAGACGCTATTGGCTTTGCTCCTGCCCAATGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAATTCTCTTCAGGAATCAAATCCACAATATCCTCA
AATGGACCGAATTCTCTGGACTCTTGAAAGCTCGGGATCCTACACTAGTAAATCTACATTCTATCATATGCACGAGAGGCAGCAAATAGTAAATCCGACCCTGACGAATC
TGATTTGGAAGGGTAATAGTCCCAAGAAAGTTAAGGTTTTTCTATGGTCCCTAGCTTATAGAAGTTTGAACACTGACGAAAAACTTCAGAAAAAGTTCTGCAAGTGGTCT
TTGTCTCCGTCTGGCTGCAGATTATGTCTTAGGATGGACGAAAATCGTGATCACCTCTTTATCCACTGTAAGTTTGCGCAACAAGTCTGGGGTTGGTTTGCTAGGAAGGT
TGGTCTTCATTTTTGTTTGCCTCAGAAAGTGGAAGATTGGCTTTTAGAAGGGCTCACGGCTTGGAACTTGGGGCACAAAGCCAAGATCATTATCGGGTGCGCCTTTAGAG
CGACTCTTTGGCACATTTGGTTGGAAAGGAATGCTAGAGCTTTCGAGAACAAAGCTCTTAGGCTAGATTCTTTTTGTGATTATGTACAAAATACGGCTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCGTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATTATCGATTTCGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCTTCTTCTCCACCTTCTTC
TCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCCTCCTACCACTCCAGGCTGATCATCCGCCAGATCAGGCGCTCTTTGGAGGCCGGTGATATTGATTGCGCCATCGATC
TCCTCCGCCTTCATGCACCCTTCATTCTCGACGATCATAGGCTTCTATTCCGCTTGCAGAAGCAGAAATTTGTCGAGCTCTTGAGGAAAGGGACTGCAGAGGATCGTGAT
TTGGCCATTCAATGTCTCCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAAGCGTATGAGGAGTTCAAGCATGTTCTTCTTGCCTTCATTTACGACAAGGA
TAATCAAACATCCCCAGTGACATATGAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGATTAATGTCCTCTGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCT
TCTCTATGACTTTGAGATACCTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCCATATCAGATCTCACTGAGAGGTTGCTTCTAGATGAACGT
GATCCACCTGCCACACCCAAGGAGAGTCTGTACGAAGCACCTCCATTTGATGAGGTGGACATTCAAGCTCTTGCACATGCTGTAGAGCTAACAAGACAGGGGGCAATTGA
TAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCT
ACAGAGGAATTGTGGATTCTGGTCGTGGAGCTGTCCCTGGGATGCAGAATCCCTCCAGTTCATCAAAAGTTAATCAATCAGAACTGGAGTATTGTTCATCAAGGAATTGT
TCTTTTGAAGTGGACTATGCAACCAGTAAACTTTCAGATGGTGAAATTTCTGTCAGCAATTCCCGCGTGGATAGTTCTCCTGAGAATATTGCTGATGTGACCAGTTCACA
AGGTAATGAGATTGAATTACGATATGCATCGGAGCCAACGTCCAATCGAGAAGATTGTAGCACCAGTGATTCAATTCATGTGGGAAATTCAAGAACATTACAAGTAAACA
AGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGATGGAGAGGAAGACATGATGATAGAGAACTTAATGATGTGTCTAACAGTGGATGCAGTAAACAAGAACTTAGC
ACTACAACAATGGCCAGTACAACCATGTCTAAGGAGCAACAGAACCTTGAAAAACATTTACCATTAGAATCTACTGGCAAGGAGGATAAATATGAGATTATCTTGGGCAT
TAGAGAACTGGCAAGTAAAAGGTTGGCTGCAGAGGTTGTGGAAGAAATTAACGCTCTGGATCCAAACTTTTTTGTACAAAATCCTATTTTCCTATTCCAACTGAAACAGG
TTGAATTTTTGAAGCTGGTCAGTTCCGGTGATTATTCTAGTGCTTTGAGGGTCGCATGCACTCACCTAGGCCCATTAGCCGCTAATGATCCTTCCTTGTTGAAGCAATTA
AAAGAGACGCTATTGGCTTTGCTCCTGCCCAATGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAATTCTCTTCAGGAATCAAATCCACAATATCCTCA
AATGGACCGAATTCTCTGGACTCTTGAAAGCTCGGGATCCTACACTAGTAAATCTACATTCTATCATATGCACGAGAGGCAGCAAATAGTAAATCCGACCCTGACGAATC
TGATTTGGAAGGGTAATAGTCCCAAGAAAGTTAAGGTTTTTCTATGGTCCCTAGCTTATAGAAGTTTGAACACTGACGAAAAACTTCAGAAAAAGTTCTGCAAGTGGTCT
TTGTCTCCGTCTGGCTGCAGATTATGTCTTAGGATGGACGAAAATCGTGATCACCTCTTTATCCACTGTAAGTTTGCGCAACAAGTCTGGGGTTGGTTTGCTAGGAAGGT
TGGTCTTCATTTTTGTTTGCCTCAGAAAGTGGAAGATTGGCTTTTAGAAGGGCTCACGGCTTGGAACTTGGGGCACAAAGCCAAGATCATTATCGGGTGCGCCTTTAGAG
CGACTCTTTGGCACATTTGGTTGGAAAGGAATGCTAGAGCTTTCGAGAACAAAGCTCTTAGGCTAGATTCTTTTTGTGATTATGTACAAAATACGGCTTCTTGA
Protein sequenceShow/hide protein sequence
MDSSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFVELLRKGTAEDRD
LAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDER
DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGAVPGMQNPSSSSKVNQSELEYCSSRNC
SFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGNEIELRYASEPTSNREDCSTSDSIHVGNSRTLQVNKNRGIVERSKRKRWRGRHDDRELNDVSNSGCSKQELS
TTTMASTTMSKEQQNLEKHLPLESTGKEDKYEIILGIRELASKRLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQL
KETLLALLLPNEDILGKGFPINALANSLQESNPQYPQMDRILWTLESSGSYTSKSTFYHMHERQQIVNPTLTNLIWKGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWS
LSPSGCRLCLRMDENRDHLFIHCKFAQQVWGWFARKVGLHFCLPQKVEDWLLEGLTAWNLGHKAKIIIGCAFRATLWHIWLERNARAFENKALRLDSFCDYVQNTAS