; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009244 (gene) of Snake gourd v1 genome

Gene IDTan0009244
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCLTH domain-containing protein
Genome locationLG03:71576212..71585282
RNA-Seq ExpressionTan0009244
SyntenyTan0009244
Gene Ontology termsGO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsIPR006595 - CTLH, C-terminal LisH motif
IPR024964 - CTLH/CRA C-terminal to LisH motif domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595041.1 hypothetical protein SDJN03_11594, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0093.47Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSS+SYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG GALSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA SKLSDGEISVSNSRVDSSPENIADVTSSQG NEVELRYASEPTSNREDCSTSDSIHVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKN GIVERSKRKRWRGRHDDREL+D+S SGCSK ELST TVAS T+SKEQQNLEKH+P++STGKEDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN P LSTS  LLKSNSDSC HG SQ  KSS  RT EDG+SPMQASS DA DENAILKVMEFLALPRADA
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA

Query:  IHLLAQYNGNAEMVIQQIFA
        IHLLAQYNGNAEMVIQQIFA
Subjt:  IHLLAQYNGNAEMVIQQIFA

KAG7027064.1 hypothetical protein SDJN02_11073 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0093.47Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPS SPSSLSS+SYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG GALSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA SKLSDGEISVSNSRVDSSPENIADVTSSQG NEVELRYASEPTSNREDCSTSDSIHVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKN GIVERSKRKRWRGRHDDREL+D+S SGCSK ELST TVAS T+SKEQQNLEKH+P++STGKEDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED L KGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN PSLSTS  LLKSNSDSC HG SQ  KSS  RT EDG+SPMQASS DA DENAILKVMEFLALPRADA
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA

Query:  IHLLAQYNGNAEMVIQQIFA
        IHLLAQYNGNAEMVIQQIFA
Subjt:  IHLLAQYNGNAEMVIQQIFA

XP_022132833.1 uncharacterized protein LOC111005585 isoform X1 [Momordica charantia]0.0e+0093.32Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIR IRRSLEAG ID AI LLRLHAPFILDDHRLLFRL KQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT
        SGRG L GMQNLSSSSK+ QSE EYCSSRN SFEVD+A SKLSDGEISV NSRVDSSPENIADVTSSQG ++ELRYA EPT+NREDCSTSDSIHVGNSRT
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT

Query:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN
        LQ NKNRGIVERSKRKRWRGRHDDR L+D+S SGCSKQELST TVAS TISK+QQNLEK LPLEST KEDKYEIVLGIREMASKRLAAEVVEEINALDPN
Subjt:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN

Query:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA
        FF+QNPI LFQLKQVEF KLVS+GDYSS LRVACTHLGPLAANDPSLLKQLKETLLALLLPNED+LGKGFPINALA+SLQVAFGRRLGIEEPQLMKLMRA
Subjt:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA

Query:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI
        TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPP LSTSS LLKSNSDSCT G SQ TKSSGART EDG+SP+QASSRDACDENAILKVMEFLALPRADAI
Subjt:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI

Query:  HLLAQYNGNAEMVIQQIFA
        HLLAQYNGNAEMVIQQIFA
Subjt:  HLLAQYNGNAEMVIQQIFA

XP_022963234.1 uncharacterized protein LOC111463509 [Cucurbita moschata]0.0e+0093.61Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
         LRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG GALSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA  KLSDGEISVSNSRVDSSPENIADVTSSQG NEVELRYASEPTSNREDCSTSDSIHVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKNRGIVERSKRKRWRGRHDDREL+D+S SGCSK ELST TVAS T+SKE+QNLEKH+P++STGKEDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN PSLSTS  LLKSNSDSC HG SQ  KSS  RT EDG+SPMQASS DA DENAILKVMEFLALPRADA
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA

Query:  IHLLAQYNGNAEMVIQQIFA
        IHLLAQYNGNAEMVIQQIFA
Subjt:  IHLLAQYNGNAEMVIQQIFA

XP_023003547.1 uncharacterized protein LOC111497115 [Cucurbita maxima]0.0e+0093.2Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG G LSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA SKLSDGEISVSNSRVDSSPENIADVTSSQ  NEVELRYASEPTSNREDCSTSDS+HVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKNRGIVERSKRKRWRGRHDDREL+D+S SGCSK ELS  TVAS  +SKEQQNLEKH+P++STG+EDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSS-GARTLEDGNSPMQASSRDACDENAILKVMEFLALPRAD
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN PSLSTS  LLKSNSDSC HG SQ  KSS  ART EDG+SPMQASS DA DENAILKVMEFLALPRAD
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSS-GARTLEDGNSPMQASSRDACDENAILKVMEFLALPRAD

Query:  AIHLLAQYNGNAEMVIQQIFA
        AIHLLAQYNGNAEMVIQQIFA
Subjt:  AIHLLAQYNGNAEMVIQQIFA

TrEMBL top hitse value%identityAlignment
A0A6J1BXE9 uncharacterized protein LOC111005585 isoform X10.0e+0093.32Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIR IRRSLEAG ID AI LLRLHAPFILDDHRLLFRL KQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT
        SGRG L GMQNLSSSSK+ QSE EYCSSRN SFEVD+A SKLSDGEISV NSRVDSSPENIADVTSSQG ++ELRYA EPT+NREDCSTSDSIHVGNSRT
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT

Query:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN
        LQ NKNRGIVERSKRKRWRGRHDDR L+D+S SGCSKQELST TVAS TISK+QQNLEK LPLEST KEDKYEIVLGIREMASKRLAAEVVEEINALDPN
Subjt:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN

Query:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA
        FF+QNPI LFQLKQVEF KLVS+GDYSS LRVACTHLGPLAANDPSLLKQLKETLLALLLPNED+LGKGFPINALA+SLQVAFGRRLGIEEPQLMKLMRA
Subjt:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA

Query:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI
        TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPP LSTSS LLKSNSDSCT G SQ TKSSGART EDG+SP+QASSRDACDENAILKVMEFLALPRADAI
Subjt:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI

Query:  HLLAQYNGNAEMVIQQIFA
        HLLAQYNGNAEMVIQQIFA
Subjt:  HLLAQYNGNAEMVIQQIFA

A0A6J1GF09 uncharacterized protein LOC111453581 isoform X20.0e+0091.84Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPS----SPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQ
        MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPS    SPSPSPSPSSLSSSSYHSRLIIRQIRR LE+GDIDCAIDLLRLHAPFILDDHRLLFRLQKQ
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPS----SPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQ

Query:  KFIELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIH
        KFIELLRKGT EDR LAI+C+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCE RRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIH
Subjt:  KFIELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIH

Query:  KGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYR
        KGFCFREGV SPISDLTERLLLDERDPPATP+ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYR
Subjt:  KGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYR

Query:  GIVDSGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVG
        GIVDSGRGALSGMQN SSS KV QSE EYCSSRN S EVDYA SKLSDGEISV+NSRVDSSPENIADVTSSQG E +LRY+ EPTSNREDCSTSDSIHVG
Subjt:  GIVDSGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVG

Query:  NSRTLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINA
        NSRTLQ NKNRGIVERSKRKRWRGRHDDREL DLS SGCSKQE+STTTV S T+SKEQQNLEKHLPLESTGK+DKYEIVLGIRE+ASKRLAAEVVEEINA
Subjt:  NSRTLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINA

Query:  LDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMK
        +DP FF QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDILGKGFPIN+LA+SLQVAFGRRLGIEEPQLMK
Subjt:  LDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMK

Query:  LMRATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPR
        LM+ TL+SHSEWFKLQMCKDRFESLLKIDSLKEVNPP LSTS+ LLKSNSDSCT G SQ  KSSGART  DG+SP QASSRDACDENAILKVMEFLALPR
Subjt:  LMRATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPR

Query:  ADAIHLLAQYNGNAEMVIQQIFA
        ADAIHLLAQYNGNAEMVIQQIFA
Subjt:  ADAIHLLAQYNGNAEMVIQQIFA

A0A6J1HJH4 uncharacterized protein LOC1114635090.0e+0093.61Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
         LRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDK+NQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG GALSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA  KLSDGEISVSNSRVDSSPENIADVTSSQG NEVELRYASEPTSNREDCSTSDSIHVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQG-NEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKNRGIVERSKRKRWRGRHDDREL+D+S SGCSK ELST TVAS T+SKE+QNLEKH+P++STGKEDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN PSLSTS  LLKSNSDSC HG SQ  KSS  RT EDG+SPMQASS DA DENAILKVMEFLALPRADA
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADA

Query:  IHLLAQYNGNAEMVIQQIFA
        IHLLAQYNGNAEMVIQQIFA
Subjt:  IHLLAQYNGNAEMVIQQIFA

A0A6J1ISM6 uncharacterized protein LOC111478049 isoform X10.0e+0092.07Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPS  SPSPSPSSLSSSSYHSRLIIRQIRRSLE+GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGT EDR LAIQC+RT LAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCE RRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        FREGV SPISDLTERLLLDERDPPATP ESL+EAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT
        SGRGALSGMQN SSS KV QSE EYCSSRN S EVDYA SKLSDGEISV+NSRVDSSPENIADVTSSQG E +LRY+ EP SNREDCSTSD IHVGNSRT
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRT

Query:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN
        LQ NKNRGIVERSKRKRWRGRHDDREL DLS SGCSKQE+STTTVAS T+SKEQQNLEKHLPLESTGK+DKYEIVLGIRE+ASKRLAAEVVEEINA+DP 
Subjt:  LQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPN

Query:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA
        FF QNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAA+DPSLLKQLKE LLALLLPNEDI GKGFPINALA+SLQVAFGRRLGIEEPQLMKLM+ 
Subjt:  FFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRA

Query:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI
        TL+SHSEWFKLQMCKDRFESLLKIDSLKEVNPP LSTS+ LLKSNSDSCT G SQ  KSSGART  DG+SP QASSRDACDE+AILKVMEFLALPRADAI
Subjt:  TLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSSGARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAI

Query:  HLLAQYNGNAEMVIQQIFA
        HLLAQYNGNAEMVIQQIFA
Subjt:  HLLAQYNGNAEMVIQQIFA

A0A6J1KMW1 uncharacterized protein LOC1114971150.0e+0093.2Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
        MDS PLNWEALDALIIDFARSENLIEDSFSSSPP  PSPSPSPSSLSSSSYHSRLIIRQIRRSLE GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIE

Query:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
        LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC
Subjt:  LLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFC

Query:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
        F EGVSSPISDLTERLLLDE DPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD
Subjt:  FREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVD

Query:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEVELRYASEPTSNREDCSTSDSIHVGNSR
        SG G LSGMQNLSSSSKV QSE EYCSSRNTSFEVDYA SKLSDGEISVSNSRVDSSPENIADVTSSQ  NEVELRYASEPTSNREDCSTSDS+HVGNSR
Subjt:  SGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQ-GNEVELRYASEPTSNREDCSTSDSIHVGNSR

Query:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP
        TLQANKNRGIVERSKRKRWRGRHDDREL+D+S SGCSK ELS  TVAS  +SKEQQNLEKH+P++STG+EDKYEIVLGIRE+ASKRLAAEVVEEINALDP
Subjt:  TLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDP

Query:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR
        NFFVQNPI LFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAAN+PSLLKQLKETLLALLLPNED LGKGFP+NALA+SLQVAFGRRLGIEEP+LMKLMR
Subjt:  NFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMR

Query:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSS-GARTLEDGNSPMQASSRDACDENAILKVMEFLALPRAD
        AT+HSHSEWFKLQMCKDRFESLLKIDSLKEVN PSLSTS  LLKSNSDSC HG SQ  KSS  ART EDG+SPMQASS DA DENAILKVMEFLALPRAD
Subjt:  ATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSS-GARTLEDGNSPMQASSRDACDENAILKVMEFLALPRAD

Query:  AIHLLAQYNGNAEMVIQQIFA
        AIHLLAQYNGNAEMVIQQIFA
Subjt:  AIHLLAQYNGNAEMVIQQIFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G66810.1 CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595)1.9e-22661.23Show/hide
Query:  MDSTPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQK
        MDSTP+NWEALDALIIDF  SENL+ED+ ++     SP SSPS S SP S+SSSSYHSRLIIR+IR S+E+GDI+ AID+LR HAPF+LDDHR+LFRLQK
Subjt:  MDSTPLNWEALDALIIDFARSENLIEDSFSS-----SPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQK

Query:  QKFIELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISI
        QKFIELLRKGT E    AI CLRT +APCALDAYPEAYEEFKHVLLA IYDKD+QTSPV  EW E+RR+++AGLMSSVLRA +QAYDPVFSMTLRYLISI
Subjt:  QKFIELLRKGTAEDRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISI

Query:  HKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIY
        HKGFCF +G+SS +SDLT RLLL+ERD PATP ES+YE PPFDEVDIQALAHAVELTRQGA+DS++F KGDLF AFQNELCRM+LD+SVLDELV+EYCIY
Subjt:  HKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIY

Query:  RGIVDSGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHV
        RGIVD      S MQ ++  +K  QSE     SR+ S E+D   S+ SD E   + S +D S     +++  +G +V  RY SEPTS  EDCSTS S   
Subjt:  RGIVDSGRGALSGMQNLSSSSKVTQSEPEYCSSRNTSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHV

Query:  GNSRTLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEIN
         N+R L   ++    E +KRKRW GR    E+  L     +  E  T  +                       EDKYEI L ++E+ S+ +AAE   EI+
Subjt:  GNSRTLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQELSTTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEIN

Query:  ALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLM
         +DP+FF QNP  LF LKQVEFLKLVS+GD++ AL+VAC HLGPLAAND SLLK LKETLL LL P+    GK  P+N LA++LQV+ G RLGIEEP+LM
Subjt:  ALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQLKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLM

Query:  KLMRATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQ-GTKSSGARTLEDGNSP-----MQASSRDAC-DENAILKV
        K+++ATLH+H+EWFKLQMCKDRF +LLKIDSLKEVN   +   ++  KS  DS T+  SQ  T SS   T EDG S       Q   R+A  +E+AILKV
Subjt:  KLMRATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQ-GTKSSGARTLEDGNSP-----MQASSRDAC-DENAILKV

Query:  MEFLALPRADAIHLLAQYNGNAEMVIQQIF
        MEFLA+PR+DAI LL+QYNG+AE VIQQ+F
Subjt:  MEFLALPRADAIHLLAQYNGNAEMVIQQIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATCATCGATTTTGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCATCTTCCCCACCTTCTTCTCC
TTCTCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCCTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTCTGGAGGCCGGCGATATCGATTGCGCCATCG
ATCTCCTCCGCCTTCATGCACCCTTCATTCTCGACGATCATAGACTTCTATTCCGCTTGCAGAAGCAGAAATTTATCGAGCTTTTGAGGAAAGGGACTGCAGAGGATCGT
GATTTGGCCATTCAATGTCTCCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCTTACCCAGAAGCGTATGAGGAGTTCAAGCATGTTCTACTTGCCTTCATTTATGACAA
GGATAATCAAACATCCCCGGTGACATATGAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGTTTAATGTCCTCAGTCCTACGAGCTCATATGCAGGCATATGATCCAG
TCTTCTCAATGACTTTGAGATACTTGATAAGCATACATAAGGGGTTTTGCTTTCGTGAAGGTGTGTCGTCTCCCATATCAGATCTCACCGAGAGGTTGCTTCTAGATGAA
CGCGATCCACCTGCCACACCCAAGGAGAGCCTGTATGAAGCACCTCCTTTTGATGAGGTTGACATTCAAGCTCTAGCACATGCTGTAGAGCTAACAAGACAGGGGGCAAT
TGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCA
TCTACAGAGGAATTGTGGATTCTGGTCGTGGAGCTCTCTCTGGGATGCAGAATCTCTCTAGTTCATCAAAAGTTACTCAATCAGAGCCGGAGTATTGTTCATCAAGGAAT
ACTTCTTTTGAAGTGGACTATGCAGCCAGTAAACTTTCGGATGGTGAAATTTCTGTCAGTAATTCCCGTGTGGATAGTTCTCCTGAAAATATTGCTGACGTGACCAGTTC
ACAAGGTAATGAGGTTGAATTACGATATGCATCAGAGCCAACATCCAATCGAGAAGATTGTAGCACTAGTGATTCAATTCATGTGGGAAATTCAAGAACGTTACAAGCAA
ACAAGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGATGGAGAGGAAGACACGATGATAGAGAACTTTATGATTTGTCTTGCAGTGGATGCAGTAAACAAGAACTT
AGCACTACAACAGTGGCCAGTCCGACCATATCTAAGGAACAACAGAACCTTGAAAAACATTTACCACTAGAATCTACTGGCAAGGAGGATAAATATGAGATTGTCTTGGG
CATTAGAGAAATGGCAAGTAAAAGGTTGGCTGCAGAGGTTGTGGAAGAAATTAATGCCCTGGATCCGAACTTTTTTGTACAAAATCCTATTTTCCTATTCCAACTTAAGC
AGGTTGAATTTTTGAAGCTGGTTAGTTCCGGTGATTATTCTAGTGCTTTGAGGGTCGCATGCACTCACTTAGGCCCGTTAGCCGCTAATGATCCATCCTTGTTGAAGCAA
TTAAAGGAGACTTTATTGGCTTTGCTCCTGCCCAACGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAGTTCTCTTCAGGTTGCTTTTGGTAGGAGACT
TGGTATTGAAGAACCCCAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAGATGTGCAAGGATCGGTTCGAAAGTCTTTTGAAGA
TAGATTCCTTGAAGGAAGTTAATCCGCCTTCGCTTTCTACTTCTAGTGTGCTATTGAAATCGAATTCAGATAGTTGCACCCATGGTTTTTCCCAAGGCACAAAATCTTCG
GGTGCAAGAACCTTAGAAGATGGTAACAGTCCTATGCAAGCGTCATCTAGAGATGCTTGTGATGAAAATGCAATACTTAAAGTCATGGAGTTTCTTGCTTTGCCCAGGGC
TGATGCCATTCATCTTCTTGCACAGTATAATGGAAACGCAGAGATGGTCATCCAGCAAATATTTGCATAA
mRNA sequenceShow/hide mRNA sequence
TAGCCTTCAAATTCCATTCCTTTTAGCATCTCAACGTCGCCGCCTTTGGCCTGAGCGAGCGCACAAAGTTTCCCACCAAGTTAGCGAGCGAAGCCAGACCCTTCTCTTCC
AAATTCCAAATTCCAAATTCCAACTCTCCCACATCCTCTAATCCACAAGCTCGCCCTCTTCTTTCCAAATGGACTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTA
ATCATCGATTTTGCTAGATCAGAGAACTTGATTGAGGATTCCTTTTCATCTTCCCCACCTTCTTCTCCTTCTCCTTCCCCTTCCCCTTCCTCGCTTTCCTCCTCCTCTTA
CCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTCTGGAGGCCGGCGATATCGATTGCGCCATCGATCTCCTCCGCCTTCATGCACCCTTCATTCTCGACGATCATA
GACTTCTATTCCGCTTGCAGAAGCAGAAATTTATCGAGCTTTTGAGGAAAGGGACTGCAGAGGATCGTGATTTGGCCATTCAATGTCTCCGGACGGCGCTCGCTCCTTGT
GCTCTTGATGCTTACCCAGAAGCGTATGAGGAGTTCAAGCATGTTCTACTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCGGTGACATATGAGTGGTGTGAAAG
GAGGAGGTTTGATATTGCTGGTTTAATGTCCTCAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTTTGAGATACTTGATAAGCATACATAAGG
GGTTTTGCTTTCGTGAAGGTGTGTCGTCTCCCATATCAGATCTCACCGAGAGGTTGCTTCTAGATGAACGCGATCCACCTGCCACACCCAAGGAGAGCCTGTATGAAGCA
CCTCCTTTTGATGAGGTTGACATTCAAGCTCTAGCACATGCTGTAGAGCTAACAAGACAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATT
CCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCTACAGAGGAATTGTGGATTCTGGTCGTGGAGCTCTCTCTG
GGATGCAGAATCTCTCTAGTTCATCAAAAGTTACTCAATCAGAGCCGGAGTATTGTTCATCAAGGAATACTTCTTTTGAAGTGGACTATGCAGCCAGTAAACTTTCGGAT
GGTGAAATTTCTGTCAGTAATTCCCGTGTGGATAGTTCTCCTGAAAATATTGCTGACGTGACCAGTTCACAAGGTAATGAGGTTGAATTACGATATGCATCAGAGCCAAC
ATCCAATCGAGAAGATTGTAGCACTAGTGATTCAATTCATGTGGGAAATTCAAGAACGTTACAAGCAAACAAGAATCGTGGGATTGTAGAAAGGAGCAAGCGTAAGAGAT
GGAGAGGAAGACACGATGATAGAGAACTTTATGATTTGTCTTGCAGTGGATGCAGTAAACAAGAACTTAGCACTACAACAGTGGCCAGTCCGACCATATCTAAGGAACAA
CAGAACCTTGAAAAACATTTACCACTAGAATCTACTGGCAAGGAGGATAAATATGAGATTGTCTTGGGCATTAGAGAAATGGCAAGTAAAAGGTTGGCTGCAGAGGTTGT
GGAAGAAATTAATGCCCTGGATCCGAACTTTTTTGTACAAAATCCTATTTTCCTATTCCAACTTAAGCAGGTTGAATTTTTGAAGCTGGTTAGTTCCGGTGATTATTCTA
GTGCTTTGAGGGTCGCATGCACTCACTTAGGCCCGTTAGCCGCTAATGATCCATCCTTGTTGAAGCAATTAAAGGAGACTTTATTGGCTTTGCTCCTGCCCAACGAAGAT
ATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAGTTCTCTTCAGGTTGCTTTTGGTAGGAGACTTGGTATTGAAGAACCCCAACTAATGAAGTTGATGAGAGCCAC
ACTTCACTCTCATAGTGAATGGTTTAAACTTCAGATGTGCAAGGATCGGTTCGAAAGTCTTTTGAAGATAGATTCCTTGAAGGAAGTTAATCCGCCTTCGCTTTCTACTT
CTAGTGTGCTATTGAAATCGAATTCAGATAGTTGCACCCATGGTTTTTCCCAAGGCACAAAATCTTCGGGTGCAAGAACCTTAGAAGATGGTAACAGTCCTATGCAAGCG
TCATCTAGAGATGCTTGTGATGAAAATGCAATACTTAAAGTCATGGAGTTTCTTGCTTTGCCCAGGGCTGATGCCATTCATCTTCTTGCACAGTATAATGGAAACGCAGA
GATGGTCATCCAGCAAATATTTGCATAAATTGGCTTGTAAGAGTTGTAGCTTTATGATGGGGTTTATTTGTTTATTATTTGTCTAACATTATTTCATTCATTTTGTGTAC
TTTAATACTTGTATACCATACTTGTATATGCCAGAGCTCCAATGTATAAAACGTCCGTCCAACAAATATTATCCCTTGTGTTTTCTCATTATATTTATGGGGTGGAGGAT
GTGTAATCGGGCAGTTCGGGAATATGTAACTAAGGTTTGAAAGTGCATAAGGATGGATTTCAATCACATCAACTGTAAAGTAGATGTTATTTATTCAATAACATATTTTG
ATTTGCGAGAA
Protein sequenceShow/hide protein sequence
MDSTPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTAEDR
DLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDE
RDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSGMQNLSSSSKVTQSEPEYCSSRN
TSFEVDYAASKLSDGEISVSNSRVDSSPENIADVTSSQGNEVELRYASEPTSNREDCSTSDSIHVGNSRTLQANKNRGIVERSKRKRWRGRHDDRELYDLSCSGCSKQEL
STTTVASPTISKEQQNLEKHLPLESTGKEDKYEIVLGIREMASKRLAAEVVEEINALDPNFFVQNPIFLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLAANDPSLLKQ
LKETLLALLLPNEDILGKGFPINALASSLQVAFGRRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFESLLKIDSLKEVNPPSLSTSSVLLKSNSDSCTHGFSQGTKSS
GARTLEDGNSPMQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA