CuGenDBv2

CaUC06G113530 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC06G113530
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: AT-hook motif nuclear-localized protein 20
Genome location: Ciama_Chr06:8444702..8458800
RNA-Seq Expression: CaUC06G113530
Synteny: CaUC06G113530
Gene Ontology terms: GO:0045927 - positive regulation of growth (biological process)
InterPro domains: IPR005175 - PPC domain
IPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475


Homology
GenBank top hits (e value, % identity, alignment)
KAA0065685.1 uncharacterized protein E6C27_scaffold90G001520 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 94.84
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_008460732.1 PREDICTED: uncharacterized protein LOC103499494, partial [Cucumis melo] | e value: 4.4e-276 | % identity: 94.81
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLL
        WQSERNFEQHQIVTRTNVLL
Subjt:  WQSERNFEQHQIVTRTNVLL

XP_011649159.1 uncharacterized protein LOC101220789 [Cucumis sativus] | e value: 0.0e+00 | % identity: 94.49
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKP+IGG+ K+ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE ++SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVD DDRSCQ SGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_022138835.1 uncharacterized protein LOC111009906 [Momordica charantia] | e value: 8.2e-291 | % identity: 89.04
Query:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MVAEPW+VKMGNQVS+NLKHALLE SK NKN +KPEI  NTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDE+HLLEL V
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYE
        AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDI+NGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYE
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYE

Query:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSR
        QKL+WQKQ VGHLK+ISLWNQTYDKVVELLARTVCTVYARIHLVFGD FLKKDVNE            N  E +ESKRAS ++ P PRRGSS KS++S R
Subjt:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSR

Query:  RGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF
        RGEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL   DED+D Y DRDDRSCQ SGRSIRNSGSS FSSFSQVQFS+PFGVDQRQ  S MSNSGGNF
Subjt:  RGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF

Query:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN
        G KSRLS YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVK+LAIYDAP+AHDWKETLDGILSWLAPLAHN
Subjt:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN

Query:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS  T
Subjt:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_038875820.1 protein PSK SIMULATOR 1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 95.87
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNE+EQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDV+ENGSSNDVNHHVQ GAE MESKRAS ERG GPRRG S+KSQISSR+
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L  PDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGNFG K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LIJ7 Uncharacterized protein | e value: 0.0e+00 | % identity: 94.49
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKP+IGG+ K+ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE ++SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVD DDRSCQ SGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A1S3CD70 uncharacterized protein LOC103499494 | e value: 2.1e-276 | % identity: 94.81
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLL
        WQSERNFEQHQIVTRTNVLL
Subjt:  WQSERNFEQHQIVTRTNVLL

A0A5A7VIT4 Uncharacterized protein | e value: 0.0e+00 | % identity: 94.84
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRGSS KSQISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A6J1CAM1 uncharacterized protein LOC111009906 | e value: 4.0e-291 | % identity: 89.04
Query:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MVAEPW+VKMGNQVS+NLKHALLE SK NKN +KPEI  NTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDE+HLLEL V
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYE
        AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDI+NGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYE
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYE

Query:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSR
        QKL+WQKQ VGHLK+ISLWNQTYDKVVELLARTVCTVYARIHLVFGD FLKKDVNE            N  E +ESKRAS ++ P PRRGSS KS++S R
Subjt:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSR

Query:  RGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF
        RGEV L TPDDFNFPCGTNPGRLLMDCLSLSSSVSKL   DED+D Y DRDDRSCQ SGRSIRNSGSS FSSFSQVQFS+PFGVDQRQ  S MSNSGGNF
Subjt:  RGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF

Query:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN
        G KSRLS YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVK+LAIYDAP+AHDWKETLDGILSWLAPLAHN
Subjt:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN

Query:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS  T
Subjt:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A6J1EQU9 uncharacterized protein LOC111436967 | e value: 6.8e-275 | % identity: 85.99
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLK ALLE SKNK+        NTKQTIGILSFEVANVMSKTIYLHKSLS SAISKLKN+ILSSDGV+NL+S DEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
        EKLEDLNRVA+VVSRLGKKCSQPALQGF+HVYLDIINGVINVKELGFLVKDMEGMMKKMER+VNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR
        KLIWQKQDVGHLK+ISLWNQT+DKVVELLARTVCTVYARIHLVFGD FLKKD             V+ G+E +ESKR S    PGPRRGSS KSQIS+RR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRR

Query:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L TPDDFNFPCGTNP RLLMDCLSLSSSVSK+ DE++D     DDRSC+ SG SIRNSGSS F SFSQVQFS+PFGV+QR+AKS MSN+GG+FG K
Subjt:  GEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLR+SL  HLKSYVK+LAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
        WQSERNFEQHQIVTRTN+LLIQT+YFAD KKTE+AICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS

SwissProt top hits (e value, % identity, alignment)
O22130 AT-hook motif nuclear-localized protein 22 | e value: 1.2e-53 | % identity: 48.15
Query:  EDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLRQSA----GSAAVIQLH
        E    GGG     RR RGRP GSKNKPK PII+T D+ +AL++HV+E+  G DV  S+  F+ RRQRG+CVLSG+G V +VT+RQ A    G ++V+ LH
Subjt:  EDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLRQSA----GSAAVIQLH

Query:  GRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQERE-------ISPATTGGG-------
        GRF+ILSLSGSFLP  APP ++GLT+YL+GG+GQV+GG+VVGPL+A+GP++++AA+F +A YERLPL++D   ++            AT GGG       
Subjt:  GRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQERE-------ISPATTGGG-------

Query:  EVEEPPQFTRMATSIYDLMSPNDHDGV----DGYGWAHERPSF
        + ++  Q  +  TS    + PN  + V    + Y W   RPSF
Subjt:  EVEEPPQFTRMATSIYDLMSPNDHDGV----DGYGWAHERPSF

O23620 AT-hook motif nuclear-localized protein 23 | e value: 8.3e-52 | % identity: 55.87
Query:  GGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLRQSAGSAAVIQLHGRFDILSLS
        GGG VVG RR RGRPPGSKNKPK P+I+T ++ + LR H++E+  G DV   +  ++ RRQRG+CVLSGSGTV +V++RQ + + AV+ L G F+ILSLS
Subjt:  GGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLRQSAGSAAVIQLHGRFDILSLS

Query:  GSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEV
        GSFLP  APP +T LT++L+GG+GQV+GG+VVG L AAGP+I+IAA+F +  YERLPL++D   Q++++   + GGG +
Subjt:  GSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEV

O49662 AT-hook motif nuclear-localized protein 24 | e value: 1.7e-52 | % identity: 50.43
Query:  ISTTVDGSGRDHDDDEDEPNGGGVVVGN----RRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDV
        I+      G+D D       GGG   G+    RR RGRP GSKNKPK PII+T D+ +ALRTHV+EI  G D+  S+  F+ RRQRGVCV+SG+G V +V
Subjt:  ISTTVDGSGRDHDDDEDEPNGGGVVVGN----RRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDV

Query:  TLRQSA---GSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPAT
        T+RQ        +V+ LHGRF+ILSLSGSFLP  APP +TGL+VYL+GG+GQV+GG+VVGPLL AGP++++AA+F++A YERLPL++D   + +      
Subjt:  TLRQSA---GSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPAT

Query:  TGGGEVEEPP------QFTRMATSIYDLMSPN
         GGG +E PP      Q  + A S +  + PN
Subjt:  TGGGEVEEPP------QFTRMATSIYDLMSPN

Q8GWQ2 AT-hook motif nuclear-localized protein 20 | e value: 7.5e-61 | % identity: 52.21
Query:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTL
        G +   ++ S  +  D+ED+P  G V V NRR RGRPPGSKNKPK+PI VT D+P+ALR+HV+EI  G+DVA +I  FS RRQRGVCVLSG+G+V +VTL
Subjt:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTL

Query:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG
        RQ+A    V+ L GRF+ILSL+G+FLPG +PP STGLTVYL+G +GQV+GG+VVGPLLA G +++IAATF++ATYERLP++++ +    R+I     GGG
Subjt:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG

Query:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF
        +         P  + MA   Y++   + PN     G + Y W H RP +
Subjt:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF

Q9SR17 AT-hook motif nuclear-localized protein 19 | e value: 1.6e-55 | % identity: 57.28
Query:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLR
        VD +  D D      D+ EP  G V    RR RGRP GSKNKPK PI VT D+P+AL++HV+EI  G DV  ++  F+ RRQRG+C+LSG+GTV +VTLR
Subjt:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLR

Query:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP
        Q        + G AAV+ L GRF+ILSL+GSFLPG APP STGLT+YL+GG+GQV+GG+VVGPL+AAGP++LIAATF++ATYERLPL      +E E + 
Subjt:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP

Query:  ATTGGG
           GGG
Subjt:  ATTGGG

Arabidopsis top hits (e value, % identity, alignment)
AT3G04570.1 AT-hook motif nuclear-localized protein 19 | e value: 1.1e-56 | % identity: 57.28
Query:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLR
        VD +  D D      D+ EP  G V    RR RGRP GSKNKPK PI VT D+P+AL++HV+EI  G DV  ++  F+ RRQRG+C+LSG+GTV +VTLR
Subjt:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLR

Query:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP
        Q        + G AAV+ L GRF+ILSL+GSFLPG APP STGLT+YL+GG+GQV+GG+VVGPL+AAGP++LIAATF++ATYERLPL      +E E + 
Subjt:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP

Query:  ATTGGG
           GGG
Subjt:  ATTGGG

AT3G23160.1 Protein of unknown function (DUF668) | e value: 2.4e-187 | % identity: 61.29
Query:  MVAEPWIVKMGNQVSANLKHA-LLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MV+E WIVKM NQVS+NLKHA LLESS  K + KP      KQTIGILSFEVANVMSKTI+LH+SLS + ISKLK E+  S+GV+ L+SSDE HLL+L V
Subjt:  MVAEPWIVKMGNQVSANLKHA-LLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQ-NNQHEESRKAY
        +EKL+DL+RVA+VVSRLGKKC++PALQGF+HVY DI+NG I+ ++LGFLVKDME M+KKMER+VNAT +LY EMEV+NELEQA  K Q + QH+ES KA+
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQ-NNQHEESRKAY

Query:  EQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFG--------DPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGS
        EQKL+WQ+QDV  L+D SLWNQTYDKVVE+LARTVCT+Y RI  VFG        D  LK+D ++N +S  VN   ++ A   +S+R+ A+         
Subjt:  EQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFG--------DPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGS

Query:  SIKSQISSRRGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGM
               +R G        DFNFPCGTNPGR+ M+CL+++ ++    D+DDD     DD   +  GR                  + P       A+   
Subjt:  SIKSQISSRRGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGM

Query:  SNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSW
        SN    FG KSRL+ +A  STIGGSAL+LHYAN++IV+EKLL+YPHL+G+EARDDLYQMLPTSL+++LK  L+SY+KN++IYDAPLAHDWKET+DGILSW
Subjt:  SNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSW

Query:  LAPLAHNMIRWQSERNFE-QHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
        LAPLAHNMIRWQSERNFE Q+QIV RTNVLL+QTLYFADR+KTE AIC+LLVGLNYIC YE QQNALLDCASSFD+EDC EWQ QC+A+
Subjt:  LAPLAHNMIRWQSERNFE-QHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS

AT4G14465.1 AT-hook motif nuclear-localized protein 20 | e value: 5.3e-62 | % identity: 52.21
Query:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTL
        G +   ++ S  +  D+ED+P  G V V NRR RGRPPGSKNKPK+PI VT D+P+ALR+HV+EI  G+DVA +I  FS RRQRGVCVLSG+G+V +VTL
Subjt:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTL

Query:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG
        RQ+A    V+ L GRF+ILSL+G+FLPG +PP STGLTVYL+G +GQV+GG+VVGPLLA G +++IAATF++ATYERLP++++ +    R+I     GGG
Subjt:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG

Query:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF
        +         P  + MA   Y++   + PN     G + Y W H RP +
Subjt:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF

AT5G04550.1 Protein of unknown function (DUF668) | e value: 4.9e-84 | % identity: 35.31
Query:  KPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVANVVSRLGKKCSQPALQGFQHVY
        K   G   K  +G+L+FEVA+++SK ++L +SLS   +++L++EI  S G+K L+S D+  ++ L+  E +E++  VA  V+RL +KC+ P L+ F++ +
Subjt:  KPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVANVVSRLGKKCSQPALQGFQHVY

Query:  LDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQH-EESRKAYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLAR
         D++    +     F  K M+   KKMER++++ A+LY E E+L +LEQ  K+ ++N+   ++   Y++K+ W++ +V +L+D+SLWN+TYD  V LL R
Subjt:  LDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQH-EESRKAYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLAR

Query:  TVCTVYARIHLVFGDPFLKKDVNENGSSNDV---NHHVQNGAELMESKRASA-----ERGP-----GPRRGSSIK-------------SQISSRRGEVSL
        +V T+ +R   VFG  +  +  + + + +D    +H V      +  K  S+       GP     GP  GS+               S  S + G +  
Subjt:  TVCTVYARIHLVFGDPFLKKDVNENGSSNDV---NHHVQNGAELMESKRASA-----ERGP-----GPRRGSSIK-------------SQISSRRGEVSL

Query:  LTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVD--QRQAKSGMSNSGG--------
             F F  G      L    S S  +  +   +       +  S  +  + ++ +  +Q   F     S   G+     +A++G  NS          
Subjt:  LTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVD--QRQAKSGMSNSGG--------

Query:  -------NFGLKSR--LSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLA---IYDAPLAHDWKET
               N  L SR  LS  AP +T+G + LALHYAN+IIVIE+ +  PHL+GD+ARDDLY MLP S+R+SL+  LK Y KNL+   +YD  LA +W + 
Subjt:  -------NFGLKSR--LSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLA---IYDAPLAHDWKET

Query:  LDGILSWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQN--ALLDCASSFDFEDCME
        + GIL WL PLAHNMI+WQSER++E   +V+RT+++L QTL+FA+++KTE  I ELLVGLNY+ R+  + N  AL +C SS   E C++
Subjt:  LDGILSWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQN--ALLDCASSFDFEDCME

AT5G51670.1 Protein of unknown function (DUF668) | e value: 9.4e-67 | % identity: 33.86
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        M  E +++K+ N +S+        +S+  +   P I   T  ++G+LSFEVA VM+K ++L  SL+ S +   ++  LS +G+  +++ DE   L LV A
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKK--------FQNNQHE
        E  + L   AN VSRL  +C+  +L+ F  ++ +  +   +        KD E   KK+ERYV+ T  LY EME +  LE + +K        F+  +  
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKK--------FQNNQHE

Query:  ESRK------AYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGP
        E++K        + K+  QKQ V +LKD SLWN+++D VV +LAR+V T  AR+  VF                                 A+A     P
Subjt:  ESRK------AYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGP

Query:  RRGSSIKSQISSRRGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQA
           SS+   +SS    ++L+ P                             DE+ D                 + + SS F   S               
Subjt:  RRGSSIKSQISSRRGEVSLLTPDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQA

Query:  KSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDG
                      SRL +  P +T+GG+ +ALHYAN+I+V+EK+++ P LVG +ARDDLY MLP S+RSSL++ LK         D  LA +WK  L  
Subjt:  KSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDG

Query:  ILSWLAPLAHNMIRWQSERNFEQHQIVTRTN----VLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNA
        IL WL PLA NMIRWQSER+FEQ  + T TN    V+L+QTL FAD+ KTE AI ELLVGLNYI R+E +  A
Subjt:  ILSWLAPLAHNMIRWQSERNFEQHQIVTRTN----VLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNA


Sequences
CDS sequence
ATGGTTGCAGAGCCTTGGATTGTGAAAATGGGGAACCAGGTTAGTGCTAATCTCAAACACGCCCTCCTCGAATCTTCTAAGAACAAGAACTCTAGAAAACCAGAAATCGG
TGGCAATACTAAGCAAACAATCGGTATTCTCTCTTTTGAAGTTGCTAATGTAATGTCCAAAACGATTTACCTTCACAAATCCCTGTCTCATTCTGCCATTTCGAAGCTCA
AGAACGAGATCTTGAGCTCCGATGGAGTTAAAAACCTTATCTCCTCTGATGAGGTTCATCTTCTCGAGCTTGTGGTTGCTGAGAAGCTTGAGGATTTAAACCGAGTGGCT
AATGTGGTTTCTAGGCTTGGGAAGAAGTGTTCTCAACCTGCGTTACAAGGGTTTCAACATGTGTACTTGGATATCATTAATGGAGTTATAAATGTGAAGGAATTGGGGTT
TCTAGTGAAGGATATGGAGGGGATGATGAAAAAAATGGAGAGGTATGTGAATGCTACGGCAAATTTGTACACTGAAATGGAGGTTTTGAATGAATTGGAACAGGCTGCAA
AGAAGTTTCAGAACAATCAGCATGAAGAGAGCAGAAAGGCGTATGAACAGAAACTTATATGGCAGAAACAAGATGTGGGGCATCTCAAGGATATTTCTCTTTGGAACCAA
ACGTATGATAAGGTTGTTGAACTGCTGGCGAGAACTGTTTGTACAGTTTATGCGAGGATTCATTTAGTATTTGGGGACCCTTTTCTGAAGAAGGATGTGAATGAGAATGG
TTCAAGTAATGATGTGAATCATCATGTTCAAAATGGTGCGGAGTTGATGGAATCGAAACGTGCGTCTGCCGAGAGAGGCCCTGGACCAAGACGGGGGTCGAGTATTAAGT
CTCAAATTAGCTCAAGAAGAGGTGAGGTTTCATTGCTTACACCTGATGACTTCAATTTCCCTTGTGGAACAAATCCTGGGAGGCTTCTCATGGATTGCCTTAGTTTAAGT
AGTTCAGTTTCCAAGTTGTATGATGAGGATGATGACAGTTATGTTGACCGTGACGATCGAAGCTGCCAAACCTCCGGTCGTAGCATTAGGAACAGTGGTTCCAGCCAATT
TAGTTCTTTCAGTCAGGTACAGTTCTCTATTCCATTTGGGGTGGATCAAAGACAAGCAAAAAGTGGGATGTCTAACAGTGGAGGAAACTTTGGCCTCAAGAGTAGGTTAT
CTATCTACGCACCGGTTTCAACCATCGGAGGCTCTGCTCTTGCTTTGCATTATGCAAATATCATCATTGTCATTGAAAAGTTACTTCGTTATCCTCATTTAGTTGGTGAC
GAAGCTCGAGACGATTTGTATCAGATGCTACCAACAAGTTTAAGATCATCCCTGAAAACACATTTGAAATCTTATGTTAAGAACCTGGCCATATATGATGCTCCTCTTGC
CCATGATTGGAAGGAGACTCTCGATGGGATCTTGAGCTGGCTCGCTCCGTTGGCACACAACATGATTCGGTGGCAAAGCGAGCGTAATTTCGAACAACATCAGATTGTGA
CTCGAACAAACGTATTGCTGATCCAAACTTTGTATTTTGCTGATCGCAAGAAGACTGAGGAAGCCATATGTGAGCTTCTTGTTGGATTGAATTACATTTGCCGCTACGAG
CATCAGCAAAATGCATTGCTGGATTGTGCAAGTAGTTTCGACTTTGAAGATTGTATGGAGTGGCAGTTGCAATGTAAAGCTTCCTTGAGCACACTGAAAGACATGATTGA
TGGGGACTTCAATTCGTCACTGTTTGCTGTTGAGAGGGAGCACCGTTCTTCTAGATCTGCTCTCTTTGATGACCTTGAGGAGGGTGGTCTCAGGACTAGCTCTTCCGTTG
AAATTAAAGAGCATGACAATGACAAAGCCCTCCATACTTTGGAAGATAGAGTTTCTATTCTCAAGAGGTTTTCATTTCCAGCAAAAAGGAAGTCTGATCCCCCTCCCGGG
ATTTGTCTTCTTGATCCAGCAGCTCTTGATGGAAATGGCATGGATGCTTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTCGAGCAAAAATCAAA
GTGGAGAACTTGTAGACTTGCGTTGTATTTTGTGTTGTCCTTTCTACTACTTTTCTATCTCATCAGAAAGTATGCACGCTATCCATGCTGTTTGGTCGAGGTCAATCGGT
TGGGCTTTCCGGCCACCGGCCCAATATCAACCACCGTTGACGGCAGTGGAAGAGATCATGACGACGACGAAGACGAGCCAAATGGAGGCGGCGTGGTGGTCGGAAACCGC
CGATCGAGAGGACGGCCACCGGGATCGAAAAACAAACCAAAATCTCCAATCATCGTCACGAGCGACAACCCTCACGCGCTTCGCACGCACGTGATCGAGATCGTCGGAGG
AGCCGACGTCGCCGGCAGCATAAAGCAATTCTCCTGCCGCCGACAACGTGGGGTTTGTGTTCTCAGTGGTAGCGGTACGGTTGTGGACGTAACTCTCCGGCAATCCGCCG
GTTCTGCCGCCGTGATCCAACTTCACGGCCGGTTCGATATTCTATCTCTAAGTGGGTCTTTTCTTCCCGGTCGAGCCCCTCCCTGTTCGACCGGGCTGACCGTGTACTTG
TCGGGCGGTGAGGGGCAGGTGATTGGAGGGACGGTGGTGGGCCCATTGTTGGCAGCTGGGCCTATAATTTTGATAGCTGCTACTTTTGCTAGTGCTACTTATGAGAGATT
GCCTTTACAAGACGACCATAATAATCAAGAAAGAGAGATTTCTCCGGCCACCACGGGCGGCGGAGAAGTGGAGGAGCCGCCGCAGTTTACTCGGATGGCAACTTCAATTT
ACGACTTGATGTCACCAAATGATCATGATGGAGTTGATGGGTATGGTTGGGCTCATGAGAGACCATCTTTTGTTTAA
mRNA sequence
ATGGTTGCAGAGCCTTGGATTGTGAAAATGGGGAACCAGGTTAGTGCTAATCTCAAACACGCCCTCCTCGAATCTTCTAAGAACAAGAACTCTAGAAAACCAGAAATCGG
TGGCAATACTAAGCAAACAATCGGTATTCTCTCTTTTGAAGTTGCTAATGTAATGTCCAAAACGATTTACCTTCACAAATCCCTGTCTCATTCTGCCATTTCGAAGCTCA
AGAACGAGATCTTGAGCTCCGATGGAGTTAAAAACCTTATCTCCTCTGATGAGGTTCATCTTCTCGAGCTTGTGGTTGCTGAGAAGCTTGAGGATTTAAACCGAGTGGCT
AATGTGGTTTCTAGGCTTGGGAAGAAGTGTTCTCAACCTGCGTTACAAGGGTTTCAACATGTGTACTTGGATATCATTAATGGAGTTATAAATGTGAAGGAATTGGGGTT
TCTAGTGAAGGATATGGAGGGGATGATGAAAAAAATGGAGAGGTATGTGAATGCTACGGCAAATTTGTACACTGAAATGGAGGTTTTGAATGAATTGGAACAGGCTGCAA
AGAAGTTTCAGAACAATCAGCATGAAGAGAGCAGAAAGGCGTATGAACAGAAACTTATATGGCAGAAACAAGATGTGGGGCATCTCAAGGATATTTCTCTTTGGAACCAA
ACGTATGATAAGGTTGTTGAACTGCTGGCGAGAACTGTTTGTACAGTTTATGCGAGGATTCATTTAGTATTTGGGGACCCTTTTCTGAAGAAGGATGTGAATGAGAATGG
TTCAAGTAATGATGTGAATCATCATGTTCAAAATGGTGCGGAGTTGATGGAATCGAAACGTGCGTCTGCCGAGAGAGGCCCTGGACCAAGACGGGGGTCGAGTATTAAGT
CTCAAATTAGCTCAAGAAGAGGTGAGGTTTCATTGCTTACACCTGATGACTTCAATTTCCCTTGTGGAACAAATCCTGGGAGGCTTCTCATGGATTGCCTTAGTTTAAGT
AGTTCAGTTTCCAAGTTGTATGATGAGGATGATGACAGTTATGTTGACCGTGACGATCGAAGCTGCCAAACCTCCGGTCGTAGCATTAGGAACAGTGGTTCCAGCCAATT
TAGTTCTTTCAGTCAGGTACAGTTCTCTATTCCATTTGGGGTGGATCAAAGACAAGCAAAAAGTGGGATGTCTAACAGTGGAGGAAACTTTGGCCTCAAGAGTAGGTTAT
CTATCTACGCACCGGTTTCAACCATCGGAGGCTCTGCTCTTGCTTTGCATTATGCAAATATCATCATTGTCATTGAAAAGTTACTTCGTTATCCTCATTTAGTTGGTGAC
GAAGCTCGAGACGATTTGTATCAGATGCTACCAACAAGTTTAAGATCATCCCTGAAAACACATTTGAAATCTTATGTTAAGAACCTGGCCATATATGATGCTCCTCTTGC
CCATGATTGGAAGGAGACTCTCGATGGGATCTTGAGCTGGCTCGCTCCGTTGGCACACAACATGATTCGGTGGCAAAGCGAGCGTAATTTCGAACAACATCAGATTGTGA
CTCGAACAAACGTATTGCTGATCCAAACTTTGTATTTTGCTGATCGCAAGAAGACTGAGGAAGCCATATGTGAGCTTCTTGTTGGATTGAATTACATTTGCCGCTACGAG
CATCAGCAAAATGCATTGCTGGATTGTGCAAGTAGTTTCGACTTTGAAGATTGTATGGAGTGGCAGTTGCAATGTAAAGCTTCCTTGAGCACACTGAAAGACATGATTGA
TGGGGACTTCAATTCGTCACTGTTTGCTGTTGAGAGGGAGCACCGTTCTTCTAGATCTGCTCTCTTTGATGACCTTGAGGAGGGTGGTCTCAGGACTAGCTCTTCCGTTG
AAATTAAAGAGCATGACAATGACAAAGCCCTCCATACTTTGGAAGATAGAGTTTCTATTCTCAAGAGGTTTTCATTTCCAGCAAAAAGGAAGTCTGATCCCCCTCCCGGG
ATTTGTCTTCTTGATCCAGCAGCTCTTGATGGAAATGGCATGGATGCTTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTCGAGCAAAAATCAAA
GTGGAGAACTTGTAGACTTGCGTTGTATTTTGTGTTGTCCTTTCTACTACTTTTCTATCTCATCAGAAAGTATGCACGCTATCCATGCTGTTTGGTCGAGGTCAATCGGT
TGGGCTTTCCGGCCACCGGCCCAATATCAACCACCGTTGACGGCAGTGGAAGAGATCATGACGACGACGAAGACGAGCCAAATGGAGGCGGCGTGGTGGTCGGAAACCGC
CGATCGAGAGGACGGCCACCGGGATCGAAAAACAAACCAAAATCTCCAATCATCGTCACGAGCGACAACCCTCACGCGCTTCGCACGCACGTGATCGAGATCGTCGGAGG
AGCCGACGTCGCCGGCAGCATAAAGCAATTCTCCTGCCGCCGACAACGTGGGGTTTGTGTTCTCAGTGGTAGCGGTACGGTTGTGGACGTAACTCTCCGGCAATCCGCCG
GTTCTGCCGCCGTGATCCAACTTCACGGCCGGTTCGATATTCTATCTCTAAGTGGGTCTTTTCTTCCCGGTCGAGCCCCTCCCTGTTCGACCGGGCTGACCGTGTACTTG
TCGGGCGGTGAGGGGCAGGTGATTGGAGGGACGGTGGTGGGCCCATTGTTGGCAGCTGGGCCTATAATTTTGATAGCTGCTACTTTTGCTAGTGCTACTTATGAGAGATT
GCCTTTACAAGACGACCATAATAATCAAGAAAGAGAGATTTCTCCGGCCACCACGGGCGGCGGAGAAGTGGAGGAGCCGCCGCAGTTTACTCGGATGGCAACTTCAATTT
ACGACTTGATGTCACCAAATGATCATGATGGAGTTGATGGGTATGGTTGGGCTCATGAGAGACCATCTTTTGTTTAA
Protein sequence
MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVA
NVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESRKAYEQKLIWQKQDVGHLKDISLWNQ
TYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGSSIKSQISSRRGEVSLLTPDDFNFPCGTNPGRLLMDCLSLS
SSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGD
EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYE
HQQNALLDCASSFDFEDCMEWQLQCKASLSTLKDMIDGDFNSSLFAVEREHRSSRSALFDDLEEGGLRTSSSVEIKEHDNDKALHTLEDRVSILKRFSFPAKRKSDPPPG
ICLLDPAALDGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLALYFVLSFLLLFYLIRKYARYPCCLVEVNRLGFPATGPISTTVDGSGRDHDDDEDEPNGGGVVVGNR
RSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIVGGADVAGSIKQFSCRRQRGVCVLSGSGTVVDVTLRQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYL
SGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEVEEPPQFTRMATSIYDLMSPNDHDGVDGYGWAHERPSFV