; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G07200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G07200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAT-hook motif nuclear-localized protein 20
Genome locationClcChr06:7612216..7626285
RNA-Seq ExpressionClc06G07200
SyntenyClc06G07200
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR005175 - PPC domain
IPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065685.1 uncharacterized protein E6C27_scaffold90G001520 [Cucumis melo var. makuwa]7.5e-31094.15Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_008460732.1 PREDICTED: uncharacterized protein LOC103499494, partial [Cucumis melo]1.2e-27394.04Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLL
        WQSERNFEQHQIVTRTNVLL
Subjt:  WQSERNFEQHQIVTRTNVLL

XP_011649159.1 uncharacterized protein LOC101220789 [Cucumis sativus]1.5e-30993.8Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKP+IGG+ K+ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE ++SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVD DDRSCQ SGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_022138835.1 uncharacterized protein LOC111009906 [Momordica charantia]1.6e-28988.87Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MVAEPW+VKMGNQVS+NLKHALLE SK NKN +KPEI  NTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDE+HLLEL V
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE
        AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDI+NGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE

Query:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSR
        QKL+WQKQ VGHLK+ISLWNQTYDKVVELLARTVCTVYARIHLVFGD FLKKDVNE            N  E +ESKRAS ++ P PRRG S KS +S R
Subjt:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSR

Query:  RGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF
        RGEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL   DED+D Y DRDDRSCQ SGRSIRNSGSS FSSFSQVQFS+PFGVDQRQ  S MSNSGGNF
Subjt:  RGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF

Query:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN
        G KSRLS YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVK+LAIYDAP+AHDWKETLDGILSWLAPLAHN
Subjt:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN

Query:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS  T
Subjt:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

XP_038875820.1 protein PSK SIMULATOR 1 [Benincasa hispida]0.0e+0095.52Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNE+EQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDV+ENGSSNDVNHHVQ GAE MESKRAS ERG GPRRGLS+KS ISSR+
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L   DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGNFG K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

TrEMBL top hitse value%identityAlignment
A0A0A0LIJ7 Uncharacterized protein7.3e-31093.8Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKP+IGG+ K+ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE ++SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVD DDRSCQ SGRSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSNSGGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A1S3CD70 uncharacterized protein LOC1034994945.8e-27494.04Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLL
        WQSERNFEQHQIVTRTNVLL
Subjt:  WQSERNFEQHQIVTRTNVLL

A0A5A7VIT4 Uncharacterized protein3.6e-31094.15Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLKHALLE SKNKNSRKPE GGN KQ IGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EK+EDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQ GAE M+SKR S ERG GPRRG S KS ISSRR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL DED+DSYVDRDDRSCQ S RSIRNSGSSQFSSFSQVQFS+PFGVDQRQAKS MSN GGN G K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVSTIGGSALALHYANIIIVIEKLLRYPHLVG+EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCK S  T
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A6J1CAM1 uncharacterized protein LOC1110099067.5e-29088.87Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MVAEPW+VKMGNQVS+NLKHALLE SK NKN +KPEI  NTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNL+SSDE+HLLEL V
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSK-NKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE
        AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDI+NGVINVKELGFLVKDMEGMM+KMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYE

Query:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSR
        QKL+WQKQ VGHLK+ISLWNQTYDKVVELLARTVCTVYARIHLVFGD FLKKDVNE            N  E +ESKRAS ++ P PRRG S KS +S R
Subjt:  QKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSR

Query:  RGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF
        RGEV L T DDFNFPCGTNPGRLLMDCLSLSSSVSKL   DED+D Y DRDDRSCQ SGRSIRNSGSS FSSFSQVQFS+PFGVDQRQ  S MSNSGGNF
Subjt:  RGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKL--YDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNF

Query:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN
        G KSRLS YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVK+LAIYDAP+AHDWKETLDGILSWLAPLAHN
Subjt:  GLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHN

Query:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST
        MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS  T
Subjt:  MIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKASLST

A0A6J1EQU9 uncharacterized protein LOC1114369671.9e-27285.29Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        MVAEPWIVKMGNQVSANLK ALLE SKNK+        NTKQTIGILSFEVANVMSKTIYLHKSLS SAISKLKN+ILSSDGV+NL+S DEVHLLELVVA
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ
        EKLEDLNRVA+VVSRLGKKCSQPALQGF+HVYLDIINGVINVKELGFLVKDMEGMMKKMER+VNATANLYTEMEVLNELEQAAKKFQNNQHEES+KAYEQ
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQ

Query:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR
        KLIWQKQDVGHLK+ISLWNQT+DKVVELLARTVCTVYARIHLVFGD FLKKD             V+ G+E +ESKR S    PGPRRG S KS IS+RR
Subjt:  KLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRR

Query:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK
        GEV L T DDFNFPCGTNP RLLMDCLSLSSSVSK+ DE++D     DDRSC+ SG SIRNSGSS F SFSQVQFS+PFGV+QR+AKS MSN+GG+FG K
Subjt:  GEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLK

Query:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
        SRLS+YAPVST+GGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLR+SL  HLKSYVK+LAIYDAPLAHDWKETLDGILSWLAPLAHNMIR
Subjt:  SRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIR

Query:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
        WQSERNFEQHQIVTRTN+LLIQT+YFAD KKTE+AICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
Subjt:  WQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS

SwissProt top hitse value%identityAlignment
O22130 AT-hook motif nuclear-localized protein 224.4e-5348.15Show/hide
Query:  EDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLRQSA----GSAAVIQLH
        E    GGG     RR RGRP GSKNKPK PII+T D+ +AL++HV+E+  G DV  S+  F+ RRQRG+CVLSG G V +VT+RQ A    G ++V+ LH
Subjt:  EDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLRQSA----GSAAVIQLH

Query:  GRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQERE-------ISPATTGGG-------
        GRF+ILSLSGSFLP  APP ++GLT+YL+GG+GQV+GG+VVGPL+A+GP++++AA+F +A YERLPL++D   ++            AT GGG       
Subjt:  GRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQERE-------ISPATTGGG-------

Query:  EVEEPPQFTRMATSIYDLMSPNDHDGV----DGYGWAHERPSF
        + ++  Q  +  TS    + PN  + V    + Y W   RPSF
Subjt:  EVEEPPQFTRMATSIYDLMSPNDHDGV----DGYGWAHERPSF

O23620 AT-hook motif nuclear-localized protein 237.0e-5155.31Show/hide
Query:  GGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLRQSAGSAAVIQLHGRFDILSLS
        GGG VVG RR RGRPPGSKNKPK P+I+T ++ + LR H++E+  G DV   +  ++ RRQRG+CVLSG GTV +V++RQ + + AV+ L G F+ILSLS
Subjt:  GGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLRQSAGSAAVIQLHGRFDILSLS

Query:  GSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEV
        GSFLP  APP +T LT++L+GG+GQV+GG+VVG L AAGP+I+IAA+F +  YERLPL++D   Q++++   + GGG +
Subjt:  GSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEV

O49662 AT-hook motif nuclear-localized protein 241.1e-5150.43Show/hide
Query:  ISTTVDGSGRDHDDDEDEPNGGGVVVGN----RRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDV
        I+      G+D D       GGG   G+    RR RGRP GSKNKPK PII+T D+ +ALRTHV+EI  G D+  S+  F+ RRQRGVCV+SG G V +V
Subjt:  ISTTVDGSGRDHDDDEDEPNGGGVVVGN----RRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDV

Query:  TLRQSA---GSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPAT
        T+RQ        +V+ LHGRF+ILSLSGSFLP  APP +TGL+VYL+GG+GQV+GG+VVGPLL AGP++++AA+F++A YERLPL++D   + +      
Subjt:  TLRQSA---GSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPAT

Query:  TGGGEVEEPP------QFTRMATSIYDLMSPN
         GGG +E PP      Q  + A S +  + PN
Subjt:  TGGGEVEEPP------QFTRMATSIYDLMSPN

Q8GWQ2 AT-hook motif nuclear-localized protein 203.7e-6052.21Show/hide
Query:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTL
        G +   ++ S  +  D+ED+P  G V V NRR RGRPPGSKNKPK+PI VT D+P+ALR+HV+EI  G+DVA +I  FS RRQRGVCVLSG G+V +VTL
Subjt:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTL

Query:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG
        RQ+A    V+ L GRF+ILSL+G+FLPG +PP STGLTVYL+G +GQV+GG+VVGPLLA G +++IAATF++ATYERLP++++ +    R+I     GGG
Subjt:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG

Query:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF
        +         P  + MA   Y++   + PN     G + Y W H RP +
Subjt:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF

Q9SR17 AT-hook motif nuclear-localized protein 196.1e-5557.28Show/hide
Query:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLR
        VD +  D D      D+ EP  G V    RR RGRP GSKNKPK PI VT D+P+AL++HV+EI  G DV  ++  F+ RRQRG+C+LSG GTV +VTLR
Subjt:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLR

Query:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP
        Q        + G AAV+ L GRF+ILSL+GSFLPG APP STGLT+YL+GG+GQV+GG+VVGPL+AAGP++LIAATF++ATYERLPL      +E E + 
Subjt:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP

Query:  ATTGGG
           GGG
Subjt:  ATTGGG

Arabidopsis top hitse value%identityAlignment
AT3G04570.1 AT-hook motif nuclear-localized protein 194.4e-5657.28Show/hide
Query:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLR
        VD +  D D      D+ EP  G V    RR RGRP GSKNKPK PI VT D+P+AL++HV+EI  G DV  ++  F+ RRQRG+C+LSG GTV +VTLR
Subjt:  VDGSGRDHD-----DDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLR

Query:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP
        Q        + G AAV+ L GRF+ILSL+GSFLPG APP STGLT+YL+GG+GQV+GG+VVGPL+AAGP++LIAATF++ATYERLPL      +E E + 
Subjt:  Q--------SAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISP

Query:  ATTGGG
           GGG
Subjt:  ATTGGG

AT3G23160.1 Protein of unknown function (DUF668)2.2e-18861.19Show/hide
Query:  MVAEPWIVKMGNQVSANLKHA-LLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV
        MV+E WIVKM NQVS+NLKHA LLESS  K + KP      KQTIGILSFEVANVMSKTI+LH+SLS + ISKLK E+  S+GV+ L+SSDE HLL+L V
Subjt:  MVAEPWIVKMGNQVSANLKHA-LLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVV

Query:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQ-NNQHEESKKAY
        +EKL+DL+RVA+VVSRLGKKC++PALQGF+HVY DI+NG I+ ++LGFLVKDME M+KKMER+VNAT +LY EMEV+NELEQA  K Q + QH+ES KA+
Subjt:  AEKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQ-NNQHEESKKAY

Query:  EQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFG--------DPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGL
        EQKL+WQ+QDV  L+D SLWNQTYDKVVE+LARTVCT+Y RI  VFG        D  LK+D ++N +S  VN                           
Subjt:  EQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFG--------DPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGL

Query:  SIKSHISSRRGEVSLLT-TDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSG
        S+     SRR E    T   DFNFPCGTNPGR+ M+CL+++ ++    D+DDD     DD   +  GR                  + P       A+  
Subjt:  SIKSHISSRRGEVSLLT-TDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSG

Query:  MSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILS
         SN    FG KSRL+ +A  STIGGSAL+LHYAN++IV+EKLL+YPHL+G+EARDDLYQMLPTSL+++LK  L+SY+KN++IYDAPLAHDWKET+DGILS
Subjt:  MSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILS

Query:  WLAPLAHNMIRWQSERNFE-QHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS
        WLAPLAHNMIRWQSERNFE Q+QIV RTNVLL+QTLYFADR+KTE AIC+LLVGLNYIC YE QQNALLDCASSFD+EDC EWQ QC+A+
Subjt:  WLAPLAHNMIRWQSERNFE-QHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNALLDCASSFDFEDCMEWQLQCKAS

AT4G14465.1 AT-hook motif nuclear-localized protein 202.6e-6152.21Show/hide
Query:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTL
        G +   ++ S  +  D+ED+P  G V V NRR RGRPPGSKNKPK+PI VT D+P+ALR+HV+EI  G+DVA +I  FS RRQRGVCVLSG G+V +VTL
Subjt:  GPISTTVDGSGRDHDDDEDEPNGGGVVVGNRRSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTL

Query:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG
        RQ+A    V+ L GRF+ILSL+G+FLPG +PP STGLTVYL+G +GQV+GG+VVGPLLA G +++IAATF++ATYERLP++++ +    R+I     GGG
Subjt:  RQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYLSGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHN-NQEREISPATTGGG

Query:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF
        +         P  + MA   Y++   + PN     G + Y W H RP +
Subjt:  EV-----EEPPQFTRMATSIYDL---MSPNDHD--GVDGYGWAHERPSF

AT5G04550.1 Protein of unknown function (DUF668)1.1e-8334.93Show/hide
Query:  KPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVANVVSRLGKKCSQPALQGFQHVY
        K   G   K  +G+L+FEVA+++SK ++L +SLS   +++L++EI  S G+K L+S D+  ++ L+  E +E++  VA  V+RL +KC+ P L+ F++ +
Subjt:  KPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVANVVSRLGKKCSQPALQGFQHVY

Query:  LDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQH-EESKKAYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLAR
         D++    +     F  K M+   KKMER++++ A+LY E E+L +LEQ  K+ ++N+   ++   Y++K+ W++ +V +L+D+SLWN+TYD  V LL R
Subjt:  LDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQH-EESKKAYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLAR

Query:  TVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLS-IKSHISSRRGEVSLLTTDDFNFPCGT------NPGRLLM
        +V T+ +R   VFG  +  +  + + + +D      + + ++      +E    PR     +        G  +  +T   +F  G+        G L+ 
Subjt:  TVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLS-IKSHISSRRGEVSLLTTDDFNFPCGT------NPGRLLM

Query:  D----CLSLSSSVSKLYDEDDD--SYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPF--------GVD--QRQAKSGMSNSGG-------------
        +        S  + K+  +          + +  QT  R   +S   Q  +    Q   PF        G+     +A++G  NS               
Subjt:  D----CLSLSSSVSKLYDEDDD--SYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPF--------GVD--QRQAKSGMSNSGG-------------

Query:  --NFGLKSR--LSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLA---IYDAPLAHDWKETLDGIL
          N  L SR  LS  AP +T+G + LALHYAN+IIVIE+ +  PHL+GD+ARDDLY MLP S+R+SL+  LK Y KNL+   +YD  LA +W + + GIL
Subjt:  --NFGLKSR--LSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLA---IYDAPLAHDWKETLDGIL

Query:  SWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQN--ALLDCASSFDFEDCME
         WL PLAHNMI+WQSER++E   +V+RT+++L QTL+FA+++KTE  I ELLVGLNY+ R+  + N  AL +C SS   E C++
Subjt:  SWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQN--ALLDCASSFDFEDCME

AT5G51670.1 Protein of unknown function (DUF668)1.2e-6633.86Show/hide
Query:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA
        M  E +++K+ N +S+        +S+  +   P I   T  ++G+LSFEVA VM+K ++L  SL+ S +   ++  LS +G+  +++ DE   L LV A
Subjt:  MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVA

Query:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKK--------FQNNQHE
        E  + L   AN VSRL  +C+  +L+ F  ++ +  +   +        KD E   KK+ERYV+ T  LY EME +  LE + +K        F+  +  
Subjt:  EKLEDLNRVANVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKK--------FQNNQHE

Query:  ESKK------AYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGP
        E+KK        + K+  QKQ V +LKD SLWN+++D VV +LAR+V T  AR+  VF                                          
Subjt:  ESKK------AYEQKLIWQKQDVGHLKDISLWNQTYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGP

Query:  RRGLSIKSHISSRRGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQA
          G  + + +SS    +S                       S SSS++ ++   +D   +RD           + + SS F   S               
Subjt:  RRGLSIKSHISSRRGEVSLLTTDDFNFPCGTNPGRLLMDCLSLSSSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQA

Query:  KSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDG
                      SRL +  P +T+GG+ +ALHYAN+I+V+EK+++ P LVG +ARDDLY MLP S+RSSL++ LK         D  LA +WK  L  
Subjt:  KSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGDEARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDG

Query:  ILSWLAPLAHNMIRWQSERNFEQHQIVTRTN----VLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNA
        IL WL PLA NMIRWQSER+FEQ  + T TN    V+L+QTL FAD+ KTE AI ELLVGLNYI R+E +  A
Subjt:  ILSWLAPLAHNMIRWQSERNFEQHQIVTRTN----VLLIQTLYFADRKKTEEAICELLVGLNYICRYEHQQNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGCAGAGCCTTGGATTGTGAAAATGGGGAACCAGGTTAGTGCTAATCTCAAACACGCCCTCCTCGAATCTTCTAAGAACAAGAACTCTAGAAAACCAGAAATCGG
TGGCAATACTAAGCAAACAATCGGTATTCTCTCTTTTGAAGTTGCTAATGTAATGTCCAAAACGATTTACCTTCATAAATCCCTGTCTCATTCTGCCATTTCGAAGCTCA
AGAACGAGATCTTGAGCTCCGATGGAGTTAAAAACCTTATCTCCTCTGATGAGGTTCATCTTCTCGAGCTTGTGGTTGCTGAGAAGCTTGAGGATTTAAACCGAGTGGCT
AATGTGGTTTCTAGGCTTGGGAAGAAGTGTTCCCAACCTGCGTTACAAGGGTTTCAACATGTGTACTTGGATATCATTAATGGAGTTATAAATGTGAAGGAATTGGGGTT
TCTAGTGAAGGATATGGAGGGGATGATGAAAAAAATGGAGAGGTATGTGAATGCTACGGCAAATTTGTACACTGAAATGGAGGTTTTGAATGAATTGGAACAGGCTGCAA
AGAAGTTTCAGAACAATCAGCATGAAGAGAGCAAAAAGGCGTATGAACAGAAACTTATATGGCAGAAACAAGATGTGGGGCATCTCAAGGATATTTCTCTTTGGAACCAA
ACGTATGATAAGGTTGTTGAACTGCTGGCGAGAACTGTTTGTACAGTTTATGCGAGGATTCATTTAGTATTTGGGGACCCTTTTCTGAAGAAGGATGTGAATGAGAATGG
TTCAAGTAATGATGTGAATCATCATGTTCAAAATGGTGCAGAGTTGATGGAATCGAAACGTGCGTCTGCCGAGAGAGGCCCTGGACCAAGACGGGGGTTGAGTATTAAGT
CTCATATTAGCTCAAGAAGAGGTGAGGTTTCATTGCTTACAACTGATGACTTCAATTTCCCTTGTGGAACAAATCCTGGGAGGCTTCTCATGGATTGCCTTAGTTTAAGT
AGTTCAGTTTCCAAGTTGTATGATGAGGATGATGACAGTTATGTTGACCGTGACGATCGAAGCTGCCAAACCTCCGGTCGTAGCATTAGGAACAGCGGTTCCAGCCAATT
TAGTTCTTTCAGTCAGGTACAGTTCTCTATTCCATTTGGGGTGGATCAAAGACAAGCAAAAAGTGGGATGTCTAACAGTGGAGGAAACTTTGGCCTCAAGAGTAGGTTAT
CTATCTACGCACCGGTTTCAACCATCGGAGGCTCTGCTCTTGCTTTGCATTATGCAAATATCATCATTGTCATTGAAAAGTTACTTCGTTATCCTCATTTAGTTGGTGAC
GAAGCTCGAGACGATTTGTATCAGATGCTACCAACAAGTTTAAGATCATCCCTGAAAACACATTTGAAATCTTATGTTAAGAACCTGGCCATATATGATGCTCCTCTTGC
CCATGATTGGAAGGAGACTCTCGATGGGATCTTGAGCTGGCTCGCTCCGTTGGCACACAACATGATTCGGTGGCAAAGCGAGCGTAATTTCGAACAACATCAGATTGTGA
CTCGAACAAATGTATTGCTGATCCAAACTTTGTATTTTGCTGATCGCAAGAAGACTGAGGAAGCCATATGTGAGCTTCTTGTTGGATTGAATTACATTTGCCGCTACGAG
CATCAGCAAAATGCATTGCTGGATTGTGCAAGTAGTTTCGACTTTGAAGATTGTATGGAGTGGCAGTTGCAATGTAAAGCTTCCTTGAGCACACTGAAAGACATGATTGA
TGGGGACTTCAATTCGTCACTGTTTGCTGTTGAGAGGGAGCACCGTTCTTCTAGATCTGCTCTCTTTGATGACCTTGAGGAGGGTGGTCTCAGGACTAGCTCTTCCGTTG
AAATTAAAGAGCATGACAATGATAAAGCCCTCCATACTTTGGAAGATAGAGTTTCTATTCTCAAGAGGTTTTCATTTCCAGCAAAAAGGAAGTCTGATCCCCCTCCCGGG
ATTTGTCTTCTTGATCCAGCAGCTCTTGATGGAAATGGCATGGATGCGTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTCGAGCAAAAATCAAA
GTGGAGAACTTGTAGACTTGCGTTGTATTTTGTGTTGTCCTTTCTACTACTTTTCTATCTCATCAGAAAGTATGCATGCTATCCATGCTGTTTGGTCGAGGTCAATCGGT
TGGGCTTTCCGGCCACCGGCCCAATATCAACCACCGTTGACGGCAGTGGAAGAGATCATGACGACGACGAAGACGAGCCAAATGGAGGCGGCGTGGTGGTCGGAAACCGC
CGATCGAGAGGACGGCCACCGGGATCGAAAAACAAACCAAAATCTCCAATCATCGTCACGAGCGACAACCCTCACGCGCTTCGCACGCACGTGATCGAGATCATCGTAGG
AGCCGACGTCGCCGGCAGCATAAAGCAATTCTCCTGCCGCCGACAACGTGGGGTTTGTGTTCTCAGTGGCGGCGGTACGGTTGTGGACGTAACTCTCCGGCAATCCGCCG
GTTCTGCCGCCGTGATCCAACTTCACGGCCGCTTCGATATTCTATCTCTAAGTGGGTCTTTTCTTCCCGGTCGAGCCCCTCCCTGTTCGACCGGGCTGACCGTGTACTTG
TCGGGCGGTGAGGGGCAGGTGATTGGAGGGACGGTGGTGGGCCCATTGTTGGCAGCTGGGCCTATAATTTTGATAGCTGCTACTTTTGCTAGTGCTACTTATGAGAGATT
GCCTTTACAAGACGACCATAATAATCAAGAAAGAGAGATTTCTCCGGCCACCACGGGCGGCGGAGAAGTGGAGGAGCCGCCGCAGTTTACTCGGATGGCAACTTCAATTT
ACGACTTGATGTCACCAAATGATCATGATGGAGTTGATGGGTATGGTTGGGCTCATGAGAGACCATCTTTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGCAGAGCCTTGGATTGTGAAAATGGGGAACCAGGTTAGTGCTAATCTCAAACACGCCCTCCTCGAATCTTCTAAGAACAAGAACTCTAGAAAACCAGAAATCGG
TGGCAATACTAAGCAAACAATCGGTATTCTCTCTTTTGAAGTTGCTAATGTAATGTCCAAAACGATTTACCTTCATAAATCCCTGTCTCATTCTGCCATTTCGAAGCTCA
AGAACGAGATCTTGAGCTCCGATGGAGTTAAAAACCTTATCTCCTCTGATGAGGTTCATCTTCTCGAGCTTGTGGTTGCTGAGAAGCTTGAGGATTTAAACCGAGTGGCT
AATGTGGTTTCTAGGCTTGGGAAGAAGTGTTCCCAACCTGCGTTACAAGGGTTTCAACATGTGTACTTGGATATCATTAATGGAGTTATAAATGTGAAGGAATTGGGGTT
TCTAGTGAAGGATATGGAGGGGATGATGAAAAAAATGGAGAGGTATGTGAATGCTACGGCAAATTTGTACACTGAAATGGAGGTTTTGAATGAATTGGAACAGGCTGCAA
AGAAGTTTCAGAACAATCAGCATGAAGAGAGCAAAAAGGCGTATGAACAGAAACTTATATGGCAGAAACAAGATGTGGGGCATCTCAAGGATATTTCTCTTTGGAACCAA
ACGTATGATAAGGTTGTTGAACTGCTGGCGAGAACTGTTTGTACAGTTTATGCGAGGATTCATTTAGTATTTGGGGACCCTTTTCTGAAGAAGGATGTGAATGAGAATGG
TTCAAGTAATGATGTGAATCATCATGTTCAAAATGGTGCAGAGTTGATGGAATCGAAACGTGCGTCTGCCGAGAGAGGCCCTGGACCAAGACGGGGGTTGAGTATTAAGT
CTCATATTAGCTCAAGAAGAGGTGAGGTTTCATTGCTTACAACTGATGACTTCAATTTCCCTTGTGGAACAAATCCTGGGAGGCTTCTCATGGATTGCCTTAGTTTAAGT
AGTTCAGTTTCCAAGTTGTATGATGAGGATGATGACAGTTATGTTGACCGTGACGATCGAAGCTGCCAAACCTCCGGTCGTAGCATTAGGAACAGCGGTTCCAGCCAATT
TAGTTCTTTCAGTCAGGTACAGTTCTCTATTCCATTTGGGGTGGATCAAAGACAAGCAAAAAGTGGGATGTCTAACAGTGGAGGAAACTTTGGCCTCAAGAGTAGGTTAT
CTATCTACGCACCGGTTTCAACCATCGGAGGCTCTGCTCTTGCTTTGCATTATGCAAATATCATCATTGTCATTGAAAAGTTACTTCGTTATCCTCATTTAGTTGGTGAC
GAAGCTCGAGACGATTTGTATCAGATGCTACCAACAAGTTTAAGATCATCCCTGAAAACACATTTGAAATCTTATGTTAAGAACCTGGCCATATATGATGCTCCTCTTGC
CCATGATTGGAAGGAGACTCTCGATGGGATCTTGAGCTGGCTCGCTCCGTTGGCACACAACATGATTCGGTGGCAAAGCGAGCGTAATTTCGAACAACATCAGATTGTGA
CTCGAACAAATGTATTGCTGATCCAAACTTTGTATTTTGCTGATCGCAAGAAGACTGAGGAAGCCATATGTGAGCTTCTTGTTGGATTGAATTACATTTGCCGCTACGAG
CATCAGCAAAATGCATTGCTGGATTGTGCAAGTAGTTTCGACTTTGAAGATTGTATGGAGTGGCAGTTGCAATGTAAAGCTTCCTTGAGCACACTGAAAGACATGATTGA
TGGGGACTTCAATTCGTCACTGTTTGCTGTTGAGAGGGAGCACCGTTCTTCTAGATCTGCTCTCTTTGATGACCTTGAGGAGGGTGGTCTCAGGACTAGCTCTTCCGTTG
AAATTAAAGAGCATGACAATGATAAAGCCCTCCATACTTTGGAAGATAGAGTTTCTATTCTCAAGAGGTTTTCATTTCCAGCAAAAAGGAAGTCTGATCCCCCTCCCGGG
ATTTGTCTTCTTGATCCAGCAGCTCTTGATGGAAATGGCATGGATGCGTCAAGGGGTATAATGTCAAGAACCATGGATCGATTCAAGATGGTTTTCGAGCAAAAATCAAA
GTGGAGAACTTGTAGACTTGCGTTGTATTTTGTGTTGTCCTTTCTACTACTTTTCTATCTCATCAGAAAGTATGCATGCTATCCATGCTGTTTGGTCGAGGTCAATCGGT
TGGGCTTTCCGGCCACCGGCCCAATATCAACCACCGTTGACGGCAGTGGAAGAGATCATGACGACGACGAAGACGAGCCAAATGGAGGCGGCGTGGTGGTCGGAAACCGC
CGATCGAGAGGACGGCCACCGGGATCGAAAAACAAACCAAAATCTCCAATCATCGTCACGAGCGACAACCCTCACGCGCTTCGCACGCACGTGATCGAGATCATCGTAGG
AGCCGACGTCGCCGGCAGCATAAAGCAATTCTCCTGCCGCCGACAACGTGGGGTTTGTGTTCTCAGTGGCGGCGGTACGGTTGTGGACGTAACTCTCCGGCAATCCGCCG
GTTCTGCCGCCGTGATCCAACTTCACGGCCGCTTCGATATTCTATCTCTAAGTGGGTCTTTTCTTCCCGGTCGAGCCCCTCCCTGTTCGACCGGGCTGACCGTGTACTTG
TCGGGCGGTGAGGGGCAGGTGATTGGAGGGACGGTGGTGGGCCCATTGTTGGCAGCTGGGCCTATAATTTTGATAGCTGCTACTTTTGCTAGTGCTACTTATGAGAGATT
GCCTTTACAAGACGACCATAATAATCAAGAAAGAGAGATTTCTCCGGCCACCACGGGCGGCGGAGAAGTGGAGGAGCCGCCGCAGTTTACTCGGATGGCAACTTCAATTT
ACGACTTGATGTCACCAAATGATCATGATGGAGTTGATGGGTATGGTTGGGCTCATGAGAGACCATCTTTTGTTTAA
Protein sequenceShow/hide protein sequence
MVAEPWIVKMGNQVSANLKHALLESSKNKNSRKPEIGGNTKQTIGILSFEVANVMSKTIYLHKSLSHSAISKLKNEILSSDGVKNLISSDEVHLLELVVAEKLEDLNRVA
NVVSRLGKKCSQPALQGFQHVYLDIINGVINVKELGFLVKDMEGMMKKMERYVNATANLYTEMEVLNELEQAAKKFQNNQHEESKKAYEQKLIWQKQDVGHLKDISLWNQ
TYDKVVELLARTVCTVYARIHLVFGDPFLKKDVNENGSSNDVNHHVQNGAELMESKRASAERGPGPRRGLSIKSHISSRRGEVSLLTTDDFNFPCGTNPGRLLMDCLSLS
SSVSKLYDEDDDSYVDRDDRSCQTSGRSIRNSGSSQFSSFSQVQFSIPFGVDQRQAKSGMSNSGGNFGLKSRLSIYAPVSTIGGSALALHYANIIIVIEKLLRYPHLVGD
EARDDLYQMLPTSLRSSLKTHLKSYVKNLAIYDAPLAHDWKETLDGILSWLAPLAHNMIRWQSERNFEQHQIVTRTNVLLIQTLYFADRKKTEEAICELLVGLNYICRYE
HQQNALLDCASSFDFEDCMEWQLQCKASLSTLKDMIDGDFNSSLFAVEREHRSSRSALFDDLEEGGLRTSSSVEIKEHDNDKALHTLEDRVSILKRFSFPAKRKSDPPPG
ICLLDPAALDGNGMDASRGIMSRTMDRFKMVFEQKSKWRTCRLALYFVLSFLLLFYLIRKYACYPCCLVEVNRLGFPATGPISTTVDGSGRDHDDDEDEPNGGGVVVGNR
RSRGRPPGSKNKPKSPIIVTSDNPHALRTHVIEIIVGADVAGSIKQFSCRRQRGVCVLSGGGTVVDVTLRQSAGSAAVIQLHGRFDILSLSGSFLPGRAPPCSTGLTVYL
SGGEGQVIGGTVVGPLLAAGPIILIAATFASATYERLPLQDDHNNQEREISPATTGGGEVEEPPQFTRMATSIYDLMSPNDHDGVDGYGWAHERPSFV