; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G05620 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G05620
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAT-hook motif nuclear-localized protein 20-like
Genome locationClcChr05:4010986..4011676
RNA-Seq ExpressionClc05G05620
SyntenyClc05G05620
Gene Ontology termsNA
InterPro domainsIPR040381 - Uncharacterized protein At4g14450-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049279.1 hypothetical protein E6C27_scaffold171G005190 [Cucumis melo var. makuwa]4.1e-3377.32Show/hide
Query:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        M DAERK+   APTRLQSQAPASIEIKR  +WN+ IPLLSPLVSPSSCGN G E+ L MA+N AREE KGL+FTKWQHPAAPFYY PVPRA  FVPV
Subjt:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

KAE8650334.1 hypothetical protein Csa_009641 [Cucumis sativus]3.7e-3480.61Show/hide
Query:  MGDAERKSQCQ-APTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        M DAERK+    APTRLQSQAPASIEIKRA NWN+AIPLLSPLVSPSSCGN   E+ L MAEN AREE KGL+FTKWQHPAAPFYY PVPRA PFVPV
Subjt:  MGDAERKSQCQ-APTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

KAG6582368.1 AT-hook motif nuclear-localized protein 20, partial [Cucurbita argyrosperma subsp. sororia]5.5e-2264.95Show/hide
Query:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        M  +ER++    PTRLQSQAPASI I RA NWN+AIPLL+PLVS S CGN  Q + LLM ENKAREE      TKWQHPA P Y GP+P  TPFVPV
Subjt:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

XP_022143991.1 uncharacterized protein At4g14450, chloroplastic-like [Momordica charantia]2.1e-2160Show/hide
Query:  MGDAERK----SQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEA----KGLSFTKWQHPAAPFYYGPVPRAT
        MGDAER+    S  Q  TRLQ +AP+SI+I R  +WN+AIPLLSPLVSP       ++  +LM ENKAREEA    K  +FT+W+HPAAPFYYGPV R T
Subjt:  MGDAERK----SQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEA----KGLSFTKWQHPAAPFYYGPVPRAT

Query:  PFVPV
        PFVPV
Subjt:  PFVPV

XP_022924297.1 AT-hook motif nuclear-localized protein 20-like [Cucurbita moschata]1.4e-2066.67Show/hide
Query:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGP
        M  +ER++    PTRLQSQAPASI I RA NWN+AIPLL+PLVS S CGN  Q + LLM ENKAREE KGL+ TKWQHPA PF   P
Subjt:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGP

TrEMBL top hitse value%identityAlignment
A0A0A0L8D3 Uncharacterized protein1.8e-3480.61Show/hide
Query:  MGDAERKSQCQ-APTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        M DAERK+    APTRLQSQAPASIEIKRA NWN+AIPLLSPLVSPSSCGN   E+ L MAEN AREE KGL+FTKWQHPAAPFYY PVPRA PFVPV
Subjt:  MGDAERKSQCQ-APTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

A0A2P5A713 Uncharacterized protein6.0e-1453.19Show/hide
Query:  PTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEAL-LMAENKAREEA--------KGLSFTKWQHPAAPFYYGPVPRATPFVPV
        P+RLQ +APAS++I    +WN+AIPLLSPL SPSS       +AL   AENK+RE+         K + F KWQHPAAPF Y P P   PFVPV
Subjt:  PTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEAL-LMAENKAREEA--------KGLSFTKWQHPAAPFYYGPVPRATPFVPV

A0A5D3D1A6 Uncharacterized protein2.0e-3377.32Show/hide
Query:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        M DAERK+   APTRLQSQAPASIEIKR  +WN+ IPLLSPLVSPSSCGN G E+ L MA+N AREE KGL+FTKWQHPAAPFYY PVPRA  FVPV
Subjt:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

A0A6J1CRZ0 uncharacterized protein At4g14450, chloroplastic-like1.0e-2160Show/hide
Query:  MGDAERK----SQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEA----KGLSFTKWQHPAAPFYYGPVPRAT
        MGDAER+    S  Q  TRLQ +AP+SI+I R  +WN+AIPLLSPLVSP       ++  +LM ENKAREEA    K  +FT+W+HPAAPFYYGPV R T
Subjt:  MGDAERK----SQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEA----KGLSFTKWQHPAAPFYYGPVPRAT

Query:  PFVPV
        PFVPV
Subjt:  PFVPV

A0A6J1E8H9 AT-hook motif nuclear-localized protein 20-like6.6e-2166.67Show/hide
Query:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGP
        M  +ER++    PTRLQSQAPASI I RA NWN+AIPLL+PLVS S CGN  Q + LLM ENKAREE KGL+ TKWQHPA PF   P
Subjt:  MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGP

SwissProt top hitse value%identityAlignment
Q6NN02 Uncharacterized protein At4g14450, chloroplastic2.0e-0639.13Show/hide
Query:  TRLQSQAPA-SIEIKRAPNWNMAIPLLSPLVSPSSCGN-------PGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        ++LQ +AP+  I+     NWN+AIPLLSPL +PS   +       P Q +  +  E + +   K   F KWQHPA+PF Y P     PF+ V
Subjt:  TRLQSQAPA-SIEIKRAPNWNMAIPLLSPLVSPSSCGN-------PGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

Arabidopsis top hitse value%identityAlignment
AT1G04330.1 unknown protein1.8e-1043.3Show/hide
Query:  GDAERKSQCQAPTRLQSQAPASIEIKRA-PNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENK--AREEAKGLSFTKWQHPAAPFYYGPVPRAT-PF
        G   RKS     +RLQ +AP  ++I     NW +AIPLLSP  SP     P +  A++  E +   +E  K   F KWQHPAAPFYY P P +  PF
Subjt:  GDAERKSQCQAPTRLQSQAPASIEIKRA-PNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENK--AREEAKGLSFTKWQHPAAPFYYGPVPRAT-PF

AT3G23170.1 unknown protein5.2e-1043Show/hide
Query:  GDAERKSQCQAPTRLQSQAPASIEIKRAP---NWNMAIPLLSPL-VSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        GD  R+     P+RL  + PA   +   P   NWN AIPLLSPL +SP S  +P  +  +   ++ A    K   F KWQHPAAPFYY       PFVPV
Subjt:  GDAERKSQCQAPTRLQSQAPASIEIKRAP---NWNMAIPLLSPL-VSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV

AT4G14450.1 unknown protein1.4e-0739.13Show/hide
Query:  TRLQSQAPA-SIEIKRAPNWNMAIPLLSPLVSPSSCGN-------PGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV
        ++LQ +AP+  I+     NWN+AIPLLSPL +PS   +       P Q +  +  E + +   K   F KWQHPA+PF Y P     PF+ V
Subjt:  TRLQSQAPA-SIEIKRAPNWNMAIPLLSPLVSPSSCGN-------PGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGATGCAGAGAGAAAAAGCCAGTGCCAAGCGCCGACACGGCTGCAGAGCCAGGCTCCGGCATCGATTGAGATAAAGCGGGCGCCGAATTGGAACATGGCCATACC
CTTGTTATCCCCTCTTGTATCACCTTCGTCTTGTGGGAATCCAGGGCAAGAGGAAGCCTTGTTGATGGCTGAGAATAAAGCGAGGGAGGAAGCCAAAGGGCTAAGCTTTA
CCAAATGGCAGCACCCTGCGGCTCCATTTTATTATGGGCCAGTCCCAAGGGCCACCCCCTTTGTGCCCGTGTGA
mRNA sequenceShow/hide mRNA sequence
AGAAGAGGGAAGCAGTCAATGTCATTGCCCAACACAGCATTGGAATTTTGACCACTGACCCTTTTAACCCTTAAATCTCCCGACCCGAATCAGAGGAATCTCACTCGTCT
CTCAACTTCTTCAATTTTTGTGTTTAACCCACTCCGATCTGCACAGAGAAAATGGGCGATGCAGAGAGAAAAAGCCAGTGCCAAGCGCCGACACGGCTGCAGAGCCAGGC
TCCGGCATCGATTGAGATAAAGCGGGCGCCGAATTGGAACATGGCCATACCCTTGTTATCCCCTCTTGTATCACCTTCGTCTTGTGGGAATCCAGGGCAAGAGGAAGCCT
TGTTGATGGCTGAGAATAAAGCGAGGGAGGAAGCCAAAGGGCTAAGCTTTACCAAATGGCAGCACCCTGCGGCTCCATTTTATTATGGGCCAGTCCCAAGGGCCACCCCC
TTTGTGCCCGTGTGAATATATATATATATATATTTTTTCTCATCTTTTAATTGTTCTTCTGCTTTATCCATGAAACCATCTGTATCTATAAAGCTGCTTTGGATTAATTT
TTCCATTTCGTAAATGCTTTTTATGTTTATTTACGCTCATTTTCCAGAGAAGAGGTCAAATGGGGTTTCTGATGAACAAACATTTTGTTAATACAGTACGGAAAGGGCAT
TTTTCATTTGAGGGAATTCGTTATCTCTTTC
Protein sequenceShow/hide protein sequence
MGDAERKSQCQAPTRLQSQAPASIEIKRAPNWNMAIPLLSPLVSPSSCGNPGQEEALLMAENKAREEAKGLSFTKWQHPAAPFYYGPVPRATPFVPV