CuGenDBv2

Clc08G05620 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G05620
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotran_gag_3 domain-containing protein
Genome location: ClcChr08:16848792..16849517
RNA-Seq Expression: Clc08G05620
Synteny: Clc08G05620
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022141216.1 uncharacterized protein LOC111011669 [Momordica charantia]; e-value: 8.0e-18; % identity: 56.36
Query:  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--M
        MAFLMGLNES  QV  QLLLMEPE TI RAFSL AQEV+QR+ L+  S A   +I A  L  TS+  N++    S+Q ++KER  CTHCHL GHT+D   
Subjt:  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--M

Query:  RVHGH-PGYR
        ++HG+ PG+R
Subjt:  RVHGH-PGYR

XP_022148562.1 uncharacterized protein LOC111017196 [Momordica charantia]; e-value: 1.1e-22; % identity: 36.86
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------
        M + L  KNK G VD S+ R +DE  N WII NNVV AWILNSLSKEISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------

Query:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL---ESPATINATVLMAT
                                         T YVM FLMGLN+S +Q+   LLLM P PTI  AF L AQEV QR +  +    S A+  A  +  T
Subjt:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL---ESPATINATVLMAT

Query:  --------SSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
                S+   SS RSLSNQ K+KE+ +CTHC LL HT+D   ++HG+ PGYR
Subjt:  --------SSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]; e-value: 1.8e-22; % identity: 35.51
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST------------------------------------------------
        +++ L  KNK+GFVD S+ R  D   + WII NNVV +WI NSLSK+ISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST------------------------------------------------

Query:  ----------------------------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSG
                                           YVMAFLMGLN S +Q+  QLLLMEP PTI RAF+L AQE+ QRS +SL S  +  A+ + ATS+ 
Subjt:  ----------------------------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSG

Query:  FNSSPRSLS-NQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
         NS   S S + +K+K++ +CTHC + GHT+D   ++H + PGYR
Subjt:  FNSSPRSLS-NQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

XP_038904477.1 uncharacterized protein LOC120090845 [Benincasa hispida]; e-value: 6.3e-23; % identity: 34.68
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------
        M + L  KNKLGF++  + R + EL + WII N +VT WILNSLSKEISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------

Query:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL--ESPATINATVLMATS
                                         T YV+AFLMGLN+S A + +QLLLMEP+PTI RAFSL AQE+DQ++  S    + ++ NAT L+   
Subjt:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL--ESPATINATVLMATS

Query:  SGFNSSPRSL------SNQLKKKERLVCTHCHLLGHTID--MRVHGHP
        +G ++S +++      +NQ KKK+R +CTHC + GHT+D   ++HG+P
Subjt:  SGFNSSPRSL------SNQLKKKERLVCTHCHLLGHTID--MRVHGHP

XP_038905564.1 uncharacterized protein LOC120091546 [Benincasa hispida]; e-value: 4.4e-24; % identity: 36.73
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------
        M + L  KNKLGF+D S+     E+   WI+ N+VVT WILNSLSKEISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------

Query:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLS----LESPATINATVLMA
                                         T +VM FLMGLNES +Q+HTQLLLME EP+I +AFS   QEV+QR++ S    + + +T NA +L+ 
Subjt:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLS----LESPATINATVLMA

Query:  -TSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTIDM--RVHGHP
         TSS  NS+  SLSN  KKK+RL  THC++ GHT+D   +VH +P
Subjt:  -TSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTIDM--RVHGHP

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9H1Z3 Integrase catalytic domain-containing protein; e-value: 1.3e-18; % identity: 35.75
Query:  MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY----------------------------VMAFLMGLNESCAQV
        MI+ L  KNK+GF++ ++   NDE     NLW   N +V +WILNS+SK+I++S  Y                            VM FLMGLN+S A V
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY----------------------------VMAFLMGLNESCAQV

Query:  HTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
          Q+L+MEP P + + FSL  QE  QR  + + S  T   ++ + T S           Q  KK+R +C+HC + GH +D   ++HG  PGY+
Subjt:  HTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

A0A2N9HCD3 Integrase catalytic domain-containing protein; e-value: 1.7e-18; % identity: 35.98
Query:  MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY------------------------VMAFLMGLNESCAQVHTQL
        MI+ L  KNK+GF++ ++   NDE     NLW   N +V +WILNS+SK+I++S  Y                        VM FLMGLN+S A V  Q+
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY------------------------VMAFLMGLNESCAQVHTQL

Query:  LLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
        L+MEP P + + FSL  QE  QR  + + S      ++ + T S           Q  KK+R +C+HC + GH +D   ++HG  PGY+
Subjt:  LLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

A0A6J1CIG1 uncharacterized protein LOC111011669; e-value: 3.9e-18; % identity: 56.36
Query:  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--M
        MAFLMGLNES  QV  QLLLMEPE TI RAFSL AQEV+QR+ L+  S A   +I A  L  TS+  N++    S+Q ++KER  CTHCHL GHT+D   
Subjt:  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--M

Query:  RVHGH-PGYR
        ++HG+ PG+R
Subjt:  RVHGH-PGYR

A0A6J1D5E3 uncharacterized protein LOC111017196; e-value: 5.2e-23; % identity: 36.86
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------
        M + L  KNK G VD S+ R +DE  N WII NNVV AWILNSLSKEISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS-------------------------------------------------

Query:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL---ESPATINATVLMAT
                                         T YVM FLMGLN+S +Q+   LLLM P PTI  AF L AQEV QR +  +    S A+  A  +  T
Subjt:  ---------------------------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSL---ESPATINATVLMAT

Query:  --------SSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
                S+   SS RSLSNQ K+KE+ +CTHC LL HT+D   ++HG+ PGYR
Subjt:  --------SSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

A0A6J1DNP7 uncharacterized protein LOC111022065; e-value: 8.9e-23; % identity: 35.51
Query:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST------------------------------------------------
        +++ L  KNK+GFVD S+ R  D   + WII NNVV +WI NSLSK+ISAS                                                 
Subjt:  MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST------------------------------------------------

Query:  ----------------------------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSG
                                           YVMAFLMGLN S +Q+  QLLLMEP PTI RAF+L AQE+ QRS +SL S  +  A+ + ATS+ 
Subjt:  ----------------------------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSG

Query:  FNSSPRSLS-NQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR
         NS   S S + +K+K++ +CTHC + GHT+D   ++H + PGYR
Subjt:  FNSSPRSLS-NQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTCTTGACCTCGCTGGGAAAAATAAGCTAGGGTTTGTGGATGACAGTTTGAAGCGACTGAATGATGAATTGAAGAATTTATGGATTATCTATAATAATGTAGTCAC
TGCCTGGATCTTGAATTCCTTGTCCAAAGAAATTTCTGCCAGTACTGCGTATGTCATGGCTTTCTTGATGGGTTTAAATGAATCTTGTGCCCAGGTTCATACTCAATTGC
TTTTAATGGAGCCTGAACCTACTATTCAACGAGCTTTTTCTCTTGATGCTCAAGAAGTTGACCAGCGATCTTTGCTTTCTTTGGAGAGTCCTGCAACGATTAATGCTACT
GTCTTGATGGCTACGTCTTCTGGATTCAACTCATCTCCTCGTTCTTTGTCGAATCAATTGAAAAAGAAAGAACGTCTGGTTTGCACTCATTGTCATCTTCTTGGCCATAC
TATTGACATGAGGGTTCATGGACATCCTGGATATCGATAA
mRNA sequence
ATGATTCTTGACCTCGCTGGGAAAAATAAGCTAGGGTTTGTGGATGACAGTTTGAAGCGACTGAATGATGAATTGAAGAATTTATGGATTATCTATAATAATGTAGTCAC
TGCCTGGATCTTGAATTCCTTGTCCAAAGAAATTTCTGCCAGTACTGCGTATGTCATGGCTTTCTTGATGGGTTTAAATGAATCTTGTGCCCAGGTTCATACTCAATTGC
TTTTAATGGAGCCTGAACCTACTATTCAACGAGCTTTTTCTCTTGATGCTCAAGAAGTTGACCAGCGATCTTTGCTTTCTTTGGAGAGTCCTGCAACGATTAATGCTACT
GTCTTGATGGCTACGTCTTCTGGATTCAACTCATCTCCTCGTTCTTTGTCGAATCAATTGAAAAAGAAAGAACGTCTGGTTTGCACTCATTGTCATCTTCTTGGCCATAC
TATTGACATGAGGGTTCATGGACATCCTGGATATCGATAA
Protein sequence
MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISASTAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINAT
VLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTIDMRVHGHPGYR