; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G09370 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G09370
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr07:23778803..23779592
RNA-Seq ExpressionClc07G09370
SyntenyClc07G09370
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida]4.6e-5460.82Show/hide
Query:  RFDLDTCSTWSDYCIRRC--------------------LVSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG
        RFD DT +T S++C+ +C                     +SE+L QSGF WN EFKCVQVEREIF+LWV SHP+ K MWNKPFPHYDDLST  D      
Subjt:  RFDLDTCSTWSDYCIRRC--------------------LVSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG

Query:  QSIRFTIKSGWNDEETTEQSTGRATL-IESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNED
          I   +     DEE TEQSTGR ++ +ESSRGSKRKR SFQ EMI+IMRSTVEM +THMGRLASWQK+KYELEF  +KEVVNAIY+IDGL+ED
Subjt:  QSIRFTIKSGWNDEETTEQSTGRATL-IESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNED

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]4.0e-5069.57Show/hide
Query:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS--IRFT---IKSGWNDEETTEQSTGRATL-IESSRGS
        VSE+L QSGF WN EFKCVQVEREIFDLWV SHP+ K MW KPFPHYDDLS VF KDRA   +  +R T   +     DEE  EQSTGRA++  ESSRGS
Subjt:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS--IRFT---IKSGWNDEETTEQSTGRATL-IESSRGS

Query:  KRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
        KRKR SFQ EMI+I++STVEMQ+THMGRLASWQ EKYELE    KEVVNAIY+ID L E+D
Subjt:  KRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]1.1e-3656.77Show/hide
Query:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIESSRGSKRKRPS
        VSE+L QSGF WN EFKCVQVEREIFD WV SHP+ K MWNKPFPHYDDLSTVF K +A+GQS           E+    +T                  
Subjt:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIESSRGSKRKRPS

Query:  FQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
        F+ E+    +     ++THMGRLASWQKEKYELEF  RKEVVNAIY+IDGL+EDD
Subjt:  FQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]1.1e-5574.53Show/hide
Query:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS-----IRFTIKSGWNDEETTEQSTGRAT-LIESSRGS
        VSE+L QSGFGWN EFKCVQVE+EIFDLWV SH + K MWNK F HYDDLSTVF KDRA   +         +     DEE  EQSTGRA+ L ESSRGS
Subjt:  VSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS-----IRFTIKSGWNDEETTEQSTGRAT-LIESSRGS

Query:  KRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
        KRKRPSFQAEMI+IMRSTVEMQ+THMGRLASWQKEKYELEF  RKEVVNAIYSIDGL+EDD
Subjt:  KRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida]2.3e-5062.37Show/hide
Query:  ILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS------------------------------IRFT---IKS
        +L QSGFGWN EFKCVQVE+EIF+    SHP+ K MWNK FPHYDDLSTVF KDRA+GQS                              +R T   +  
Subjt:  ILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQS------------------------------IRFT---IKS

Query:  GWNDEETTEQSTGRATL-IESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
           DEE  EQSTGRA++ +E+S+GSKRKRPSFQAEMI+IMRSTVEMQ+THMGRLASWQKEKYELEF   KEVVNAIYSIDGL+EDD
Subjt:  GWNDEETTEQSTGRATL-IESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859538.8e-1935.75Show/hide
Query:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG------QSIRFTIKSGWND----------EETTEQSTG--------
        SGFGWN EF+C+  ER++FD W+ SHP+ K + +K FP+YDDLS VF KDRA G       ++   + + +ND          +  T  S G        
Subjt:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG------QSIRFTIKSGWND----------EETTEQSTG--------

Query:  ---RATLIESSRG----SKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
           RA      R     SKRKR S + E + ++RS +E  N  +  +A W KEK  +E   R +VV  +  I  L   D
Subjt:  ---RATLIESSRG----SKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

A0A5A7U0H7 Retrotransposon protein8.8e-1935.75Show/hide
Query:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG------QSIRFTIKSGWND----------EETTEQSTG--------
        SGFGWN EF+C+  ER++FD W+ SHP+ K + +K FP+YDDLS VF KDRA G       ++   + + +ND          +  T  S G        
Subjt:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIG------QSIRFTIKSGWND----------EETTEQSTG--------

Query:  ---RATLIESSRG----SKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
           RA      R     SKRKR S + E + ++RS +E  N  +  +A W KEK  +E   R +VV  +  I  L   D
Subjt:  ---RATLIESSRG----SKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

A0A5D3C7T4 Uncharacterized protein2.6e-1830.11Show/hide
Query:  VSEILGQ--SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIE---------
        ++E++G   SGFGWN   KC++VE+ +FD WV  HP+ + + NKPFP++ DL  VF +DRA G   +  ++        TE+      L +         
Subjt:  VSEILGQ--SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIE---------

Query:  --------------------SSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
                            SSR SK++R S+  ++++  R+++   +  +G++A+WQ+EK E+E S  K +   + +I G++ DD
Subjt:  --------------------SSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

A0A5D3CH30 Retrotransposon protein7.4e-1840.28Show/hide
Query:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIESSRGSKRKRPSFQAEMIN
        SGFGWN E KC+  E+E+FD   WSHP+VK + NK F HYD+LS VF KDRA G         G ND   T + + R  +   S GSKRKR     +  +
Subjt:  SGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRATLIESSRGSKRKRPSFQAEMIN

Query:  IMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGL
        I+R+ +E  N  + R+A W   + +     R+E+V  + +I  L
Subjt:  IMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGL

A0A6J1DW73 uncharacterized protein LOC1110250183.2e-2133.71Show/hide
Query:  GFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSI-------------------------------RFTIKSGWNDEET
        GFGWN + KC++ E+E+FD WV SHP+ K + NKP PHYDDL+  F KDRA G ++                                F       +E+ 
Subjt:  GFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSI-------------------------------RFTIKSGWNDEET

Query:  TEQSTGRATLIESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD
            T + T+  SS GSKRKR  + +EM++++R+ + MQ  H+ ++A+W  +K E + + RK V + +  I  L  +D
Subjt:  TEQSTGRATLIESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATTAATCCTTCACGTTTCGACTTGGATACTTGCAGCACTTGGAGTGACTACTGCATAAGAAGGTGTTTGGTATCAGAGATATTAGGTCAGTCGGGATTCGGCTG
GAACGTGGAGTTCAAATGTGTCCAGGTTGAGAGGGAGATTTTCGATCTTTGGGTTTGGAGTCATCCCAGTGTGAAGAGGATGTGGAACAAACCGTTTCCCCATTACGATG
ACCTCTCCACCGTCTTTGATAAAGATAGAGCTATCGGACAATCAATCAGATTCACCATTAAATCCGGATGGAATGATGAAGAGACAACAGAGCAATCTACAGGTAGAGCG
ACACTTATCGAGTCATCTCGAGGAAGCAAGAGGAAGAGGCCATCATTCCAAGCTGAAATGATCAACATCATGAGATCGACTGTTGAGATGCAGAACACGCACATGGGTAG
ACTTGCATCGTGGCAGAAGGAGAAGTATGAGCTAGAGTTCAGTTGTCGGAAAGAAGTAGTAAACGCCATATACAGCATTGACGGCTTGAATGAGGATGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCATTAATCCTTCACGTTTCGACTTGGATACTTGCAGCACTTGGAGTGACTACTGCATAAGAAGGTGTTTGGTATCAGAGATATTAGGTCAGTCGGGATTCGGCTG
GAACGTGGAGTTCAAATGTGTCCAGGTTGAGAGGGAGATTTTCGATCTTTGGGTTTGGAGTCATCCCAGTGTGAAGAGGATGTGGAACAAACCGTTTCCCCATTACGATG
ACCTCTCCACCGTCTTTGATAAAGATAGAGCTATCGGACAATCAATCAGATTCACCATTAAATCCGGATGGAATGATGAAGAGACAACAGAGCAATCTACAGGTAGAGCG
ACACTTATCGAGTCATCTCGAGGAAGCAAGAGGAAGAGGCCATCATTCCAAGCTGAAATGATCAACATCATGAGATCGACTGTTGAGATGCAGAACACGCACATGGGTAG
ACTTGCATCGTGGCAGAAGGAGAAGTATGAGCTAGAGTTCAGTTGTCGGAAAGAAGTAGTAAACGCCATATACAGCATTGACGGCTTGAATGAGGATGACTAG
Protein sequenceShow/hide protein sequence
MLINPSRFDLDTCSTWSDYCIRRCLVSEILGQSGFGWNVEFKCVQVEREIFDLWVWSHPSVKRMWNKPFPHYDDLSTVFDKDRAIGQSIRFTIKSGWNDEETTEQSTGRA
TLIESSRGSKRKRPSFQAEMINIMRSTVEMQNTHMGRLASWQKEKYELEFSCRKEVVNAIYSIDGLNEDD