; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G17580 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G17580
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr05:26396531..26399410
RNA-Seq ExpressionClc05G17580
SyntenyClc05G17580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031678.1 retrotransposon protein [Cucumis melo var. makuwa]7.1e-2138.92Show/hide
Query:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG
        SHP     L+KSF Y+D L+Y+FG   A GA S+   D  S+V +  NDG                       +E+ G    Q ++ R+ S GSKRK   
Subjt:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG

Query:  YHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         H E+VEV  +A +F NDQ+K+I                E V QLQDI +L +Q   KLM I+F SV+  + FLSIP  LKLEYC
Subjt:  YHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.2e-1534.06Show/hide
Query:  RERKNEEEDDERKRDDRDSEGGKSCAAELLKNLFEASHP--------TVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMN--------
        RE  N E  D+   D+ DS    +   E+  N  EAS+            + L+KSF Y+D L Y+FGK  A  ARS+   D+ S+V +  N        
Subjt:  RERKNEEEDDERKRDDRDSEGGKSCAAELLKNLFEASHP--------TVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMN--------

Query:  ---------------DGNEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQS
                         +EM G    Q ++ ++ S GSKRK    H E VEV ++A++F NDQ+K+I                E + QLQDI EL ++  
Subjt:  ---------------DGNEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQS

Query:  VKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         KL+ I+F SV+  + FLSIP   KLEYC
Subjt:  VKLMNIVFHSVQDTKSFLSIPMPLKLEYC

KAA0042340.1 retrotransposon protein [Cucumis melo var. makuwa]1.5e-2340Show/hide
Query:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG
        SHPTV   L KSF Y+D ++Y+FGK  A  ARS+   D+ S++ +  NDG                       +EM GT   Q ++ R+ S  SKRK   
Subjt:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG

Query:  YHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         H E VEV R+AM F NDQ+K+I ++               V QLQDI +L N+   K++ I+FHSV+  + FLSIP  LKLEYC
Subjt:  YHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]7.8e-2036.6Show/hide
Query:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS
        ++LF++   SHP     L+KSF Y+D L+Y+FGK  A GARS+   ++ S+V +  ND                        +EM G    Q ++ R+ S
Subjt:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS

Query:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
          SKRK      E VEV R+ M+F N+Q+K+I ++               V QLQDI +L +Q   KLM I+F S++  + FLSIP  LKLEYC
Subjt:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

XP_038899283.1 uncharacterized protein LOC120086622 [Benincasa hispida]1.1e-3449.14Show/hide
Query:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-------------NEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFR
        SHP     L KSFSY D LAY+FGK WA G  ++ P D+ SSV  CM++G             +E HGTPT    +  ++ RGSKRKH GYHSE+V+VF+
Subjt:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-------------NEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFR

Query:  NAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
        NAMDFANDQ+KSI E+               V++L DI +L  QQ +K MNI+F +V +T+SFLSIP  +KLEYC
Subjt:  NAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859533.8e-2036.6Show/hide
Query:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS
        ++LF++   SHP     L+KSF Y+D L+Y+FGK  A GARS+   ++ S+V +  ND                        +EM G    Q ++ R+ S
Subjt:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS

Query:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
          SKRK      E VEV R+ M+F N+Q+K+I ++               V QLQDI +L +Q   KLM I+F S++  + FLSIP  LKLEYC
Subjt:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

A0A5A7SQU2 Putative nuclease HARBI15.7e-1634.06Show/hide
Query:  RERKNEEEDDERKRDDRDSEGGKSCAAELLKNLFEASHP--------TVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMN--------
        RE  N E  D+   D+ DS    +   E+  N  EAS+            + L+KSF Y+D L Y+FGK  A  ARS+   D+ S+V +  N        
Subjt:  RERKNEEEDDERKRDDRDSEGGKSCAAELLKNLFEASHP--------TVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMN--------

Query:  ---------------DGNEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQS
                         +EM G    Q ++ ++ S GSKRK    H E VEV ++A++F NDQ+K+I                E + QLQDI EL ++  
Subjt:  ---------------DGNEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQS

Query:  VKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         KL+ I+F SV+  + FLSIP   KLEYC
Subjt:  VKLMNIVFHSVQDTKSFLSIPMPLKLEYC

A0A5A7U0H7 Retrotransposon protein3.8e-2036.6Show/hide
Query:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS
        ++LF++   SHP     L+KSF Y+D L+Y+FGK  A GARS+   ++ S+V +  ND                        +EM G    Q ++ R+ S
Subjt:  KNLFEA---SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMND-----------------------GNEMHGTPTSQNTQERDAS

Query:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
          SKRK      E VEV R+ M+F N+Q+K+I ++               V QLQDI +L +Q   KLM I+F S++  + FLSIP  LKLEYC
Subjt:  RGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

A0A5D3BZU3 Retrotransposon protein3.4e-2138.92Show/hide
Query:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG
        SHP     L+KSF Y+D L+Y+FG   A GA S+   D  S+V +  NDG                       +E+ G    Q ++ R+ S GSKRK   
Subjt:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG

Query:  YHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         H E+VEV  +A +F NDQ+K+I                E V QLQDI +L +Q   KLM I+F SV+  + FLSIP  LKLEYC
Subjt:  YHSEVVEVFRNAMDFANDQVKSI---------------VEFVSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

A0A5D3CVB7 Retrotransposon protein7.4e-2440Show/hide
Query:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG
        SHPTV   L KSF Y+D ++Y+FGK  A  ARS+   D+ S++ +  NDG                       +EM GT   Q ++ R+ S  SKRK   
Subjt:  SHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKAPRDIKSSVLDCMNDG-----------------------NEMHGTPTSQNTQERDASRGSKRKHCG

Query:  YHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC
         H E VEV R+AM F NDQ+K+I ++               V QLQDI +L N+   K++ I+FHSV+  + FLSIP  LKLEYC
Subjt:  YHSEVVEVFRNAMDFANDQVKSIVEF---------------VSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLEYC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACATTCCGCCATTGCAGCTCAATCGCCTGGCTGCGATTATGAAACCCAAACTACGGCACGAATGGCTAGATGACGATCGGAGATGGCGGGCATCAGCGGTGGGAGG
TTCTGCGAGAGAAAGGAAGAACGAAGAAGAAGATGATGAGCGAAAGAGAGACGATCGAGACTCGGAGGGAGGGAAAAGCTGTGCGGCTGAACTTTTAAAGAACCTGTTTG
AAGCCAGTCATCCTACAGTAAACGAGTTTCTATACAAGTCATTCTCATACCATGATAAATTGGCTTACATCTTCGGCAAGGGTTGGGCTATAGGAGCAAGGTCAAAGGCC
CCTAGGGACATCAAATCAAGTGTGCTAGACTGTATGAATGATGGTAATGAGATGCATGGAACACCTACTAGTCAAAACACTCAAGAAAGAGATGCGTCGAGAGGGAGTAA
GAGGAAGCATTGTGGATATCATTCTGAAGTGGTAGAGGTTTTTAGGAACGCGATGGACTTTGCAAATGACCAAGTGAAGTCGATTGTAGAATTTGTGAGCCAACTCCAAG
ATATTCTTGAACTATACAACCAACAGAGTGTGAAACTTATGAATATCGTATTCCATAGTGTGCAAGATACAAAGAGCTTCTTGTCTATTCCAATGCCCTTGAAATTGGAG
TATTGCAGGTGGAGGTTCCCCATCCCCATCCCTGCCCCTGTAGGGGAAATTTACCCCATCCCCGCCCTCGTTTCCACCTACACTAATAAGTCGAGGATAGGGGCGGAGAT
TCCTCGTCGTGGAAACGGGTCCCCTGTGTGCAAGAGCGAGAGAGACAGAGGCAAGAGCGAGAAGGAGGCTAGAGCAAGACAGAAAAATGATGCGAGAGTGCGAGAGTCAA
CTATAAAATTAATTTTCATATTAAACGGGTTGGTATTAGGGATTCGGGGTCGAGGTCGAGGCGGGGATACCCCATCCCCGCTTTGTCTCCGAACGGGTACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACATTCCGCCATTGCAGCTCAATCGCCTGGCTGCGATTATGAAACCCAAACTACGGCACGAATGGCTAGATGACGATCGGAGATGGCGGGCATCAGCGGTGGGAGG
TTCTGCGAGAGAAAGGAAGAACGAAGAAGAAGATGATGAGCGAAAGAGAGACGATCGAGACTCGGAGGGAGGGAAAAGCTGTGCGGCTGAACTTTTAAAGAACCTGTTTG
AAGCCAGTCATCCTACAGTAAACGAGTTTCTATACAAGTCATTCTCATACCATGATAAATTGGCTTACATCTTCGGCAAGGGTTGGGCTATAGGAGCAAGGTCAAAGGCC
CCTAGGGACATCAAATCAAGTGTGCTAGACTGTATGAATGATGGTAATGAGATGCATGGAACACCTACTAGTCAAAACACTCAAGAAAGAGATGCGTCGAGAGGGAGTAA
GAGGAAGCATTGTGGATATCATTCTGAAGTGGTAGAGGTTTTTAGGAACGCGATGGACTTTGCAAATGACCAAGTGAAGTCGATTGTAGAATTTGTGAGCCAACTCCAAG
ATATTCTTGAACTATACAACCAACAGAGTGTGAAACTTATGAATATCGTATTCCATAGTGTGCAAGATACAAAGAGCTTCTTGTCTATTCCAATGCCCTTGAAATTGGAG
TATTGCAGGTGGAGGTTCCCCATCCCCATCCCTGCCCCTGTAGGGGAAATTTACCCCATCCCCGCCCTCGTTTCCACCTACACTAATAAGTCGAGGATAGGGGCGGAGAT
TCCTCGTCGTGGAAACGGGTCCCCTGTGTGCAAGAGCGAGAGAGACAGAGGCAAGAGCGAGAAGGAGGCTAGAGCAAGACAGAAAAATGATGCGAGAGTGCGAGAGTCAA
CTATAAAATTAATTTTCATATTAAACGGGTTGGTATTAGGGATTCGGGGTCGAGGTCGAGGCGGGGATACCCCATCCCCGCTTTGTCTCCGAACGGGTACCTAG
Protein sequenceShow/hide protein sequence
MNIPPLQLNRLAAIMKPKLRHEWLDDDRRWRASAVGGSARERKNEEEDDERKRDDRDSEGGKSCAAELLKNLFEASHPTVNEFLYKSFSYHDKLAYIFGKGWAIGARSKA
PRDIKSSVLDCMNDGNEMHGTPTSQNTQERDASRGSKRKHCGYHSEVVEVFRNAMDFANDQVKSIVEFVSQLQDILELYNQQSVKLMNIVFHSVQDTKSFLSIPMPLKLE
YCRWRFPIPIPAPVGEIYPIPALVSTYTNKSRIGAEIPRRGNGSPVCKSERDRGKSEKEARARQKNDARVRESTIKLIFILNGLVLGIRGRGRGGDTPSPLCLRTGT