CuGenDBv2

Clc06G19830 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G19830
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: ClcChr06:29773988..29775279
RNA-Seq Expression: Clc06G19830
Synteny: Clc06G19830
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
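
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("ClcChr06:29773988..29775279")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the span covers both end positions; if the database used 0-based or half-open coordinates, the `+ 1` would need to change.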


Homology
GenBank top hits (e value, % identity, alignment)
CAN81137.1 hypothetical protein VITISV_018758 [Vitis vinifera] | e value: 6.0e-30 | identity: 31.91%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNPK   FW P+IE+I  + D W+  Y+S GGR+TL+Q+ L+++P Y+LSLF+ P  V   IERL RDFLW G  +    HLVRWD+   PK IGG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAA--NL--LGIALLSFITTFLKILNGVLLMV-GTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTIRWKLTTDGTFTVKSI--
        L     +  NL  LG  L  +      + + V+L + G+ S+G       D +  +    +  W A    +P      +    ++   +   F VKS   
Subjt:  LAITIAA--NL--LGIALLSFITTFLKILNGVLLMV-GTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTIRWKLTTDGTFTVKSI--

Query:  ---------KDCIRPPSPQNYLSKFVGNL-WNSSESAEHIFMTCSFSSVFWDQLQQ--EGITLHTQTIQASLDFIFRQTTTNKPQI-LNSNLIAATLWSI
                  D ++   P   LS  +  L     ESA+H+F+ CS +   W ++ Q  +   +   +I   +   F+   T+K  I L      A +  +
Subjt:  ---------KDCIRPPSPQNYLSKFVGNL-WNSSESAEHIFMTCSFSSVFWDQLQQ--EGITLHTQTIQASLDFIFRQTTTNKPQI-LNSNLIAATLWSI

Query:  WLELN-RTFQGSCENQNYLWADILSLAAL
        W E N R F+    N  +LW  I+ LA+L
Subjt:  WLELN-RTFQGSCENQNYLWADILSLAAL

KAA0035248.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.8e-29 | identity: 31.21%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNP++ SFW   IE I KK + WKYS ISK GRLTLL+A LS+LPTY LS F+AP  V K IE+  RDFLW G+E    +HLV W I TSPK +GG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------
        L I+   +     L  ++  +    N   L    + +       GD+ I        +    T   P P    NG TI                      
Subjt:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------

Query:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN
              WK +    ++V S KD +      P   N      + +K + NLW+S                      + GI +    ++     + R T  N
Subjt:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN

Query:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL
           I++ N   ATLW+IW+  N   F     +    W DI +L  +
Subjt:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL

RVW94236.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 4.5e-33 | identity: 34.12%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNPKA  FW P++E+I  + D W+ +Y+S GGR+TL+Q+ L++LP+Y+LSLF+ P  V   IERL RDFLW G  +    HLVRWD+   PK IGG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAA----NLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPS--FDLNG---------------DTIR
        L +   +     LLG  L S + T L  L G    +   S  +R  +       L      LW+         S   DL G               D   
Subjt:  LAITIAA----NLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPS--FDLNG---------------DTIR

Query:  WKLTTDGTFTVKSIKDCIRPP--SPQNYLSKFVGNLWNSSESAEHIFMTCS------FSSVFWDQ--LQQEGITLHTQTIQASLDFIFRQTTTNKPQILN
        W L++ G F+VKS    +     S QN+ SKFV   WNS    +     C+      FSS+F D   + Q GI L      AS+                
Subjt:  WKLTTDGTFTVKSIKDCIRPP--SPQNYLSKFVGNLWNSSESAEHIFMTCS------FSSVFWDQ--LQQEGITLHTQTIQASLDFIFRQTTTNKPQILN

Query:  SNLIAATLWSIWLELN-RTFQGSCENQNYLWADILSLAAL
             A +  +W E N R F+    N  +LW  I+ LA+L
Subjt:  SNLIAATLWSIWLELN-RTFQGSCENQNYLWADILSLAAL

TYK22403.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.8e-29 | identity: 31.21%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNP++ SFW   IE I KK + WKYS ISK GRLTLL+A LS+LPTY LS F+AP  V K IE+  RDFLW G+E    +HLV W I TSPK +GG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------
        L I+   +     L  ++  +    N   L    + +       GD+ I        +    T   P P    NG TI                      
Subjt:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------

Query:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN
              WK +    ++V S KD +      P   N      + +K + NLW+S                      + GI +    ++     + R T  N
Subjt:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN

Query:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL
           I++ N   ATLW+IW+  N   F     +    W DI +L  +
Subjt:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL

XP_022143310.1 uncharacterized protein LOC111013210 [Momordica charantia] | e value: 1.0e-29 | identity: 62.5%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        MPLGGNPKA+ FW PIIE+  +K ++WKYS ISK GRLTL+Q++LS+LPTYYLS+F++P  V K +E+LMRDFLW+G      SHLVRW+I T PK  G 
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAIT
        L IT
Subjt:  LAIT

TrEMBL top hits (e value, % identity, alignment)
A0A438IBZ1 LINE-1 retrotransposable element ORF2 protein | e value: 2.2e-33 | identity: 34.12%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNPKA  FW P++E+I  + D W+ +Y+S GGR+TL+Q+ L++LP+Y+LSLF+ P  V   IERL RDFLW G  +    HLVRWD+   PK IGG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAA----NLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPS--FDLNG---------------DTIR
        L +   +     LLG  L S + T L  L G    +   S  +R  +       L      LW+         S   DL G               D   
Subjt:  LAITIAA----NLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPS--FDLNG---------------DTIR

Query:  WKLTTDGTFTVKSIKDCIRPP--SPQNYLSKFVGNLWNSSESAEHIFMTCS------FSSVFWDQ--LQQEGITLHTQTIQASLDFIFRQTTTNKPQILN
        W L++ G F+VKS    +     S QN+ SKFV   WNS    +     C+      FSS+F D   + Q GI L      AS+                
Subjt:  WKLTTDGTFTVKSIKDCIRPP--SPQNYLSKFVGNLWNSSESAEHIFMTCS------FSSVFWDQ--LQQEGITLHTQTIQASLDFIFRQTTTNKPQILN

Query:  SNLIAATLWSIWLELN-RTFQGSCENQNYLWADILSLAAL
             A +  +W E N R F+    N  +LW  I+ LA+L
Subjt:  SNLIAATLWSIWLELN-RTFQGSCENQNYLWADILSLAAL

A0A5A7SXB2 LINE-1 retrotransposable element ORF2 protein | e value: 8.5e-30 | identity: 31.21%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNP++ SFW   IE I KK + WKYS ISK GRLTLL+A LS+LPTY LS F+AP  V K IE+  RDFLW G+E    +HLV W I TSPK +GG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------
        L I+   +     L  ++  +    N   L    + +       GD+ I        +    T   P P    NG TI                      
Subjt:  LAITIAANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTI----------------------

Query:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN
              WK +    ++V S KD +      P   N      + +K + NLW+S                      + GI +    ++     + R T  N
Subjt:  -----RWKLTTDGTFTVKSIKDCI----RPPSPQN------YLSKFVGNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTN

Query:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL
           I++ N   ATLW+IW+  N   F     +    W DI +L  +
Subjt:  KPQILNSNLIAATLWSIWLELNR-TFQGSCENQNYLWADILSLAAL

A0A6J1CNG6 uncharacterized protein LOC111013210 | e value: 5.0e-30 | identity: 62.5%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        MPLGGNPKA+ FW PIIE+  +K ++WKYS ISK GRLTL+Q++LS+LPTYYLS+F++P  V K +E+LMRDFLW+G      SHLVRW+I T PK  G 
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAIT
        L IT
Subjt:  LAIT

A0A803P465 Uncharacterized protein | e value: 5.9e-31 | identity: 59.22%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        MPLGG+P+  SFW+P+++K  K+ D WK +++SKGGRLTL+Q+VLS+LP Y+LSLF+AP  V K +E++MRDFLWEG+E SG  HLV WD    P+H GG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAI
        L I
Subjt:  LAI

A5BHF0 Uncharacterized protein | e value: 2.9e-30 | identity: 31.91%
Query:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG
        +PLGGNPK   FW P+IE+I  + D W+  Y+S GGR+TL+Q+ L+++P Y+LSLF+ P  V   IERL RDFLW G  +    HLVRWD+   PK IGG
Subjt:  MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGG

Query:  LAITIAA--NL--LGIALLSFITTFLKILNGVLLMV-GTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTIRWKLTTDGTFTVKSI--
        L     +  NL  LG  L  +      + + V+L + G+ S+G       D +  +    +  W A    +P      +    ++   +   F VKS   
Subjt:  LAITIAA--NL--LGIALLSFITTFLKILNGVLLMV-GTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTIRWKLTTDGTFTVKSI--

Query:  ---------KDCIRPPSPQNYLSKFVGNL-WNSSESAEHIFMTCSFSSVFWDQLQQ--EGITLHTQTIQASLDFIFRQTTTNKPQI-LNSNLIAATLWSI
                  D ++   P   LS  +  L     ESA+H+F+ CS +   W ++ Q  +   +   +I   +   F+   T+K  I L      A +  +
Subjt:  ---------KDCIRPPSPQNYLSKFVGNL-WNSSESAEHIFMTCSFSSVFWDQLQQ--EGITLHTQTIQASLDFIFRQTTTNKPQI-LNSNLIAATLWSI

Query:  WLELN-RTFQGSCENQNYLWADILSLAAL
        W E N R F+    N  +LW  I+ LA+L
Subjt:  WLELN-RTFQGSCENQNYLWADILSLAAL

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.2e-12 | identity: 36.63%
Query:  IIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGGLAITIAANLLGIALL
        I+E++  +   W+   +S  GRLTL +AVLS++P + +S    P  +   +++L R FLW  T +    HLV+W    SPK  GGL +  AA  +  AL+
Subjt:  IIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGGLAITIAANLLGIALL

Query:  S
        S
Subjt:  S

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCCCTAGGAGGAAATCCGAAAGCTGCTAGTTTTTGGCAACCGATAATTGAGAAGATTGATAAGAAATTCGATGCTTGGAAATACTCATATATTTCAAAAGGT
GGACGATTAACATTATTGCAAGCAGTTCTGAGTAACCTCCCAACATATTATCTTTCTCTGTTTCAGGCCCCAACTGTTGTTTGCAAAACCATTGAGAGATTAATG
AGAGATTTCCTTTGGGAGGGGACTGAGAAAAGTGGTGCTTCACATCTTGTACGTTGGGATATTACCACTTCGCCAAAGCACATCGGTGGTTTAGCAATCACCATA
GCTGCAAATCTCCTTGGCATAGCATTGTTAAGCTTCATCACTACATTTCTGAAAATATTGAATGGCGTATTGTTGATGGTAGGAACACTCTCTTCTGGTTCACGA
ATTGGAATAGCAGGGGATGTCTCAATACCATTGATGCAAAGAGAAAAGGACCTATGGAGTGCAACAACAGCAGGCTGGCCACAACCATCTTTCGACTTGAATGGT
GATACAATAAGATGGAAGCTGACAACTGATGGTACCTTCACTGTCAAATCCATAAAAGACTGCATTAGACCCCCCTCCCCTCAAAACTACTTGAGCAAATTTGTG
GGAAATCTCTGGAACAGTTCAGAATCTGCTGAACATATCTTTATGACTTGTAGTTTTTCCTCTGTTTTTTGGGACCAATTGCAGCAGGAAGGCATTACTCTTCAT
ACTCAAACAATTCAAGCTAGCCTTGACTTCATATTCCGACAAACAACGACCAACAAACCCCAAATCCTCAACAGCAACCTGATAGCAGCTACTCTATGGAGCATT
TGGCTTGAACTTAATAGAACTTTCCAAGGAAGTTGCGAAAATCAGAACTATCTTTGGGCTGATATTCTATCCCTGGCTGCTCTTTAG
mRNA sequence
ATGCCCCTAGGAGGAAATCCGAAAGCTGCTAGTTTTTGGCAACCGATAATTGAGAAGATTGATAAGAAATTCGATGCTTGGAAATACTCATATATTTCAAAAGGT
GGACGATTAACATTATTGCAAGCAGTTCTGAGTAACCTCCCAACATATTATCTTTCTCTGTTTCAGGCCCCAACTGTTGTTTGCAAAACCATTGAGAGATTAATG
AGAGATTTCCTTTGGGAGGGGACTGAGAAAAGTGGTGCTTCACATCTTGTACGTTGGGATATTACCACTTCGCCAAAGCACATCGGTGGTTTAGCAATCACCATA
GCTGCAAATCTCCTTGGCATAGCATTGTTAAGCTTCATCACTACATTTCTGAAAATATTGAATGGCGTATTGTTGATGGTAGGAACACTCTCTTCTGGTTCACGA
ATTGGAATAGCAGGGGATGTCTCAATACCATTGATGCAAAGAGAAAAGGACCTATGGAGTGCAACAACAGCAGGCTGGCCACAACCATCTTTCGACTTGAATGGT
GATACAATAAGATGGAAGCTGACAACTGATGGTACCTTCACTGTCAAATCCATAAAAGACTGCATTAGACCCCCCTCCCCTCAAAACTACTTGAGCAAATTTGTG
GGAAATCTCTGGAACAGTTCAGAATCTGCTGAACATATCTTTATGACTTGTAGTTTTTCCTCTGTTTTTTGGGACCAATTGCAGCAGGAAGGCATTACTCTTCAT
ACTCAAACAATTCAAGCTAGCCTTGACTTCATATTCCGACAAACAACGACCAACAAACCCCAAATCCTCAACAGCAACCTGATAGCAGCTACTCTATGGAGCATT
TGGCTTGAACTTAATAGAACTTTCCAAGGAAGTTGCGAAAATCAGAACTATCTTTGGGCTGATATTCTATCCCTGGCTGCTCTTTAG
Protein sequence
MPLGGNPKAASFWQPIIEKIDKKFDAWKYSYISKGGRLTLLQAVLSNLPTYYLSLFQAPTVVCKTIERLMRDFLWEGTEKSGASHLVRWDITTSPKHIGGLAITI
AANLLGIALLSFITTFLKILNGVLLMVGTLSSGSRIGIAGDVSIPLMQREKDLWSATTAGWPQPSFDLNGDTIRWKLTTDGTFTVKSIKDCIRPPSPQNYLSKFV
GNLWNSSESAEHIFMTCSFSSVFWDQLQQEGITLHTQTIQASLDFIFRQTTTNKPQILNSNLIAATLWSIWLELNRTFQGSCENQNYLWADILSLAAL
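
As a rough consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative, and only the first line of the CDS is used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the database used this table; the page does not say so.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First line of the CDS sequence listed above (105 nt, 35 codons).
first_cds_line = (
    "ATGCCCCTAGGAGGAAATCCGAAAGCTGCTAGTTTTTGGCAACCGATAATTGAG"
    "AAGATTGATAAGAAATTCGATGCTTGGAAATACTCATATATTTCAAAAGGT"
)
print(translate(first_cds_line))
```

Running the same function over the full CDS and comparing the result, minus the trailing `*`, against the listed protein sequence is one way to confirm that the two records agree.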