; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G014823 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G014823
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr09:27622701..27623353
RNA-Seq ExpressionClCG09G014823
SyntenyClCG09G014823
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053507.1 reverse transcriptase [Cucumis melo var. makuwa]4.7e-7060.37Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+F+H+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G+LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

KAA0059106.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-7060.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVED+L  TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

KAA0060055.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-7060.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]1.2e-7364.81Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHRP---------------------------------VRKTIKT
        MLAVV CL+ WRQYLLG+KFVVKT+NSS+CHFF QPKLSSKQARWQE LAEFDFQFEH+P                                 +R+ I+ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHRP---------------------------------VRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        +L ND AAQA+IQLA +G TRQF VE++L FTKGN LYVP+SG LR+LL+ ECHDT WAGH G QRTYALLK+GYYWPSLRDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSR
        R K+A LLEPLP+PSR
Subjt:  RAKLAGLLEPLPVPSR

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]2.2e-7260.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+VWRQYLLGS+FVVKT+NS+ CHFF QPKL++KQARWQE LAEFDF+FEH+                                  +R  IK 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        HLH D +A+A+++LA+ GKTRQFWVE +LL TKGNRLYVP++GELR+ L++ECHDT WAGH G QRTYAL+K+GY+WP++RDD+MQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        +AK++GLLEPLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

TrEMBL top hitse value%identityAlignment
A0A5A7UG93 Reverse transcriptase2.3e-7060.37Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+F+H+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G+LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5A7UY33 Reverse transcriptase1.7e-7060.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVED+L  TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5A7V2E6 Reverse transcriptase1.7e-7060.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5D3C4R1 Reverse transcriptase2.9e-7060.83Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQPKL+SKQARWQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A6J1DLQ6 uncharacterized protein LOC1110223205.7e-7464.81Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHRP---------------------------------VRKTIKT
        MLAVV CL+ WRQYLLG+KFVVKT+NSS+CHFF QPKLSSKQARWQE LAEFDFQFEH+P                                 +R+ I+ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHRP---------------------------------VRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        +L ND AAQA+IQLA +G TRQF VE++L FTKGN LYVP+SG LR+LL+ ECHDT WAGH G QRTYALLK+GYYWPSLRDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSR
        R K+A LLEPLP+PSR
Subjt:  RAKLAGLLEPLPVPSR

SwissProt top hitse value%identityAlignment
Q4R6I1 Gypsy retrotransposon integrase-like protein 16.4e-0633.73Show/hide
Query:  RQFWVEDNLLF-----TKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        ++F  +D  LF      K NRL +    E +++L +ECH+     H+G  RT  L++  YYW S+ +DV Q+      CQ  K
Subjt:  RQFWVEDNLLF-----TKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.7e-1123.81Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEH------------------------RPV-RKTIKTHLHNDTAA
        +L +++ L  +R  L G  F ++T++ S+     + + + +  RW + LA +DF  E+                        RP+  ++ K++  +D   
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEH------------------------RPV-RKTIKTHLHNDTAA

Query:  QALI----QLARKGKT---------------------RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRD
         A++    +L +   T                     + + +ED +++ + +RL VP   + +  +M+  HD T + GH G   T A +   YYWP L+ 
Subjt:  QALI----QLARKGKT---------------------RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRD

Query:  DVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV
         ++Q+ +T + CQ  K  R +L GLL+PLP+
Subjt:  DVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV

Q8K259 Gypsy retrotransposon integrase-like protein 11.1e-0539.39Show/hide
Query:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        K NRL V    E +++L +ECH+     H+G  RT  L++ GYYW S+ +DV Q+      CQ  K
Subjt:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.7e-1123.81Show/hide
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEH------------------------RPV-RKTIKTHLHNDTAA
        +L +++ L  +R  L G  F ++T++ S+     + + + +  RW + LA +DF  E+                        RP+  ++ K++  +D   
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEH------------------------RPV-RKTIKTHLHNDTAA

Query:  QALI----QLARKGKT---------------------RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRD
         A++    +L +   T                     + + +ED +++ + +RL VP   + +  +M+  HD T + GH G   T A +   YYWP L+ 
Subjt:  QALI----QLARKGKT---------------------RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRD

Query:  DVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV
         ++Q+ +T + CQ  K  R +L GLL+PLP+
Subjt:  DVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV

Q9NXP7 Gypsy retrotransposon integrase-like protein 11.9e-0536.36Show/hide
Query:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        K NRL +    E +++L +ECH+     H+G  RT  L++  YYW S+ +DV Q+      CQ  K
Subjt:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGCAGTCGTTCAATGTTTGCAGGTTTGGAGACAATATCTGTTAGGGTCCAAGTTCGTAGTCAAGACAAACAACAGCTCGATCTGTCACTTCTTCACCCAACCGAA
GTTATCGTCCAAGCAAGCCCGATGGCAGGAATGCCTTGCCGAATTTGACTTTCAGTTTGAACATAGACCCGTCCGGAAAACCATCAAAACCCACCTGCACAACGATACAG
CTGCCCAAGCCTTAATTCAGTTAGCTAGAAAAGGCAAGACTCGTCAATTTTGGGTGGAAGACAATCTCCTTTTCACCAAGGGAAATCGCCTATATGTTCCTCAATCTGGG
GAGCTACGGAGACTATTAATGAAAGAGTGTCACGATACGCCGTGGGCCGGACACAACGGTGGGCAAAGAACGTACGCGCTATTGAAACAAGGTTACTACTGGCCAAGCCT
CAGAGACGACGTCATGCAATTCACTAAGACCTTTCTTGTCTGTCAACAGGACAAAGTGGATAGGGCGAAACTAGCGGGTCTGCTGGAACCCCTACCAGTGCCCTCAAGGC
CATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGCAGTCGTTCAATGTTTGCAGGTTTGGAGACAATATCTGTTAGGGTCCAAGTTCGTAGTCAAGACAAACAACAGCTCGATCTGTCACTTCTTCACCCAACCGAA
GTTATCGTCCAAGCAAGCCCGATGGCAGGAATGCCTTGCCGAATTTGACTTTCAGTTTGAACATAGACCCGTCCGGAAAACCATCAAAACCCACCTGCACAACGATACAG
CTGCCCAAGCCTTAATTCAGTTAGCTAGAAAAGGCAAGACTCGTCAATTTTGGGTGGAAGACAATCTCCTTTTCACCAAGGGAAATCGCCTATATGTTCCTCAATCTGGG
GAGCTACGGAGACTATTAATGAAAGAGTGTCACGATACGCCGTGGGCCGGACACAACGGTGGGCAAAGAACGTACGCGCTATTGAAACAAGGTTACTACTGGCCAAGCCT
CAGAGACGACGTCATGCAATTCACTAAGACCTTTCTTGTCTGTCAACAGGACAAAGTGGATAGGGCGAAACTAGCGGGTCTGCTGGAACCCCTACCAGTGCCCTCAAGGC
CATAG
Protein sequenceShow/hide protein sequence
MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQPKLSSKQARWQECLAEFDFQFEHRPVRKTIKTHLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSG
ELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP