CuGenDBv2

Clc09G16640 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G16640
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Genome location: ClcChr09:26293098..26293750
RNA-Seq Expression: Clc09G16640
Synteny: Clc09G16640
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0059106.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 2.6e-68 | % identity: 59.91
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQ KL+SKQA+WQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVED+L  TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

KAA0060055.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 2.6e-68 | % identity: 59.91
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQ KL+SKQA+WQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

TYK19195.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 5.1e-69 | % identity: 64.43
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP----------VRKTIKTHLHNDTAAQALIQLARKGKTRQF
        MLAVV CL+ W+QYLLGS FVVKT+NS+ CHFFTQLKL+SKQA+WQE LAEFDF+FEH+           +R T++  L  D  AQ ++ L + GKTRQF
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP----------VRKTIKTHLHNDTAAQALIQLARKGKTRQF

Query:  WVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP
        WVE++LL TK NRLYVP+ G+LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV++ K+ GLL+PLPVP+RP
Subjt:  WVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 8.5e-72 | % identity: 63.89
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP---------------------------------VRKTIKT
        MLAVV CL+ WRQYLLG+KFVVKT+NSS+CHFF Q KLSSKQA+WQE LAEFDFQFEH+P                                 +R+ I+ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP---------------------------------VRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        +L ND AAQA+IQLA +G TRQF VE++L FTKGN LYVP+SG LR+LL+ ECHDT WAGH G QRTYALLK+GYYWPSLRDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSR
        R K+A LLEPLP+PSR
Subjt:  RAKLAGLLEPLPVPSR

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] | e value: 1.6e-70 | % identity: 59.91
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+VWRQYLLGS+FVVKT+NS+ CHFF Q KL++KQA+WQE LAEFDF+FEH+                                  +R  IK 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        HLH D +A+A+++LA+ GKTRQFWVE +LL TKGNRLYVP++GELR+ L++ECHDT WAGH G QRTYAL+K+GY+WP++RDD+MQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        +AK++GLLEPLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UG93 Reverse transcriptase | e value: 1.6e-68 | % identity: 59.45
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQ KL+SKQA+WQE LAEFDF+F+H+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G+LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5A7UY33 Reverse transcriptase | e value: 1.2e-68 | % identity: 59.91
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQ KL+SKQA+WQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVED+L  TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5A7V2E6 Reverse transcriptase | e value: 1.2e-68 | % identity: 59.91
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT
        MLAVV CL+ WRQYLLGS FVVKT+NS+ CHFFTQ KL+SKQA+WQE LAEFDF+FEH+                                  VR T++ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHR---------------------------------PVRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
         L  D AAQ ++ LA+ GKTRQFWVE++LL TKGNRLYVP++G LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSRP
        + K+AGLL+PLPVP+RP
Subjt:  RAKLAGLLEPLPVPSRP

A0A5D3D6N9 Reverse transcriptase | e value: 2.5e-69 | % identity: 64.43
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP----------VRKTIKTHLHNDTAAQALIQLARKGKTRQF
        MLAVV CL+ W+QYLLGS FVVKT+NS+ CHFFTQLKL+SKQA+WQE LAEFDF+FEH+           +R T++  L  D  AQ ++ L + GKTRQF
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP----------VRKTIKTHLHNDTAAQALIQLARKGKTRQF

Query:  WVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP
        WVE++LL TK NRLYVP+ G+LR+ L+ ECHDT WAGH G QRTYALLK+GY+WP++RDDVMQ+TKT L+CQQDKV++ K+ GLL+PLPVP+RP
Subjt:  WVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 4.1e-72 | % identity: 63.89
Query:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP---------------------------------VRKTIKT
        MLAVV CL+ WRQYLLG+KFVVKT+NSS+CHFF Q KLSSKQA+WQE LAEFDFQFEH+P                                 +R+ I+ 
Subjt:  MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRP---------------------------------VRKTIKT

Query:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD
        +L ND AAQA+IQLA +G TRQF VE++L FTKGN LYVP+SG LR+LL+ ECHDT WAGH G QRTYALLK+GYYWPSLRDDVMQ+TKT L+CQQDKV+
Subjt:  HLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVD

Query:  RAKLAGLLEPLPVPSR
        R K+A LLEPLP+PSR
Subjt:  RAKLAGLLEPLPVPSR

SwissProt top hits (e value, % identity, alignment)
Q4R6I1 Gypsy retrotransposon integrase-like protein 1 | e value: 6.4e-06 | % identity: 33.73
Query:  RQFWVEDNLLF-----TKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        ++F  +D  LF      K NRL +    E +++L +ECH+     H+G  RT  L++  YYW S+ +DV Q+      CQ  K
Subjt:  RQFWVEDNLLF-----TKGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 1.8e-08 | % identity: 35.11
Query:  RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV
        + + +ED +++ + +RL VP   + +  +M+  HD T + GH G   T A +   YYWP L+  ++Q+ +T + CQ  K  R +L GLL+PLP+
Subjt:  RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV

Q8K259 Gypsy retrotransposon integrase-like protein 1 | e value: 1.1e-05 | % identity: 39.39
Query:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        K NRL V    E +++L +ECH+     H+G  RT  L++ GYYW S+ +DV Q+      CQ  K
Subjt:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 1.8e-08 | % identity: 35.11
Query:  RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV
        + + +ED +++ + +RL VP   + +  +M+  HD T + GH G   T A +   YYWP L+  ++Q+ +T + CQ  K  R +L GLL+PLP+
Subjt:  RQFWVEDNLLFTKGNRLYVPQSGELRRLLMKECHD-TPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPV

Q9NXP7 Gypsy retrotransposon integrase-like protein 1 | e value: 1.9e-05 | % identity: 36.36
Query:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
        K NRL +    E +++L +ECH+     H+G  RT  L++  YYW S+ +DV Q+      CQ  K
Subjt:  KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK
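
The % identity values listed with each hit can be related to the alignment rows shown under it. The following is a minimal sketch, assuming the usual BLAST-style convention that a letter in the middle row marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap, and that the alignment length is the number of columns in the Query row. The function name `percent_identity` is hypothetical and is not part of the database.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity = identical columns / alignment length * 100."""
    # Alignment length counts every column of the Query row, gaps ('-') included.
    length = len(query_row)
    # Assumed convention: only letters in the middle row are identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Query row and middle row of the Q8K259 alignment above.
query = "KGNRLYVPQSGELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDK"
midline = "K NRL V    E +++L +ECH+     H+G  RT  L++ GYYW S+ +DV Q+      CQ  K"
print(percent_identity(query, midline))
```

For this hit the middle row holds 26 letters over 66 columns, which agrees with the 39.39 listed for Q8K259.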

Arabidopsis top hits (e value, % identity, alignment)
No hits found.

Sequences
CDS sequence
ATGTTGGCAGTCGTTCAATGTTTGCAGGTTTGGAGACAATATCTGTTAGGGTCCAAGTTCGTAGTCAAGACAAACAACAGCTCGATCTGTCACTTCTTCACCCAACTGAA
GTTATCGTCCAAGCAAGCCCAATGGCAGGAATGCCTTGCCGAATTCGACTTCCAGTTTGAACATAGACCCGTCCGGAAAACCATCAAAACCCACCTGCACAACGATACAG
CTGCCCAAGCCTTAATTCAGTTAGCTAGAAAAGGCAAGACTCGTCAATTTTGGGTGGAAGACAATCTCCTTTTCACCAAGGGAAATCGCCTATATGTTCCTCAATCTGGG
GAGCTACGGAGACTATTAATGAAAGAGTGTCACGATACGCCGTGGGCCGGACACAACGGTGGGCAAAGAACGTACGCGCTATTGAAACAAGGTTACTACTGGCCAAGCCT
CAGAGACGACGTCATGCAATTCACTAAGACCTTTCTTGTCTGTCAACAGGACAAAGTGGATAGGGCGAAACTAGCGGGTCTGCTGGAACCCCTACCAGTGCCCTCAAGGC
CATAG
mRNA sequence
ATGTTGGCAGTCGTTCAATGTTTGCAGGTTTGGAGACAATATCTGTTAGGGTCCAAGTTCGTAGTCAAGACAAACAACAGCTCGATCTGTCACTTCTTCACCCAACTGAA
GTTATCGTCCAAGCAAGCCCAATGGCAGGAATGCCTTGCCGAATTCGACTTCCAGTTTGAACATAGACCCGTCCGGAAAACCATCAAAACCCACCTGCACAACGATACAG
CTGCCCAAGCCTTAATTCAGTTAGCTAGAAAAGGCAAGACTCGTCAATTTTGGGTGGAAGACAATCTCCTTTTCACCAAGGGAAATCGCCTATATGTTCCTCAATCTGGG
GAGCTACGGAGACTATTAATGAAAGAGTGTCACGATACGCCGTGGGCCGGACACAACGGTGGGCAAAGAACGTACGCGCTATTGAAACAAGGTTACTACTGGCCAAGCCT
CAGAGACGACGTCATGCAATTCACTAAGACCTTTCTTGTCTGTCAACAGGACAAAGTGGATAGGGCGAAACTAGCGGGTCTGCTGGAACCCCTACCAGTGCCCTCAAGGC
CATAG
Protein sequence
MLAVVQCLQVWRQYLLGSKFVVKTNNSSICHFFTQLKLSSKQAQWQECLAEFDFQFEHRPVRKTIKTHLHNDTAAQALIQLARKGKTRQFWVEDNLLFTKGNRLYVPQSG
ELRRLLMKECHDTPWAGHNGGQRTYALLKQGYYWPSLRDDVMQFTKTFLVCQQDKVDRAKLAGLLEPLPVPSRP
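
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a sequence containing only A, C, G and T; the names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks, normalise case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTGGCAGTCGTTCAATGTTTGCAGGTTTGG"))  # first 11 codons of the CDS
print(translate("CCCTCAAGGCCATAG"))                    # last 5 codons of the CDS
```

The two calls print `MLAVVQCLQVW` and `PSRP`, matching the start and end of the protein sequence listed above; passing the full CDS (with its line breaks) works the same way.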