; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G08670 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G08670
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr07:22739046..22749198
RNA-Seq ExpressionClc07G08670
SyntenyClc07G08670
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-1962.77Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA+Q VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

KAA0067084.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

TYK07761.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

TYK26319.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

A0A5A7V6N0 Gag/pol protein6.2e-2062.77Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA+Q VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

A0A5A7VI97 Gag/pol protein1.4e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

A0A5D3CPJ6 Gag/pol protein1.8e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

A0A5D3DS88 Gag/pol protein1.4e-1961.7Show/hide
Query:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS
        NG+ Y   + +  T    +  DL+F  + ECPQVPA NA++ VR+ YE+W +ANEKARAYILA+LSEVLAKKHE+M+TAREIMDSLQEMF Q S
Subjt:  NGHKYIYSEESAATGFSGVSQDLKF-SINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATTGTATTCTCATGGAGATCGGTTTTACATGGACACTTATATAGTGATTACTTCCTCGTCTTCATTGATCGGTGTTTCGGGTTCGCTTCTTCTTTTTCTTCTTC
TCCTCCATCGTCGGCTGCTCTCACTCTCGCTCTTTCTCTCTCTCTCTCTGGCTCTGGCTTTGGCTCTGGGTTTCGCTCTCGCTCTGTAAATCTATCTGACAGGGGGGTCG
AGGATGCACTCCCCGTCCCCGTTCCCGTTGGGATTTGTACCCGTCCCTGCCACATTACCCATTTAGGGGAGTTAGGTCGGGGTCAGGGATCCCCATTCGGGGATCAGGGA
GCTGGGAACATATCTTCACAAGATGAAATTCACTCCTTCCCGTGCTTAGGGGTCGACTTACTGTTGATTAGTAAATTCTGTGGACACAAATATATCTATAGTGCGAAGAG
TGCAACTAGGATAGCAAGATTACCATTGTGGTGGTGTTCCAAAGGAGAAGAGTTCTTGCTATGTTTTGTTGGAGATTCTCACCAGTTCATGACTTGGAAAAAGGAGATTT
CTACATTGAAGGTCTTCAAAGGTAAGAGTAAGCTCAATAGCACTGCTCAACATTCATCCCATTTCAGGGATAAGATCGGGATTGACTTATCGATGACAGTTATATCAAAT
GGACACAAATATATCTACAGTGAGGAGAGTGCAGCTACGGGCTTTAGTGGAGTGTCCCAGGACCTAAAGTTTTCTATCAATGAGTGTCCTCAAGTTCCTGCTCAAAATGC
GTCACAAAATGTTCGTGATGCATATGAGAAATGGATGAGGGCAAACGAAAAGGCCAGAGCATATATTCTTGCCAACTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCA
TGGTCACTGCTCGTGAGATCATGGATTCATTGCAAGAGATGTTTGAACAACTGTCCTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCATTGTATTCTCATGGAGATCGGTTTTACATGGACACTTATATAGTGATTACTTCCTCGTCTTCATTGATCGGTGTTTCGGGTTCGCTTCTTCTTTTTCTTCTTC
TCCTCCATCGTCGGCTGCTCTCACTCTCGCTCTTTCTCTCTCTCTCTCTGGCTCTGGCTTTGGCTCTGGGTTTCGCTCTCGCTCTGTAAATCTATCTGACAGGGGGGTCG
AGGATGCACTCCCCGTCCCCGTTCCCGTTGGGATTTGTACCCGTCCCTGCCACATTACCCATTTAGGGGAGTTAGGTCGGGGTCAGGGATCCCCATTCGGGGATCAGGGA
GCTGGGAACATATCTTCACAAGATGAAATTCACTCCTTCCCGTGCTTAGGGGTCGACTTACTGTTGATTAGTAAATTCTGTGGACACAAATATATCTATAGTGCGAAGAG
TGCAACTAGGATAGCAAGATTACCATTGTGGTGGTGTTCCAAAGGAGAAGAGTTCTTGCTATGTTTTGTTGGAGATTCTCACCAGTTCATGACTTGGAAAAAGGAGATTT
CTACATTGAAGGTCTTCAAAGGTAAGAGTAAGCTCAATAGCACTGCTCAACATTCATCCCATTTCAGGGATAAGATCGGGATTGACTTATCGATGACAGTTATATCAAAT
GGACACAAATATATCTACAGTGAGGAGAGTGCAGCTACGGGCTTTAGTGGAGTGTCCCAGGACCTAAAGTTTTCTATCAATGAGTGTCCTCAAGTTCCTGCTCAAAATGC
GTCACAAAATGTTCGTGATGCATATGAGAAATGGATGAGGGCAAACGAAAAGGCCAGAGCATATATTCTTGCCAACTTATCTGAAGTTTTGGCAAAGAAGCATGAAACCA
TGGTCACTGCTCGTGAGATCATGGATTCATTGCAAGAGATGTTTGAACAACTGTCCTCATAG
Protein sequenceShow/hide protein sequence
MSIVFSWRSVLHGHLYSDYFLVFIDRCFGFASSFSSSPPSSAALTLALSLSLSGSGFGSGFRSRSVNLSDRGVEDALPVPVPVGICTRPCHITHLGELGRGQGSPFGDQG
AGNISSQDEIHSFPCLGVDLLLISKFCGHKYIYSAKSATRIARLPLWWCSKGEEFLLCFVGDSHQFMTWKKEISTLKVFKGKSKLNSTAQHSSHFRDKIGIDLSMTVISN
GHKYIYSEESAATGFSGVSQDLKFSINECPQVPAQNASQNVRDAYEKWMRANEKARAYILANLSEVLAKKHETMVTAREIMDSLQEMFEQLSS