; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G193875 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G193875
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGag-pol polyprotein
Genome locationCla97Chr10:22735363..22736133
RNA-Seq ExpressionCla97C10G193875
SyntenyCla97C10G193875
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAC64917.1 gag-pol polyprotein [Glycine max]6.3e-6557.02Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK    LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL   HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + EF++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D T+KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++ LL KQF K L R DRR
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

AAO73521.1 gag-pol polyprotein [Glycine max]1.4e-6456.61Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK  + LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL I HE TSKVKISRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D  +KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          +  + E L  ++ LL KQF K L R D+R
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

AAO73527.1 gag-pol polyprotein [Glycine max]1.8e-6456.2Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK  + LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL I HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D  +KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++ LL KQF K L R D+R
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

AAO73529.1 gag-pol polyprotein [Glycine max]5.3e-6456.2Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK    LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL   HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D T+KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++  L KQF K L R DRR
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

MCI11749.1 gag-pol polyprotein [Trifolium medium]2.4e-6458.52Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGKESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRLKL
        M AFLKS+D++ WK +I GWT   VT  DG  SLK E +WTEAED  +LGNS ALNAIFN VD+N+FRLINTC++AKEAW+IL  AHE TSKV++S+L+L
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGKESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRLKL

Query:  LTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIALQ
        LT KFE+LK+ +DET+ EFN RL DIAN S AL E   EEKLVR +LRSLPKRFDMKV  ++E QD++ MKVDEL GSL TFEM + D T+KK K IA  
Subjt:  LTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIALQ

Query:  SIVAHNTSLTKNKESKENLAESMPLLAKQ
                +++     +NL E++ L+ K+
Subjt:  SIVAHNTSLTKNKESKENLAESMPLLAKQ

TrEMBL top hitse value%identityAlignment
A0A392PJS5 Gag-pol polyprotein (Fragment)1.2e-6458.52Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGKESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRLKL
        M AFLKS+D++ WK +I GWT   VT  DG  SLK E +WTEAED  +LGNS ALNAIFN VD+N+FRLINTC++AKEAW+IL  AHE TSKV++S+L+L
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGKESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRLKL

Query:  LTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIALQ
        LT KFE+LK+ +DET+ EFN RL DIAN S AL E   EEKLVR +LRSLPKRFDMKV  ++E QD++ MKVDEL GSL TFEM + D T+KK K IA  
Subjt:  LTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIALQ

Query:  SIVAHNTSLTKNKESKENLAESMPLLAKQ
                +++     +NL E++ L+ K+
Subjt:  SIVAHNTSLTKNKESKENLAESMPLLAKQ

O65147 Gag-pol polyprotein3.0e-6557.02Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK    LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL   HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + EF++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D T+KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++ LL KQF K L R DRR
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

Q84VH6 Gag-pol polyprotein2.6e-6456.2Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK    LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL   HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D T+KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++  L KQF K L R DRR
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

Q84VH8 Gag-pol polyprotein8.8e-6556.2Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK  + LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL I HE TSKVK+SRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D  +KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          + ++ E L  ++ LL KQF K L R D+R
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

Q84VI4 Gag-pol polyprotein6.8e-6556.61Show/hide
Query:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL
        M AFLKS+D++TWK VI GW   ++   +GK  + LK E+DWT+ EDE +LGNS ALNA+FN VDKNIFRLINTC +AK+AW+IL I HE TSKVKISRL
Subjt:  MTAFLKSIDNKTWKVVILGWTPLQVTVVDGK--ESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRL

Query:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA
        +LL  KFE+LK+ ++E + +F++ +L+IAN   ALGE   +EKLVRK+LRSLPKRFDMKVT ++EAQDI NM+VDEL GSL TFE+ L D  +KK K +A
Subjt:  KLLTFKFESLKILDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIA

Query:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR
          S          +  + E L  ++ LL KQF K L R D+R
Subjt:  LQSIVAHNTSLTKNKESKENLAESMPLLAKQFGKALRRWDRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCCTTCCTTAAGTCCATTGATAACAAAACTTGGAAAGTCGTGATATTGGGATGGACCCCTCTTCAAGTCACTGTTGTAGATGGCAAGGAGAGTTTGAAATCTGA
GAAAGATTGGACCGAAGCAGAAGATGAGGCATCCTTGGGAAATTCCAGTGCCTTGAATGCCATTTTCAACCTTGTTGATAAAAATATCTTTAGATTAATTAATACATGTG
TCTTAGCCAAAGAAGCATGGGACATTCTTGCAATTGCTCACGAAGAGACATCTAAAGTAAAGATATCAAGACTAAAACTTCTTACTTTTAAGTTTGAATCTCTGAAGATA
CTCGACGATGAAACTGTGGCTGAGTTTAATGTTCGTCTGCTAGACATAGCTAATGAGTCTTTCGCTCTTGGGGAAATTTTTTTTGAAGAAAAGCTGGTTCGTAAAGTCCT
TCGGTCTCTCCCCAAAAGGTTTGATATGAAAGTCACAACTGTACAAGAGGCTCAAGACATTGCCAACATGAAGGTTGATGAATTGTTTGGTTCTTTATGCACATTCGAAA
TGACCTTGGATGATAACACAGATAAAAAATTCAAAGGTATTGCTCTTCAGTCAATTGTTGCCCATAATACATCTCTGACTAAGAACAAGGAATCTAAGGAGAATCTTGCA
GAATCGATGCCTCTTTTGGCGAAACAATTTGGGAAGGCTCTCAGACGTTGGGATAGGCGTATAGGATCTTGTGGTAACTATGTTCCCACAAATGCCAAGGACATCAACTA
G
mRNA sequenceShow/hide mRNA sequence
ATGACTGCCTTCCTTAAGTCCATTGATAACAAAACTTGGAAAGTCGTGATATTGGGATGGACCCCTCTTCAAGTCACTGTTGTAGATGGCAAGGAGAGTTTGAAATCTGA
GAAAGATTGGACCGAAGCAGAAGATGAGGCATCCTTGGGAAATTCCAGTGCCTTGAATGCCATTTTCAACCTTGTTGATAAAAATATCTTTAGATTAATTAATACATGTG
TCTTAGCCAAAGAAGCATGGGACATTCTTGCAATTGCTCACGAAGAGACATCTAAAGTAAAGATATCAAGACTAAAACTTCTTACTTTTAAGTTTGAATCTCTGAAGATA
CTCGACGATGAAACTGTGGCTGAGTTTAATGTTCGTCTGCTAGACATAGCTAATGAGTCTTTCGCTCTTGGGGAAATTTTTTTTGAAGAAAAGCTGGTTCGTAAAGTCCT
TCGGTCTCTCCCCAAAAGGTTTGATATGAAAGTCACAACTGTACAAGAGGCTCAAGACATTGCCAACATGAAGGTTGATGAATTGTTTGGTTCTTTATGCACATTCGAAA
TGACCTTGGATGATAACACAGATAAAAAATTCAAAGGTATTGCTCTTCAGTCAATTGTTGCCCATAATACATCTCTGACTAAGAACAAGGAATCTAAGGAGAATCTTGCA
GAATCGATGCCTCTTTTGGCGAAACAATTTGGGAAGGCTCTCAGACGTTGGGATAGGCGTATAGGATCTTGTGGTAACTATGTTCCCACAAATGCCAAGGACATCAACTA
G
Protein sequenceShow/hide protein sequence
MTAFLKSIDNKTWKVVILGWTPLQVTVVDGKESLKSEKDWTEAEDEASLGNSSALNAIFNLVDKNIFRLINTCVLAKEAWDILAIAHEETSKVKISRLKLLTFKFESLKI
LDDETVAEFNVRLLDIANESFALGEIFFEEKLVRKVLRSLPKRFDMKVTTVQEAQDIANMKVDELFGSLCTFEMTLDDNTDKKFKGIALQSIVAHNTSLTKNKESKENLA
ESMPLLAKQFGKALRRWDRRIGSCGNYVPTNAKDIN