; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019818 (gene) of Snake gourd v1 genome

Gene IDTan0019818
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:67624483..67625832
RNA-Seq ExpressionTan0019818
SyntenyTan0019818
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059677.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-3458.82Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P SNA++    AYDRWIKAN+KA VY+LASMSD+LAKK+E   TAKEIMD ++G+FGQ     RH  +KYI+  RM EGTSVR+HVLDMM+HFNIAE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESS------QNGLLNELEENSLPRCESRCKAK
          IDE+       +NGLL++ E+NSL  C+S  + K
Subjt:  AFIDESS------QNGLLNELEENSLPRCESRCKAK

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]2.6e-3575.49Show/hide
Query:  NASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNGAFI
        NA+  V  AYDRWIKANDKAKVY+LAS+SD+LAKKHE TITAKEIMD +Q +FGQ S+QARH ALK+I+NSRM EG+SVR+HVL++MVHFN+AESNGA I
Subjt:  NASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNGAFI

Query:  DE
        DE
Subjt:  DE

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]2.2e-3465.79Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P  NA+R   DAYDRWIKANDKAKVY+LAS+SD+LAKKHE  + A+EIMD ++ +FGQ S QARH ALK+I+NSRM EGTS+++HVL++MVHFN+AE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQNGLLNE
        A IDE SQ   + E
Subjt:  AFIDESSQNGLLNE

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]1.7e-3465.74Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P +NA+RNV +A+DRW+KANDKA+VY+LASM+D+LAKKHE  +TAKEIMD ++ +FG+ S+  RH ALKY++N  M EGTSVR+HVLDMMVHFN AE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQ
        A IDE+++
Subjt:  AFIDESSQ

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]4.8e-3766.39Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P SNA+  V +AYDRWIK+NDKAKVY+LAS+SD+LAKKHE T+T KEIMD +Q +FGQ S QARH ALK+++NSRM EG+SVR+HVL++MVHFN+AESNG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQ-NGLLNELEENSLP
          IDE SQ + +L  L +N LP
Subjt:  AFIDESSQ-NGLLNELEENSLP

TrEMBL top hitse value%identityAlignment
A0A5A7UYF5 Gag/pol protein4.9e-3558.82Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P SNA++    AYDRWIKAN+KA VY+LASMSD+LAKK+E   TAKEIMD ++G+FGQ     RH  +KYI+  RM EGTSVR+HVLDMM+HFNIAE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESS------QNGLLNELEENSLPRCESRCKAK
          IDE+       +NGLL++ E+NSL  C+S  + K
Subjt:  AFIDESS------QNGLLNELEENSLPRCESRCKAK

A0A6J1DFZ2 uncharacterized protein LOC1110200951.3e-3575.49Show/hide
Query:  NASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNGAFI
        NA+  V  AYDRWIKANDKAKVY+LAS+SD+LAKKHE TITAKEIMD +Q +FGQ S+QARH ALK+I+NSRM EG+SVR+HVL++MVHFN+AESNGA I
Subjt:  NASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNGAFI

Query:  DE
        DE
Subjt:  DE

A0A6J1DUZ9 uncharacterized protein LOC1110242941.1e-3465.74Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P  NA+RNV +A+DRW+KANDKA+VY+LASM+D+LAKKHE  +TAKEIMD ++ +FG+ S+  RH ALKY++N  M EGTSVR+HVLDMMVHFN AE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQ
        A IDE+++
Subjt:  AFIDESSQ

A0A6J1DWL0 uncharacterized protein LOC1110247342.3e-3766.39Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P SNA+  V +AYDRWIK+NDKAKVY+LAS+SD+LAKKHE T+T KEIMD +Q +FGQ S QARH ALK+++NSRM EG+SVR+HVL++MVHFN+AESNG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQ-NGLLNELEENSLP
          IDE SQ + +L  L +N LP
Subjt:  AFIDESSQ-NGLLNELEENSLP

A0A6J1DXQ5 uncharacterized protein LOC1110244578.3e-3565.74Show/hide
Query:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG
        P +NA+RNV +A+DRW+KANDKA+VY+LASM+D+LAKKHE  +TAKEIMD ++ +FG+ S+  RH ALKY++N  M EGTSVR+HVLDMMVHFN AE NG
Subjt:  PGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNG

Query:  AFIDESSQ
        A IDE+++
Subjt:  AFIDESSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGGCTCGAATGCGTCACGAAATGTTCATGATGCATATGATCGATGGATCAAAGCCAATGATAAGGCCAAGGTCTACATGCTGGCAAGTATGTCTGACATATTAGC
CAAGAAGCATGAGGGCACGATTACCGCCAAGGAAATCATGGATTATGTGCAGGGTATATTTGGACAACAGTCCACACAAGCCCGACATAATGCCCTAAAGTACATATTCA
ACTCGAGGATGCCAGAGGGTACATCTGTTCGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCAGAGTCAAATGGTGCTTTCATCGATGAGTCGAGCCAGAAC
GGACTTCTAAACGAGTTAGAAGAAAATTCTTTGCCAAGATGTGAATCCCGCTGTAAGGCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAGGCTCGAATGCGTCACGAAATGTTCATGATGCATATGATCGATGGATCAAAGCCAATGATAAGGCCAAGGTCTACATGCTGGCAAGTATGTCTGACATATTAGC
CAAGAAGCATGAGGGCACGATTACCGCCAAGGAAATCATGGATTATGTGCAGGGTATATTTGGACAACAGTCCACACAAGCCCGACATAATGCCCTAAAGTACATATTCA
ACTCGAGGATGCCAGAGGGTACATCTGTTCGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCAGAGTCAAATGGTGCTTTCATCGATGAGTCGAGCCAGAAC
GGACTTCTAAACGAGTTAGAAGAAAATTCTTTGCCAAGATGTGAATCCCGCTGTAAGGCAAAATGA
Protein sequenceShow/hide protein sequence
MPGSNASRNVHDAYDRWIKANDKAKVYMLASMSDILAKKHEGTITAKEIMDYVQGIFGQQSTQARHNALKYIFNSRMPEGTSVRDHVLDMMVHFNIAESNGAFIDESSQN
GLLNELEENSLPRCESRCKAK