; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019464 (gene) of Snake gourd v1 genome

Gene IDTan0019464
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:56078353..56081496
RNA-Seq ExpressionTan0019464
SyntenyTan0019464
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]4.1e-2772.62Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP P+   NR VRDAYDRW++ANEKARVYILASIS+VLSKKHE + T +EIM SLQA+FGQPS+++ +DA+KYVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]2.7e-2672.62Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP  +  +N+ VRDA+DRW +ANEKARVYILASISDVLSKKHE + TA+EIM SLQA+FGQPS+S+ +DA+KYVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]1.2e-2673.81Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP P+   NR + DA+DRW +ANEKA+VYILASISD+LSKKHE MV AKEIM SLQA+FGQPSSS  +DA+KYVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

XP_038882260.1 uncharacterized protein LOC120073488 [Benincasa hispida]1.2e-2675Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP PSSTT + VRD YDRW+RANEKA VYILASISDVL+KKHE M+TA+EIM SLQ MFGQPSSSV ++A+KYVY + MK+
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

XP_038885833.1 uncharacterized protein LOC120076129 [Benincasa hispida]1.6e-2672.62Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP PS+T N+ V+D YDRW+RANEKAR YILASISDVL+KKHE M T +EIM SLQ MFGQPSSSV ++A+KYVY++RMK+
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

TrEMBL top hitse value%identityAlignment
A0A6J1CP29 uncharacterized protein LOC1110134174.1e-2566.67Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECP  P++  NR VR+A+DRW++AN+KARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+PSS++ ++ALKYVYN  MKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

A0A6J1DWG6 uncharacterized protein LOC1110250212.0e-2772.62Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP P+   NR VRDAYDRW++ANEKARVYILASIS+VLSKKHE + T +EIM SLQA+FGQPS+++ +DA+KYVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

A0A6J1DXP1 uncharacterized protein LOC1110254688.3e-2669.05Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP P+   NR VR+AYDRW++AN+KARVYILASISDVLSKKHE + TA+E+M SLQA+ GQP +S+ +DA++YVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

A0A6J1DXQ5 uncharacterized protein LOC1110244574.1e-2566.67Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECP  P++  NR VR+A+DRW++AN+KARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+PSS++ ++ALKYVYN  MKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

A0A6J1E205 uncharacterized protein LOC1110252581.3e-2672.62Show/hide
Query:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE
        EECPP  +  +N+ VRDA+DRW +ANEKARVYILASISDVLSKKHE + TA+EIM SLQA+FGQPS+S+ +DA+KYVYN RMKE
Subjt:  EECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISDVLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAAAAGGAGGTATGTGGACTTTGACCGGTTCGTTGACTTAGACCAGTTCAATAGTTTTTGGTCTGGTTCTAGTGGTTTTAGACCGATTCTGACTATTTCGAGGCT
GGTTTTTCCATTTTGGAGGTCCGTTTCATCAATTGGAGGCCCGATTCAGCTTTTTACAGCCCGGATCAACCTTTTGAAGGTCCGGTTCACACATTTGCAAGCTGAGGAAT
GTCCTCCCCCTCCTAGCTCGACAACAAACCGAATTGTTCGGGATGCTTATGACAGATGGATTAGGGCTAATGAGAAGGCCCGAGTTTATATCTTAGCCAGCATATCTGAT
GTGTTGTCTAAGAAACATGAGAGCATGGTCACCGCTAAGGAGATCATGGGATCATTACAGGCGATGTTCGGACAACCGTCCTCATCGGTCCATTATGATGCTCTCAAATA
CGTTTACAACTCCCGTATGAAGGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTAAAAGGAGGTATGTGGACTTTGACCGGTTCGTTGACTTAGACCAGTTCAATAGTTTTTGGTCTGGTTCTAGTGGTTTTAGACCGATTCTGACTATTTCGAGGCT
GGTTTTTCCATTTTGGAGGTCCGTTTCATCAATTGGAGGCCCGATTCAGCTTTTTACAGCCCGGATCAACCTTTTGAAGGTCCGGTTCACACATTTGCAAGCTGAGGAAT
GTCCTCCCCCTCCTAGCTCGACAACAAACCGAATTGTTCGGGATGCTTATGACAGATGGATTAGGGCTAATGAGAAGGCCCGAGTTTATATCTTAGCCAGCATATCTGAT
GTGTTGTCTAAGAAACATGAGAGCATGGTCACCGCTAAGGAGATCATGGGATCATTACAGGCGATGTTCGGACAACCGTCCTCATCGGTCCATTATGATGCTCTCAAATA
CGTTTACAACTCCCGTATGAAGGAGTGA
Protein sequenceShow/hide protein sequence
MIKRRYVDFDRFVDLDQFNSFWSGSSGFRPILTISRLVFPFWRSVSSIGGPIQLFTARINLLKVRFTHLQAEECPPPPSSTTNRIVRDAYDRWIRANEKARVYILASISD
VLSKKHESMVTAKEIMGSLQAMFGQPSSSVHYDALKYVYNSRMKE