; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017243 (gene) of Snake gourd v1 genome

Gene IDTan0017243
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationLG05:29190355..29201082
RNA-Seq ExpressionTan0017243
SyntenyTan0017243
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005488 - binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]3.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]3.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]3.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]3.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]6.3e-5765.75Show/hide
Query:  QDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSP
        +DL SPIFLLSNICNLVS+RLDSSN+VLW+FQL  IL+AHKL+ FIDGS P P +FL   V   D SSS       NPA+ +W+A DHALMTL+NA LS 
Subjt:  QDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSP

Query:  PALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLN
         ALAYVVGCD+++Q+W+TL KHYSS+SR ++VNLKS+LQ+I KKPG ++  YVQR+KELKDKL NV V VD EDL+IY LN
Subjt:  PALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

A0A5D3CLI6 T4.51.8e-5461.83Show/hide
Query:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA
        ++ +D  SPIFLLSNICNL+S+RLDS+N+VLW+FQL  IL+AHKL+ FIDG+ P PP+   TN     +SSSTS V  Q NP+Y+DW+A D ALMT+INA
Subjt:  ASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDV-LQVNPAYDDWVAIDHALMTLINA

Query:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
        TLSP ALAYVVG  ++KQ+W+ LAK YSS SR+++VNLKS+LQTI+KKP E++  Y++R+KE+KDKL NVS  ++ EDL+IY LNG
Subjt:  TLSPPALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG

A0A6J1E049 uncharacterized protein LOC1110251503.0e-5765.75Show/hide
Query:  QDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSP
        +DL SPIFLLSNICNLVS+RLDSSN+VLW+FQL  IL+AHKL+ FIDGS P P +FL   V   D SSS       NPA+ +W+A DHALMTL+NA LS 
Subjt:  QDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSP

Query:  PALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLN
         ALAYVVGCD+++Q+W+TL KHYSS+SR ++VNLKS+LQ+I KKPG ++  YVQR+KELKDKL NV V VD EDL+IY LN
Subjt:  PALAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.4e-1228.09Show/hide
Query:  LHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPA
        L++   L  N+ N+   +L S+NY++W  Q+H +   ++L  F+DGS   PP  + T+              +VNP Y  W   D  + + +   +S   
Subjt:  LHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPA

Query:  LAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVL
           V    TA Q+WETL K Y++ S  H+  L+++L+  + K  +T+  Y+Q +    D+L  +   +D ++ V  VL
Subjt:  LAYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-1132.76Show/hide
Query:  RLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPALAYVVGCDTAKQMWETL
        +L S+NY++W  Q+H +   ++L  F+DGS P PP  + T+ V            +VNP Y  W   D  + + I   +S      V    TA Q+WETL
Subjt:  RLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPALAYVVGCDTAKQMWETL

Query:  AKHYSSTSRAHIVNLK
         K Y++ S  H+  L+
Subjt:  AKHYSSTSRAHIVNLK

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.3e-0728.89Show/hide
Query:  IFLLSNICNLVSIRLD--SSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPAL-
        I+ +SNI + + + LD   SNY  W+        +  +   IDG+       LPTN          +DV        +W   D  +   +  TL+P    
Subjt:  IFLLSNICNLVSIRLD--SSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPAL-

Query:  AYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGE-TVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG
           V   T++ +W  +   + +   A  + L SEL+T  K  G+  V  Y +++K+L D L NV V V   +LV+YVLNG
Subjt:  AYVVGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGE-TVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTACACAGGATCTTCATTCTCCCATCTTCTTGTTGTCAAATATCTGTAATCTTGTGTCTATTCGTCTTGATTCGTCCAATTATGTCCTCTGGCAGTTTCAATT
ACACAAGATCTTGCGAGCTCACAAACTATTTCGCTTCATCGATGGGTCTTTCCCACCACCACCACAATTTCTGCCTACAAACGTGGTTTCGTTCGATAGTTCGTCTTCTA
CCTCCGATGTTCTTCAGGTTAATCCAGCGTATGACGACTGGGTTGCCATCGATCATGCTCTTATGACGTTAATCAACGCTACTCTCTCTCCGCCGGCTCTAGCTTATGTA
GTCGGTTGTGACACTGCAAAACAAATGTGGGAAACATTGGCCAAACATTACTCGTCAACTTCAAGGGCGCATATCGTCAATCTCAAGTCTGAGCTCCAGACTATTTTCAA
GAAACCAGGTGAGACAGTCCATTATTACGTTCAACGCGTTAAAGAATTGAAGGATAAGTTGGAAAATGTTTCGGTTGATGTTGATGCTGAAGATCTTGTGATATATGTTC
TGAACGGTACAATAGTTTTTCTTTTCTTGAAGGGTTGTTAA
mRNA sequenceShow/hide mRNA sequence
AACCTTTCTTCTCGAGCCGACGTGTCCGCTTGAGCGCTTCCTGGTCGCACGCGTCTACTAAGATCCGCGCGTCAGCTTGAGTTTCACTTCTCTCGCACGCACAAGGATCA
ATCTTCTTCTTCAGTATTCACAGAGCCCCTTCCCTTTAGGGTTCTTTCGTTTTTTTTTTCTTTCTCTCTATGGCGTCTACACAGGATCTTCATTCTCCCATCTTCTTGTT
GTCAAATATCTGTAATCTTGTGTCTATTCGTCTTGATTCGTCCAATTATGTCCTCTGGCAGTTTCAATTACACAAGATCTTGCGAGCTCACAAACTATTTCGCTTCATCG
ATGGGTCTTTCCCACCACCACCACAATTTCTGCCTACAAACGTGGTTTCGTTCGATAGTTCGTCTTCTACCTCCGATGTTCTTCAGGTTAATCCAGCGTATGACGACTGG
GTTGCCATCGATCATGCTCTTATGACGTTAATCAACGCTACTCTCTCTCCGCCGGCTCTAGCTTATGTAGTCGGTTGTGACACTGCAAAACAAATGTGGGAAACATTGGC
CAAACATTACTCGTCAACTTCAAGGGCGCATATCGTCAATCTCAAGTCTGAGCTCCAGACTATTTTCAAGAAACCAGGTGAGACAGTCCATTATTACGTTCAACGCGTTA
AAGAATTGAAGGATAAGTTGGAAAATGTTTCGGTTGATGTTGATGCTGAAGATCTTGTGATATATGTTCTGAACGGTACAATAGTTTTTCTTTTCTTGAAGGGTTGTTAA
AATGGTTCTAGAGGATGATGATTTGCAACATAAATAGGTTGAGAGCAAAGCAAATTCAAGGTTGATGATAGAATATCATGAGAAGTTCTTGGCCACAAAAACAACTAGCT
CGAGAGTTATATATTTTGATATTGAATAGTGAAACATTTTGTAATTTAATGAATAGTATTTGACAAATTTATATTTAGTGTCATCCCAAAGTTGATGTTCCTTACTATAT
TTATTTTGTGAACAATAGATTGTTATGTTTAATTTGTATCGATTTAAGTGGG
Protein sequenceShow/hide protein sequence
MASTQDLHSPIFLLSNICNLVSIRLDSSNYVLWQFQLHKILRAHKLFRFIDGSFPPPPQFLPTNVVSFDSSSSTSDVLQVNPAYDDWVAIDHALMTLINATLSPPALAYV
VGCDTAKQMWETLAKHYSSTSRAHIVNLKSELQTIFKKPGETVHYYVQRVKELKDKLENVSVDVDAEDLVIYVLNGTIVFLFLKGC