; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022743 (gene) of Snake gourd v1 genome

Gene IDTan0022743
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG04:14207767..14211736
RNA-Seq ExpressionTan0022743
SyntenyTan0022743
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]6.7e-4356.08Show/hide
Query:  LETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVST
        +ET+Q +VQ TV++Q+ Q+ Q + + ++EA+YL+DF+KYDPR F+G S DP +AE WLS +E IFR M C EE +V C  FML+DDAFLWWEST+R +  
Subjt:  LETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVST

Query:  DGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF
         GG VTW QF+EAF++++YP    YRKQ EFL L+ + RSVE Y+REF
Subjt:  DGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF

XP_023520277.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111783585 [Cucurbita pepo subsp. pepo]3.3e-3445.51Show/hide
Query:  PLVAPAMGQSILVMPT--MLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAF
        PL AP         PT  ++  +Q ++Q   A+Q A        +T+EARYL+DF++ DPR F G+S DPTVA++WL SIE++F L NCPE +RV C  F
Subjt:  PLVAPAMGQSILVMPT--MLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAF

Query:  MLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF
        MLRDDA LWW++T+  +S +GG ++W +F++AF  ++YP   + RKQ+EF QL    R+V +Y +EF
Subjt:  MLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF

XP_023529765.1 uncharacterized protein LOC111792490 [Cucurbita pepo subsp. pepo]6.1e-3658.87Show/hide
Query:  ATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARY
        +T  ARY KDF++YDPR FNG+S DPTVAE+W+ S+ENIFRLMNCP++ +V   +FML+ DA LWWE T+  +S +GG++TW +FREAFW K+    AR 
Subjt:  ATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARY

Query:  RKQEEFLQLRHNKRSVESYEREFM
        RKQ+EF QL  + RSV +Y REFM
Subjt:  RKQEEFLQLRHNKRSVESYEREFM

XP_023537968.1 uncharacterized protein LOC111798850 [Cucurbita pepo subsp. pepo]8.5e-3850Show/hide
Query:  PLVAPAMGQSILVMPTMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFML
        P   P +G     +  M E   AM+Q  VA Q AQ+       T E RYL+DF++YDPR FNG+S D  VAELWLSSIE IF  MNCP++ +V   +FML
Subjt:  PLVAPAMGQSILVMPTMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFML

Query:  RDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREFM
        RDDA +WWE T+  +S DGG+++W QF+EAFW ++Y   AR +KQ+EF QL  N RSV +Y ++F+
Subjt:  RDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREFM

XP_038880446.1 uncharacterized protein LOC120072105 [Benincasa hispida]9.7e-3454.1Show/hide
Query:  TVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYR
        ++EA++L+DF+KYDPRPF+ S GDPT AE+WLSSIE IFR M CPEE+++ C  FML D+  +WW S ++ + T G L TW QF+E F+ K++    RY 
Subjt:  TVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYR

Query:  KQEEFLQLRHNKRSVESYEREF
        KQ +FL LR    SVE YE+EF
Subjt:  KQEEFLQLRHNKRSVESYEREF

TrEMBL top hitse value%identityAlignment
A0A6J1DSJ6 uncharacterized protein LOC1110235123.2e-4356.08Show/hide
Query:  LETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVST
        +ET+Q +VQ TV++Q+ Q+ Q + + ++EA+YL+DF+KYDPR F+G S DP +AE WLS +E IFR M C EE +V C  FML+DDAFLWWEST+R +  
Subjt:  LETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVST

Query:  DGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF
         GG VTW QF+EAF++++YP    YRKQ EFL L+ + RSVE Y+REF
Subjt:  DGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF

A0A6J1EPR7 uncharacterized protein LOC1114345292.0e-3245.57Show/hide
Query:  GQSILVMPTMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLW
        GQ   ++PT+ ++         A Q     QG V+A  E++Y   F++ DP+ F   S DP VAELWLS+I+ IFR M C EE+R+ CV ++LR+DA LW
Subjt:  GQSILVMPTMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLW

Query:  WESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF
        W+S  R ++ D   +TW QFR+AF RK++    RY+KQ EFL +    RSVE YEREF
Subjt:  WESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF

A0A6J1ET54 uncharacterized protein LOC111435758 isoform X49.8e-3242.69Show/hide
Query:  IPLVAPAMGQSILVMP----TMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPC
        +PL+ PA+    L  P     ++E +QA+VQ  + +Q A       ++T +A+ L+DF++ DP+ FNGSS DPT  +LWL SIE +F L+NCP++ +V C
Subjt:  IPLVAPAMGQSILVMP----TMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPC

Query:  VAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREFM
          FMLRDDA LWW+ST   +S +G +++WA+F++AF  ++Y    + R Q++F QL    RSV +Y REF+
Subjt:  VAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREFM

A0A6J1EUD4 uncharacterized protein LOC1114367147.5e-3252.07Show/hide
Query:  VEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRK
        +EARYL++F++ DPR F G+S DPTVA++WL SIE++F L NCPE +RV C  FMLRDDA LWW++T+  +  +G  V+W +F++AF  ++YP   + RK
Subjt:  VEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRK

Query:  QEEFLQLRHNKRSVESYEREF
        Q+EF QL    R+V +Y REF
Subjt:  QEEFLQLRHNKRSVESYEREF

A0A6J1FDR9 uncharacterized protein LOC1114444631.8e-3345.51Show/hide
Query:  PLVAPAMGQSILVMPT--MLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAF
        P+  PA  Q+    PT  +++ +Q ++Q   A+Q A        +T+EA+YL+DF++ DPR F G+S DPTVA++WL SIE +F L NCPE +RV C  F
Subjt:  PLVAPAMGQSILVMPT--MLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPCVAF

Query:  MLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF
        MLR DA LWW++T+  +S +GG V+W +F+ AF  ++YP   + RKQ+EF QL     SV++Y REF
Subjt:  MLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGATCCGCCTCAGGCTCCACATAATAACCAAGGAAATATCCCTCTAGTCGCACCGGCTATGGGGCAATCCATCCTCGTTATGCCTACGATGCTGGAGACTATGCA
AGCGATGGTACAAGCCACTGTGGCGTCTCAATTAGCTCAGATGGGTCAGGGGCAAGTAAATGCGACGGTGGAAGCTCGGTACCTGAAGGATTTCAGGAAATACGACCCTC
GACCGTTCAATGGATCTTCTGGGGACCCCACTGTGGCAGAGTTGTGGTTGTCGTCGATAGAGAATATCTTTCGGCTCATGAATTGTCCTGAAGAATACAGGGTGCCTTGT
GTGGCGTTCATGCTGAGAGATGACGCCTTCTTGTGGTGGGAGTCAACCCAGAGAACCGTGAGCACCGACGGAGGCCTCGTGACATGGGCCCAATTTAGGGAGGCTTTCTG
GCGAAAATTTTACCCTGTGGCAGCCCGCTACAGGAAGCAAGAAGAATTCTTACAGCTTCGCCATAACAAGCGGTCTGTGGAGAGCTATGAGCGGGAGTTTATGATACAAA
TATATCGTCTACATATACCTCCTTATTATTCCTGGTGA
mRNA sequenceShow/hide mRNA sequence
CTTCCCAAGGCTCATTCTTCATCTCTCTCACACCCTTTTCACGAAAGCCATCTTCAGTCACCGCCTCATCTTCAGATGCCGTCTCCTCGCAGTCGCCGTTGGCATGTTGT
CGCGCCGGCACCCTTCACAAAGGAGATTCGCCTGCCGCCGGACGTCGAGGTTAGATCTTCCTCTCTCGCGCAGAAGTCCGTCGAGTCGTGGTTGCAGTGGTCGTCGCGCT
AAATCTGGAAGCTCGTTGGGTTCCTTTGTTTTGTCGTCGTCGCCAGACGTGTTGCCGATCGAAGAAGCCACCGTCTGACTCCCGAAAACCTACCGCACCGCTGAAGTCCG
AAGCTTTTGGTCCAGCCGTGCAACCCTAGAGCCGATGTTGCCCGCTCCGCCGTCGTCGGTTTCAGGAGATCTGTTGAGGGTCACTGACCCAATAACATAAAGGGGACAAT
TGTGGTCACGTTTCAGCAAGAGATTTAATTCAAGGTTCAAAAGGAACGTCCTGCTAGGAGCTAAGATTCTTGAATTTTGGAGCTTACGCTAAGGTAAGGGCAAAGGCAAG
CTGGTCGGATGACGAGAGGGCGCTACAAAAGCCATGGGATGAAAAGCTACTTCCGCACTTCATGTTTTCACGTTTTTTTCCCTTTTCTAGTATTCCATTTATTGAGTAGT
AAACTCTGTCAGAATTTGGAGGTCAGGATTAAGTTGTTTACTTGTTTTTTTAATAAGCTTGTTTTAAGCACTTTTGTTTCGGGATCCCAATAAAGGAGTTGTTTTGTTTT
CTCTAGTTTTACATGGTTATTGAACTTTAAAATTTGTGGCCTCAAGTCATGCATGTTAAGTAAAGACCCTAGGAAGAGAGTCATAAAGTTAGGGTCCTTACAGTTGGTAT
CAGAACCCAAGGTTATGAGTTTTGTAGACTTACTTAAGTTATAGGTAACGAATCTCATGGCTAGTAGTAACTCCCAGTCATCACCAGGTTTGCTCCAAGATCTTTATTAC
GTAAATTTTATTCTTTTGAAGGCAAGAATGTTTATAAGCAAGCATGTGCATGTATGATTTATGCTAGATATCCATGTGTTCCTTGGTTTGACTGCCTAATGTTAAGAGCT
GAGTATGGATCGTTATGTTGTAGTATTAATGCCACCTAGAGGAAGAGGGTGAGGCAGGGGTCGAGGCAGGGAGCGAAGGGTTGGAGTAACTGGCCCGCCCCCAGAACTGC
CCATGGAGCAGTAAGAGGTTCCCCCGCCTATGCCGGATCCGCCTCAGGCTCCACATAATAACCAAGGAAATATCCCTCTAGTCGCACCGGCTATGGGGCAATCCATCCTC
GTTATGCCTACGATGCTGGAGACTATGCAAGCGATGGTACAAGCCACTGTGGCGTCTCAATTAGCTCAGATGGGTCAGGGGCAAGTAAATGCGACGGTGGAAGCTCGGTA
CCTGAAGGATTTCAGGAAATACGACCCTCGACCGTTCAATGGATCTTCTGGGGACCCCACTGTGGCAGAGTTGTGGTTGTCGTCGATAGAGAATATCTTTCGGCTCATGA
ATTGTCCTGAAGAATACAGGGTGCCTTGTGTGGCGTTCATGCTGAGAGATGACGCCTTCTTGTGGTGGGAGTCAACCCAGAGAACCGTGAGCACCGACGGAGGCCTCGTG
ACATGGGCCCAATTTAGGGAGGCTTTCTGGCGAAAATTTTACCCTGTGGCAGCCCGCTACAGGAAGCAAGAAGAATTCTTACAGCTTCGCCATAACAAGCGGTCTGTGGA
GAGCTATGAGCGGGAGTTTATGATACAAATATATCGTCTACATATACCTCCTTATTATTCCTGGTGATACAAATAGTATAAAAGGGTTTATAGTAAAGGGATGAGGTTGG
GTACCTTATGCTGATGATACTATGGATATGGCCCACTTTGTATATGATAGAAACACAATGATCTAATGTGTTCATGTAGACGGCATGTGAGTGGGGATATCCTATATAAT
GAGATTGCATAAGACCGGACTACGAAATAGTAACCAATAGATGTAACTCCGTTGACTAGTTAGATTTCTATTTCAATAGGATGACCTAGGCGACTTGATCTCAATCTTGA
GCAGGTTATGAACTCCTGTTTGCAAGGGATTGTCCTTGGACTAGTATGGGTGAGAGTGGCCTGAGACAGCGACTCAATAAGCCTACCTTCTTGGGGACAAAACTGCGCAG
ATAGCTGGGGACTTAACTTAGCAAAATGGAATCACTCTTTCCCGACTTTAGAGTAAGAAAATGAGTGTTCCCTTAAGTGGTGCCTCTGGAACTTGAAAAAAGGGTCATAT
CCTCTCACTGGCCCGAGAGTGATTTCTGTTTATTGGTATGACCATAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGTTAAAGATGTAACCTAGAGGTAAAATG
GTAATTTGACCCAACTGGCGTTACGAACACTCATGAAGGATTGACTTGTTGTTATTGGTCGTTATCCATGGACACAAAAATATATCTGCAGTGAGAAGAGTGCAACCGCG
GGACTTTAGTGGATTGTTCCCGTAGTTAACGAATGTTGATTAGGGATAATGAGATTAACCTAATAATCTTGTATCGTTGGAGCTTATGATCTGTAGGTCCATTAGGTCCA
CTTCCTAGCTTGTAAAGGGTTCTTAGATTATTAGATATTGATTAGAATTTGAAGTGTGCAAATTTACTTTGGGAATTAGCTTAATGTATGGAGATACATTATAATATAAA
GTTTTTATGTACATTAATACTTTATAGTATAAATTTAATTTGAATATGATTCAAAATAAATTTATGAGATAATTGAATATTTGAATGAGTTCAAATATTGTTTTAAATAT
GAATAGAATTCATATTTAAAATTATAGGTTAAAATTTAATGTGCATGAGATGTACATTAAAACTATAGGTTAAAGGGATT
Protein sequenceShow/hide protein sequence
MPDPPQAPHNNQGNIPLVAPAMGQSILVMPTMLETMQAMVQATVASQLAQMGQGQVNATVEARYLKDFRKYDPRPFNGSSGDPTVAELWLSSIENIFRLMNCPEEYRVPC
VAFMLRDDAFLWWESTQRTVSTDGGLVTWAQFREAFWRKFYPVAARYRKQEEFLQLRHNKRSVESYEREFMIQIYRLHIPPYYSW