; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005567 (gene) of Snake gourd v1 genome

Gene IDTan0005567
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationLG08:51756660..51761728
RNA-Seq ExpressionTan0005567
SyntenyTan0005567
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047638.1 retrotransposon protein putative ty3-gypsy sub-class [Cucumis melo var. makuwa]8.9e-1528.18Show/hide
Query:  SSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLEIAYLKN------QMRI------------------
        SSK R+V K+N L+D F     +SK  ++ D+MSVMMAD+ +   M EMERKI+ LMK VEERD EI  L+N       M +                  
Subjt:  SSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLEIAYLKN------QMRI------------------

Query:  -ARPLD----------------------------------------------------------------------------------------------
         A  LD                                                                                              
Subjt:  -ARPLD----------------------------------------------------------------------------------------------

Query:  -------------LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITK
                     LVQFG+FEPI+V    E   + P++        QEK K  E++NEGW +VTRRKKR+ +  +KES  +R++ R + +QK K+K+  +
Subjt:  -------------LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITK

Query:  KPVYAIREDENLFRLRQLVTLEEFFPKNFL
        KP    +ED++  + ++LVTL +FFP  FL
Subjt:  KPVYAIREDENLFRLRQLVTLEEFFPKNFL

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-1444.63Show/hide
Query:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTPDA-IANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS
        KK      +  N Y   I RSR   I +++   + +A  I+K +  S K  +V+K+NPL+D    +  +SK  ++ DVMSVMMAD+  +  MAEMERKI+
Subjt:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTPDA-IANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS

Query:  LLMKAVEERDLEIAYLKNQMR
         LMKAVEERD EI  L+ QMR
Subjt:  LLMKAVEERDLEIAYLKNQMR

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-0829.74Show/hide
Query:  IRSRGV------EIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPL-FDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLE
        I SRG       ++++D+     A K+VK    + K+ +VV   PL F +      + KD  N   ++ M+  +   + +   E K S     V++ +  
Subjt:  IRSRGV------EIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPL-FDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLE

Query:  IAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQI
          Y K    I+ P++  ++   + +I+ +  E       E  +  +  QEK +  E+++EGWTVVTRRKKR+ +  QKESRL+ +++R + +QK K+K+ 
Subjt:  IAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQI

Query:  TKKPVYAIREDENLFRLRQLVTLEEFFPKNFL
        T+K      +D++  R +++VTL +FFP  FL
Subjt:  TKKPVYAIREDENLFRLRQLVTLEEFFPKNFL

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.8e-1446.28Show/hide
Query:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTP-DAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS
        KK         +TY   I RSR   I + +    AIA  I+K +  S K  +V+K+NPL++ +  A  +S   ++ DVMSVMMADV  +  MAEMERKI+
Subjt:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTP-DAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS

Query:  LLMKAVEERDLEIAYLKNQMR
        LLMK V+ERD EIA LK QM+
Subjt:  LLMKAVEERDLEIAYLKNQMR

TYK05006.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-1538.96Show/hide
Query:  KSIIEKKKPHAREIGDNTYCRGIIRSR--GVEIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAE
        KS+  KK      + ++ Y   I RSR  G+ I+E      +A  I+K +  S K  +V+K+NPL+D    A  +SK  ++ DVMSVMMAD+ ++  MAE
Subjt:  KSIIEKKKPHAREIGDNTYCRGIIRSR--GVEIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAE

Query:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSM
        MERKI+ LM  VEERD EI  L+ QM+   P++     S + ++V   D+  +M
Subjt:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSM

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]6.6e-1042.74Show/hide
Query:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF
        LVQFG+FEPI+V    E S  +P          Q + +  E+++EGW VVT RKKRQ    Q+ESR +++++R + +QK K+K+ T K      ED N  
Subjt:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF

Query:  RLRQLVTLEEFFPKNFL
        R ++LVTL +F PK+FL
Subjt:  RLRQLVTLEEFFPKNFL

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]9.8e-1446.28Show/hide
Query:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTP-DAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS
        KK         +TY   I RSR   I + +    AIA  I+K +  S K  +V+K+NPL++ +  A  +S   ++ DVMSVMMADV  +  MAEMERKI+
Subjt:  KKKPHAREIGDNTYCRGIIRSRGVEIKEDRTP-DAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKIS

Query:  LLMKAVEERDLEIAYLKNQMR
        LLMK V+ERD EIA LK QM+
Subjt:  LLMKAVEERDLEIAYLKNQMR

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]6.6e-1042.74Show/hide
Query:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF
        LVQFG+FEPI+V    E S  +P          Q + +  E+++EGW VVT RKKRQ    Q+ESR +++++R + +QK K+K+ T K      ED N  
Subjt:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF

Query:  RLRQLVTLEEFFPKNFL
        R ++LVTL +F PK+FL
Subjt:  RLRQLVTLEEFFPKNFL

TrEMBL top hitse value%identityAlignment
A0A5A7TPD2 Ty3-gypsy retrotransposon protein1.1e-1343.1Show/hide
Query:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF
        L+QFGS EP++++ + E      Q    Q    +E+ KQ +D +EGWT+VTRRKKR+Q+++QKESR +R ++R   SQ++K ++  +K +  I E E L 
Subjt:  LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKKPVYAIREDENLF

Query:  RLRQLVTLEEFFPKNF
        R RQ +TL++FF +NF
Subjt:  RLRQLVTLEEFFPKNF

A0A5A7TRQ6 RNase H domain-containing protein1.8e-1336.24Show/hide
Query:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRL
        +++KI L +  V + +     +++  R++    L+QFGS EP++++ + E    + Q    Q    +E+ KQ ++  EGWT+VTRRKKR+QS+ QKES  
Subjt:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRL

Query:  FRHHKRKSMSQKKKRKQITKKPVYAIREDENLFRLRQLVTLEEFFPKNF
        +R +  K  SQ++  ++  +K +  I E E L R R+L+ L++FFPKNF
Subjt:  FRHHKRKSMSQKKKRKQITKKPVYAIREDENLFRLRQLVTLEEFFPKNF

A0A5A7TXJ5 Retrotransposon protein putative ty3-gypsy sub-class4.3e-1528.18Show/hide
Query:  SSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLEIAYLKN------QMRI------------------
        SSK R+V K+N L+D F     +SK  ++ D+MSVMMAD+ +   M EMERKI+ LMK VEERD EI  L+N       M +                  
Subjt:  SSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKAVEERDLEIAYLKN------QMRI------------------

Query:  -ARPLD----------------------------------------------------------------------------------------------
         A  LD                                                                                              
Subjt:  -ARPLD----------------------------------------------------------------------------------------------

Query:  -------------LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITK
                     LVQFG+FEPI+V    E   + P++        QEK K  E++NEGW +VTRRKKR+ +  +KES  +R++ R + +QK K+K+  +
Subjt:  -------------LVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITK

Query:  KPVYAIREDENLFRLRQLVTLEEFFPKNFL
        KP    +ED++  + ++LVTL +FFP  FL
Subjt:  KPVYAIREDENLFRLRQLVTLEEFFPKNFL

A0A5D3C3J8 Ty3-gypsy retrotransposon protein1.1e-1538.96Show/hide
Query:  KSIIEKKKPHAREIGDNTYCRGIIRSR--GVEIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAE
        KS+  KK      + ++ Y   I RSR  G+ I+E      +A  I+K +  S K  +V+K+NPL+D    A  +SK  ++ DVMSVMMAD+ ++  MAE
Subjt:  KSIIEKKKPHAREIGDNTYCRGIIRSR--GVEIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAE

Query:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSM
        MERKI+ LM  VEERD EI  L+ QM+   P++     S + ++V   D+  +M
Subjt:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSM

A0A5D3D5V8 RNase H domain-containing protein1.8e-1336.24Show/hide
Query:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRL
        +++KI L +  V + +     +++  R++    L+QFGS EP++++ + E    + Q    Q    +E+ KQ ++  EGWT+VTRRKKR+QS+ QKES  
Subjt:  MERKISLLMKAVEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRL

Query:  FRHHKRKSMSQKKKRKQITKKPVYAIREDENLFRLRQLVTLEEFFPKNF
        +R +  K  SQ++  ++  +K +  I E E L R R+L+ L++FFPKNF
Subjt:  FRHHKRKSMSQKKKRKQITKKPVYAIREDENLFRLRQLVTLEEFFPKNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGCATTATAGAGAAGAAGAAACCTCATGCAAGAGAAATTGGAGATAATACCTATTGTAGAGGAATTATTCGTTCTCGGGGGGTTGAGATAAAGGAGGACCGAAC
CCCTGATGCGATTGCAAATAAGATCGTCAAGTTGATCGAAGGATCCTCCAAGGATAGAGTGGTCGTCAAAGATAACCCGCTGTTTGACCAGTTTACCCCTGCTATCGGTC
AATCAAAGGACGCATCGAATCAAGATGTGATGTCTGTGATGATGGCCGATGTGGAATCCGACGAAAGGATGGCAGAGATGGAGAGAAAGATTAGTCTCTTGATGAAGGCA
GTCGAAGAAAGGGATTTAGAGATTGCCTACTTGAAGAATCAAATGCGAATCGCGAGACCACTTGACCTAGTTCAGTTCGGATCCTTTGAACCTATCATTGTGTGGATGAA
TGATGAACCATCGAGTATGAATCCTCAAGAGGGAGGCATCCAAAAGCAGTACGTTCAAGAAAAGAATAAGCAGACCGAAGATGAAAACGAAGGTTGGACAGTCGTGACTC
GTCGCAAGAAGCGACAACAAAGTTACGCACAGAAGGAATCGCGACTATTTCGACACCATAAGAGAAAAAGTATGTCGCAAAAGAAGAAAAGAAAACAGATCACGAAGAAG
CCTGTTTACGCCATAAGAGAAGACGAAAACCTCTTTCGCCTACGACAACTGGTAACTTTGGAGGAATTCTTCCCAAAGAATTTCCTAAACTTGAAGATGACCGATTGGAA
GCAAATTTGCCTAAGAGTCGAACGAAAGATGGGTTTGACCCTAAGGCATATAAACTCCTATCAAAGGCAGGATACGACTTCACAACTCACACTGAGTTCAAAAGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGCATTATAGAGAAGAAGAAACCTCATGCAAGAGAAATTGGAGATAATACCTATTGTAGAGGAATTATTCGTTCTCGGGGGGTTGAGATAAAGGAGGACCGAAC
CCCTGATGCGATTGCAAATAAGATCGTCAAGTTGATCGAAGGATCCTCCAAGGATAGAGTGGTCGTCAAAGATAACCCGCTGTTTGACCAGTTTACCCCTGCTATCGGTC
AATCAAAGGACGCATCGAATCAAGATGTGATGTCTGTGATGATGGCCGATGTGGAATCCGACGAAAGGATGGCAGAGATGGAGAGAAAGATTAGTCTCTTGATGAAGGCA
GTCGAAGAAAGGGATTTAGAGATTGCCTACTTGAAGAATCAAATGCGAATCGCGAGACCACTTGACCTAGTTCAGTTCGGATCCTTTGAACCTATCATTGTGTGGATGAA
TGATGAACCATCGAGTATGAATCCTCAAGAGGGAGGCATCCAAAAGCAGTACGTTCAAGAAAAGAATAAGCAGACCGAAGATGAAAACGAAGGTTGGACAGTCGTGACTC
GTCGCAAGAAGCGACAACAAAGTTACGCACAGAAGGAATCGCGACTATTTCGACACCATAAGAGAAAAAGTATGTCGCAAAAGAAGAAAAGAAAACAGATCACGAAGAAG
CCTGTTTACGCCATAAGAGAAGACGAAAACCTCTTTCGCCTACGACAACTGGTAACTTTGGAGGAATTCTTCCCAAAGAATTTCCTAAACTTGAAGATGACCGATTGGAA
GCAAATTTGCCTAAGAGTCGAACGAAAGATGGGTTTGACCCTAAGGCATATAAACTCCTATCAAAGGCAGGATACGACTTCACAACTCACACTGAGTTCAAAAGTCTAA
Protein sequenceShow/hide protein sequence
MKSIIEKKKPHAREIGDNTYCRGIIRSRGVEIKEDRTPDAIANKIVKLIEGSSKDRVVVKDNPLFDQFTPAIGQSKDASNQDVMSVMMADVESDERMAEMERKISLLMKA
VEERDLEIAYLKNQMRIARPLDLVQFGSFEPIIVWMNDEPSSMNPQEGGIQKQYVQEKNKQTEDENEGWTVVTRRKKRQQSYAQKESRLFRHHKRKSMSQKKKRKQITKK
PVYAIREDENLFRLRQLVTLEEFFPKNFLNLKMTDWKQICLRVERKMGLTLRHINSYQRQDTTSQLTLSSKV