; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003279 (gene) of Snake gourd v1 genome

Gene IDTan0003279
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationLG09:9767552..9770768
RNA-Seq ExpressionTan0003279
SyntenyTan0003279
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW13148.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.1e-4134.68Show/hide
Query:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG
        M+ ++WNV+GLGS  KR ++K  ++ +NP +V++QETK    D   + ++W+  +  W  L A+ ++GGILI+W+  +    EV+ G +S+S+   L   
Subjt:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG

Query:  FAFTVNKRYGPSASGFHEDFWSELNDIAGLGDVCLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPG
            ++  YGP++    +DFW EL DI GL +  +R+            TSDH+P+ +      WGP PFRFEN WLQ+++ ++   DWW   +  GW G
Subjt:  FAFTVNKRYGPSASGFHEDFWSELNDIAGLGDVCLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPG

Query:  HGFMMKLKGLKIVLKQWIK-SNNPRASLFSSLMAQLHSLDSLADAGPL
        H FM +L+ +K  LK+W K S         S++  L + D++   G L
Subjt:  HGFMMKLKGLKIVLKQWIK-SNNPRASLFSSLMAQLHSLDSLADAGPL

RVW15385.1 putative ribonuclease H protein [Vitis vinifera]9.3e-4237.24Show/hide
Query:  WNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGFAFTV
        WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  +F +
Subjt:  WNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGFAFTV

Query:  NKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQ
           YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN WL 
Subjt:  NKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQ

Query:  NSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
        +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  NSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

RVW55793.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.4e-4237.04Show/hide
Query:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF
        + L+WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  
Subjt:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF

Query:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN
        +F +   YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN
Subjt:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN

Query:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
         WL +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

RVW83303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.4e-4237.04Show/hide
Query:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF
        + L+WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  
Subjt:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF

Query:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN
        +F +   YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN
Subjt:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN

Query:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
         WL +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]3.9e-5638.7Show/hide
Query:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG
        M+FLTWNV+GL SWKK +LIK+ I   NP +V+LQETKL+ +D  ++K++WS+  I W+ LDA+  A GILILWN+PD    E+I G++SL+I+  LSDG
Subjt:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG

Query:  FAFTVNKRYGPSASGFHEDFWSELNDIAGL--------GDV-----------------------------------------------------------
        F F V+  YGPS + FH  FW EL D++ L        GD                                                            
Subjt:  FAFTVNKRYGPSASGFHEDFWSELNDIAGL--------GDV-----------------------------------------------------------

Query:  -CLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQWIKSN-NPRASLFSSL
         C+ K G  I +R+ R TSDH+P+ L FG   WG  PFRFEN WL + + +  +  WW    L GWPGHG MMKLK LK  +K WI  +     S    L
Subjt:  -CLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQWIKSN-NPRASLFSSL

Query:  MAQLHSLDSLADAGPLTVVHSTA
           ++SLD L  + P+T   S A
Subjt:  MAQLHSLDSLADAGPLTVVHSTA

TrEMBL top hitse value%identityAlignment
A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein1.0e-4134.68Show/hide
Query:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG
        M+ ++WNV+GLGS  KR ++K  ++ +NP +V++QETK    D   + ++W+  +  W  L A+ ++GGILI+W+  +    EV+ G +S+S+   L   
Subjt:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG

Query:  FAFTVNKRYGPSASGFHEDFWSELNDIAGLGDVCLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPG
            ++  YGP++    +DFW EL DI GL +  +R+            TSDH+P+ +      WGP PFRFEN WLQ+++ ++   DWW   +  GW G
Subjt:  FAFTVNKRYGPSASGFHEDFWSELNDIAGLGDVCLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPG

Query:  HGFMMKLKGLKIVLKQWIK-SNNPRASLFSSLMAQLHSLDSLADAGPL
        H FM +L+ +K  LK+W K S         S++  L + D++   G L
Subjt:  HGFMMKLKGLKIVLKQWIK-SNNPRASLFSSLMAQLHSLDSLADAGPL

A0A438BWL9 Putative ribonuclease H protein4.5e-4237.24Show/hide
Query:  WNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGFAFTV
        WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  +F +
Subjt:  WNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGFAFTV

Query:  NKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQ
           YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN WL 
Subjt:  NKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQ

Query:  NSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
        +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  NSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

A0A438F756 LINE-1 retrotransposable element ORF2 protein7.0e-4337.04Show/hide
Query:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF
        + L+WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  
Subjt:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF

Query:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN
        +F +   YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN
Subjt:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN

Query:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
         WL +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

A0A438HFR2 Transposon TX1 uncharacterized 149 kDa protein7.0e-4337.04Show/hide
Query:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF
        + L+WN +GLGS KKR  +++ +  QNP +V+LQETK    D  ++ +IW    + W  L A  ++GGI+ILW+   F+  E + G +S+++ +   +  
Subjt:  RFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGF

Query:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN
        +F +   YGP+ + + EDFW EL D+ GL                     GD    V +R+F   I     + R TSDH P+ L      WGP PFRFEN
Subjt:  AFTVNKRYGPSASGFHEDFWSELNDIAGL---------------------GD----VCLRKFGNAI--FRRIDRLTSDHYPLALSFGDIEWGPCPFRFEN

Query:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW
         WL +   ++   DWWQ+  +EGW GH FM KLK +K  LK+W
Subjt:  AWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQW

A0A6J1E2G6 uncharacterized protein LOC1110254051.9e-5638.7Show/hide
Query:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG
        M+FLTWNV+GL SWKK +LIK+ I   NP +V+LQETKL+ +D  ++K++WS+  I W+ LDA+  A GILILWN+PD    E+I G++SL+I+  LSDG
Subjt:  MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDG

Query:  FAFTVNKRYGPSASGFHEDFWSELNDIAGL--------GDV-----------------------------------------------------------
        F F V+  YGPS + FH  FW EL D++ L        GD                                                            
Subjt:  FAFTVNKRYGPSASGFHEDFWSELNDIAGL--------GDV-----------------------------------------------------------

Query:  -CLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQWIKSN-NPRASLFSSL
         C+ K G  I +R+ R TSDH+P+ L FG   WG  PFRFEN WL + + +  +  WW    L GWPGHG MMKLK LK  +K WI  +     S    L
Subjt:  -CLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQWIKSN-NPRASLFSSL

Query:  MAQLHSLDSLADAGPLTVVHSTA
           ++SLD L  + P+T   S A
Subjt:  MAQLHSLDSLADAGPLTVVHSTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTTTTAACTTGGAATGTTCAAGGGTTGGGTTCCTGGAAGAAGAGGTCTTTGATTAAAAAGACTATTCAACTTCAAAATCCGGGTATCGTGTTGTTACAGGAAAC
TAAACTTACTGCAATCGATTCTTCTATGATAAAAACAATCTGGAGTTCCTCTCACATTGGATGGACGACACTTGACGCCACTAATTCGGCAGGTGGCATCCTTATTCTTT
GGAATGAACCAGATTTTTCGGTTCTGGAGGTTATCCGAGGCCTTTATTCTCTTTCAATTCACGTTTTATTATCTGATGGTTTTGCTTTTACAGTTAACAAGAGATATGGT
CCTTCAGCCTCAGGTTTTCACGAGGATTTCTGGAGTGAATTGAATGATATTGCAGGCTTGGGAGATGTTTGTCTCAGGAAATTTGGTAATGCTATTTTCAGACGTATTGA
CCGACTCACTTCTGACCATTACCCTTTGGCATTATCCTTTGGTGATATAGAATGGGGGCCTTGTCCCTTTCGCTTTGAGAATGCGTGGCTTCAAAACTCGTCTATTCGTC
AGGTGGTTAATGATTGGTGGCAACAAAACAGACTGGAAGGTTGGCCAGGACATGGCTTCATGATGAAATTGAAAGGGCTCAAAATTGTACTCAAACAGTGGATTAAATCC
AATAACCCACGTGCTTCTCTTTTTTCTTCTCTTATGGCCCAGTTGCATTCTCTAGATAGCTTAGCTGATGCTGGCCCTCTTACGGTTGTTCATTCGACTGCTCATAAATC
ACCTTGGCGGAATATTTCACTATTGTCGAACTTGGTTTCAAGTAATGTTTCTCGTGTTTTAGGTAATGGTTGTCATACTTTATTTTGGAAGGATTCTTGGCTCAGTTGCG
GACCTTGTGCATTAGCTTTCCCTCGTTTATTTCGTCTCACTACTAATCCAGATAGTTTGGTGGTTAACCTTTGGAATGAAGTTACAGAAGCTTGGGATTTGTTACCTCGT
CGTAATCTAAATGAACTAGAAATAGAGGAATGGACTACTCTATCTCAACTCTTGTCTTTAGTTAGACTTCGGAACTTCCCTGATTCTTGGTCTTGGTCCCTTGATCCTTC
TGGTTCCTTTTCGGTTAGTTCGCTCATGGATTCTCTGGTTAGTGAGTCTGATCCCCGGTTAAAAGACTTGTATACTGCTATTTGGAAGGACATGTATCCAAAGAAGGTCA
AAATATTCCTTTGGGAACTCAGTTTGGGAGCAATAAATACAAATGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTTTTAACTTGGAATGTTCAAGGGTTGGGTTCCTGGAAGAAGAGGTCTTTGATTAAAAAGACTATTCAACTTCAAAATCCGGGTATCGTGTTGTTACAGGAAAC
TAAACTTACTGCAATCGATTCTTCTATGATAAAAACAATCTGGAGTTCCTCTCACATTGGATGGACGACACTTGACGCCACTAATTCGGCAGGTGGCATCCTTATTCTTT
GGAATGAACCAGATTTTTCGGTTCTGGAGGTTATCCGAGGCCTTTATTCTCTTTCAATTCACGTTTTATTATCTGATGGTTTTGCTTTTACAGTTAACAAGAGATATGGT
CCTTCAGCCTCAGGTTTTCACGAGGATTTCTGGAGTGAATTGAATGATATTGCAGGCTTGGGAGATGTTTGTCTCAGGAAATTTGGTAATGCTATTTTCAGACGTATTGA
CCGACTCACTTCTGACCATTACCCTTTGGCATTATCCTTTGGTGATATAGAATGGGGGCCTTGTCCCTTTCGCTTTGAGAATGCGTGGCTTCAAAACTCGTCTATTCGTC
AGGTGGTTAATGATTGGTGGCAACAAAACAGACTGGAAGGTTGGCCAGGACATGGCTTCATGATGAAATTGAAAGGGCTCAAAATTGTACTCAAACAGTGGATTAAATCC
AATAACCCACGTGCTTCTCTTTTTTCTTCTCTTATGGCCCAGTTGCATTCTCTAGATAGCTTAGCTGATGCTGGCCCTCTTACGGTTGTTCATTCGACTGCTCATAAATC
ACCTTGGCGGAATATTTCACTATTGTCGAACTTGGTTTCAAGTAATGTTTCTCGTGTTTTAGGTAATGGTTGTCATACTTTATTTTGGAAGGATTCTTGGCTCAGTTGCG
GACCTTGTGCATTAGCTTTCCCTCGTTTATTTCGTCTCACTACTAATCCAGATAGTTTGGTGGTTAACCTTTGGAATGAAGTTACAGAAGCTTGGGATTTGTTACCTCGT
CGTAATCTAAATGAACTAGAAATAGAGGAATGGACTACTCTATCTCAACTCTTGTCTTTAGTTAGACTTCGGAACTTCCCTGATTCTTGGTCTTGGTCCCTTGATCCTTC
TGGTTCCTTTTCGGTTAGTTCGCTCATGGATTCTCTGGTTAGTGAGTCTGATCCCCGGTTAAAAGACTTGTATACTGCTATTTGGAAGGACATGTATCCAAAGAAGGTCA
AAATATTCCTTTGGGAACTCAGTTTGGGAGCAATAAATACAAATGGATGA
Protein sequenceShow/hide protein sequence
MRFLTWNVQGLGSWKKRSLIKKTIQLQNPGIVLLQETKLTAIDSSMIKTIWSSSHIGWTTLDATNSAGGILILWNEPDFSVLEVIRGLYSLSIHVLLSDGFAFTVNKRYG
PSASGFHEDFWSELNDIAGLGDVCLRKFGNAIFRRIDRLTSDHYPLALSFGDIEWGPCPFRFENAWLQNSSIRQVVNDWWQQNRLEGWPGHGFMMKLKGLKIVLKQWIKS
NNPRASLFSSLMAQLHSLDSLADAGPLTVVHSTAHKSPWRNISLLSNLVSSNVSRVLGNGCHTLFWKDSWLSCGPCALAFPRLFRLTTNPDSLVVNLWNEVTEAWDLLPR
RNLNELEIEEWTTLSQLLSLVRLRNFPDSWSWSLDPSGSFSVSSLMDSLVSESDPRLKDLYTAIWKDMYPKKVKIFLWELSLGAINTNG