CuGenDBv2

Lag0000336 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000336
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:4528106..4528681
RNA-Seq Expression: Lag0000336
Synteny: Lag0000336
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
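The genome location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the function name `span_length` is hypothetical and not part of any CuGenDB tool.

```python
import re

def span_length(location: str) -> int:
    """Return the length of a 'chrom:start..end' span.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    by the record itself).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return end - start + 1

print(span_length("chr4:4528106..4528681"))  # 576
```

Under that assumption the span covers 576 bases, which is a multiple of three.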


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0045111.1 putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa] (e-value: 4.1e-45; identity: 50.26%)
Query:  VQIEFALEGHDLENFINSDVEPPPKRIS-------VSENSSATK----VNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKN
        V+  F  E +DLE++++S+ EPP K I+       V+  SS+      +NPEY  WK+QDRLISSWLLGSM E+IL Q++HC S+++IW  L  I++S+ 
Subjt:  VQIEFALEGHDLENFINSDVEPPPKRIS-------VSENSSATK----VNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKN

Query:  KAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
         A+ M+ KN+L  ++KG +SL EYF +I++C+DALA+I K I  +DH+LYIL GLG +Y+SM+S I+A T+  SVQ+VM+LLLT E+QIE K+ S
Subjt:  KAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 3.5e-52; identity: 57.37%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  +E+SSA+     NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG M L EYF +I +C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  SVQEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] (e-value: 1.1e-50; identity: 56.32%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISV--SENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF  S++EPP K ++   S ++SAT+  NPEY  WK+ +RLIS WLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISV--SENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG MSL EYF +I++C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  S+QEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] (e-value: 2.5e-42; identity: 54.71%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRI--SVSENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  + S ++SAT+  NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRI--SVSENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSV
        + KN+L  I+K  M L EYF +I+  +DALA+I K +  +DH+LYIL GLG DY+SM+S I   TE  SV
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSV

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 3.5e-52; identity: 57.37%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  +E+SSA+     NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG M L EYF +I +C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  SVQEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TUB3 Putative glutathione S-transferase isoform X1 (e-value: 2.0e-45; identity: 50.26%)
Query:  VQIEFALEGHDLENFINSDVEPPPKRIS-------VSENSSATK----VNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKN
        V+  F  E +DLE++++S+ EPP K I+       V+  SS+      +NPEY  WK+QDRLISSWLLGSM E+IL Q++HC S+++IW  L  I++S+ 
Subjt:  VQIEFALEGHDLENFINSDVEPPPKRIS-------VSENSSATK----VNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKN

Query:  KAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
         A+ M+ KN+L  ++KG +SL EYF +I++C+DALA+I K I  +DH+LYIL GLG +Y+SM+S I+A T+  SVQ+VM+LLLT E+QIE K+ S
Subjt:  KAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.7e-52; identity: 57.37%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  +E+SSA+     NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG M L EYF +I +C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  SVQEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

A0A5A7UB21 Keratin, type II cytoskeletal 1-like (e-value: 5.4e-51; identity: 56.32%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISV--SENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF  S++EPP K ++   S ++SAT+  NPEY  WK+ +RLIS WLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISV--SENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG MSL EYF +I++C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  S+QEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like (e-value: 1.2e-42; identity: 54.71%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRI--SVSENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  + S ++SAT+  NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRI--SVSENSSATKV-NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSV
        + KN+L  I+K  M L EYF +I+  +DALA+I K +  +DH+LYIL GLG DY+SM+S I   TE  SV
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.7e-52; identity: 57.37%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM
        +WK QI  ALE +DLENF+ S+ EPP K +  +E+SSA+     NP Y  WK+QDRLISSWLLGSM+E IL Q++HCKS++EIW  L  IF+S+  AQ M
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKV---NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVM

Query:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        + KN+L  I+KG M L EYF +I +C+DALA+I K +  +DH+LYIL GLG DY+SM+S I+A T+  SVQEVM+LLLT E+Q E KL S
Subjt:  RMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS

SwissProt top hits (e-value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 4.3e-21; identity: 28.73%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMK
        MW  Q+    +G++L  F++     PP  I      +A +VNP+YT+WK+QD+LI S +LG+++ ++   V    ++ +IW  L +I+ + +   V +++
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMK

Query:  NRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQI
         +L+   KG  ++ +Y   +    D LA +GK +D ++ V  +L  L  +Y+ ++  I A     ++ E+   LL HE++I
Subjt:  NRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 5.1e-14; identity: 24.86%)
Query:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMK
        MW  Q+    +G++L  F++     PP  I      +  +VNP+YT+W++QD+LI S +LG+++ ++   V    ++ +IW  L +I+ + +   V +++
Subjt:  MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMK

Query:  NRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQI
                       + ++     D LA +GK +D ++ V  +L  L  DY+ ++  I A     S+ E+   L+  E+++
Subjt:  NRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQI

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e-value: 1.7e-09; identity: 29.63%)
Query:  NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKK
        +P Y  W++ + ++  WL+ SMT+ +LE V++ +++ ++W  L ++F      ++ +++ RL T+++GG S+ EYF ++ K
Subjt:  NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKK

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e-value: 9.9e-13; identity: 23.97%)
Query:  NPEYTKWKKQDRLISSWLLGSMTENILE-QVIHCKSSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHV
        N     W+K+D ++   L G++T    +   +   +SR+IW+ +   F +   A+ +R+ + L+T   G M +++Y+ ++KK  D+L  +   + + + V
Subjt:  NPEYTKWKKQDRLISSWLLGSMTENILE-QVIHCKSSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKTIDEEDHV

Query:  LYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLK
        +Y+L GL P ++++++ I       S  +   +L   E++++R +K
Subjt:  LYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e-value: 8.1e-15; identity: 25.97%)
Query:  NSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCK-SSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKT
        + S+T       +WK++D L+  W+ G++T+++L+ +I    ++R++W+ L  +F    +A+ ++ +N L+T     +S+ EY  ++K   D L  +   
Subjt:  NSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCK-SSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGGMSLSEYFSQIKKCIDALAAIGKT

Query:  IDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
        I +   V+++L GL   Y+ +++ I   +   S  E  ++LL  E+++  K KS
Subjt:  IDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKS
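The % identity values reported for each hit can be checked against the alignment midlines. The sketch below assumes the usual BLAST-style midline convention, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch, and it assumes identity is reported relative to the aligned length. The function name `percent_identity` is hypothetical. The strings are the query and midline lines of the AT1G21280.1 hit above.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent identity of one alignment block.

    Letters in the midline are counted as identical positions; '+' and
    spaces are not. The denominator is the aligned query length,
    including any gap characters (assumed convention).
    """
    aligned_length = len(query_line.strip())
    if aligned_length == 0:
        raise ValueError("empty alignment line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

query = ("NPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMKN"
         "RLQTIQKGGMSLSEYFSQIKK")
midline = ("+P Y  W++ + ++  WL+ SMT+ +LE V++ +++ ++W  L ++F      ++ +++ "
           "RL T+++GG S+ EYF ++ K")

print(percent_identity(query, midline))  # 29.63
```

For alignments that span several blocks, the identical counts and aligned lengths would need to be summed across blocks before dividing.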


Sequences
CDS sequence
ATGTGGAAGGTTCAAATTGAATTCGCACTTGAAGGACATGATCTAGAAAATTTCATTAACAGTGATGTTGAACCTCCACCTAAAAGAATCTCGGTATCGGAAAATTCATC
TGCTACTAAAGTTAATCCAGAATATACCAAATGGAAGAAACAAGATCGGTTAATTTCTTCTTGGTTACTTGGCTCAATGACTGAAAATATATTGGAACAAGTCATTCACT
GTAAGTCTTCTAGAGAAATTTGGATGTGTTTACTTCAGATTTTTAATTCAAAGAATAAGGCACAAGTGATGAGGATGAAAAATAGGCTTCAGACTATTCAAAAAGGAGGA
ATGTCGCTTAGTGAATATTTCTCCCAGATTAAGAAGTGCATTGATGCACTGGCTGCAATAGGAAAGACTATTGATGAAGAAGATCATGTTCTTTATATACTCGGCGGTTT
GGGTCCAGACTATGAGTCCATGGTTTCTGCAATCACTGCAACTACTGAAGATCAAAGTGTTCAAGAAGTCATGGCACTTCTCTTAACTCATGAGAACCAGATTGAAAGAA
AACTGAAGTCTATTGAGCAAATTTGA
mRNA sequence
ATGTGGAAGGTTCAAATTGAATTCGCACTTGAAGGACATGATCTAGAAAATTTCATTAACAGTGATGTTGAACCTCCACCTAAAAGAATCTCGGTATCGGAAAATTCATC
TGCTACTAAAGTTAATCCAGAATATACCAAATGGAAGAAACAAGATCGGTTAATTTCTTCTTGGTTACTTGGCTCAATGACTGAAAATATATTGGAACAAGTCATTCACT
GTAAGTCTTCTAGAGAAATTTGGATGTGTTTACTTCAGATTTTTAATTCAAAGAATAAGGCACAAGTGATGAGGATGAAAAATAGGCTTCAGACTATTCAAAAAGGAGGA
ATGTCGCTTAGTGAATATTTCTCCCAGATTAAGAAGTGCATTGATGCACTGGCTGCAATAGGAAAGACTATTGATGAAGAAGATCATGTTCTTTATATACTCGGCGGTTT
GGGTCCAGACTATGAGTCCATGGTTTCTGCAATCACTGCAACTACTGAAGATCAAAGTGTTCAAGAAGTCATGGCACTTCTCTTAACTCATGAGAACCAGATTGAAAGAA
AACTGAAGTCTATTGAGCAAATTTGA
Protein sequence
MWKVQIEFALEGHDLENFINSDVEPPPKRISVSENSSATKVNPEYTKWKKQDRLISSWLLGSMTENILEQVIHCKSSREIWMCLLQIFNSKNKAQVMRMKNRLQTIQKGG
MSLSEYFSQIKKCIDALAAIGKTIDEEDHVLYILGGLGPDYESMVSAITATTEDQSVQEVMALLLTHENQIERKLKSIEQI
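The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and an in-frame CDS starting at the first base; the names `CODON_TABLE` and `translate` are illustrative and not taken from any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)

# First and last codons of the CDS listed above.
print(translate("ATGTGGAAGGTTCAAATTGAA"))  # MWKVQIE
print(translate("ATTGAGCAAATTTGA"))        # IEQI
```

The two fragments reproduce the start (MWKVQIE) and end (IEQI) of the protein sequence above; passing the full wrapped CDS to `translate` works the same way, since whitespace is removed before translation.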