CuGenDBv2

Lag0007855 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007855
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:6539860..6540888
RNA-Seq Expression: Lag0007855
Synteny: Lag0007855
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains: NA
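
The genome location string can be split into its parts with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record does not state its convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr9:6539860..6540888")
print(chrom, start, end, span)  # chr9 6539860 6540888 1029
```

Under that assumption the gene spans 1029 positions on chr9.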


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 1.9e-35 | identity: 41.21%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++   GNKIS VKL+++ F    F+ L + E                   I     S+S+T  PNPAY  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII
         C +A+E+W+ L G FSSR L + M  K+KL ++KKG++ L+EYFLK+   VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI+ + ++P  + ++
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 5.2e-33 | identity: 41.45%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++    NKIS VKL ++ F    F+ L + E                   I     S+S+T+ PNP Y  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP
         C +A+E+W  L G FSSR L + M  K+KL ++KK ++ L+EYFLK+++ VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI  + E+P
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 1.9e-35 | identity: 41.21%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++   GNKIS VKL+++ F    F+ L + E                   I     S+S+T  PNPAY  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII
         C +A+E+W+ L G FSSR L + M  K+KL ++KKG++ L+EYFLK+   VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI+ + ++P  + ++
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 1.2e-40 | identity: 47.96%
Query:  SSDILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK------------FLRSSE-------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNS
        +SD     QA K +N G+K+S V+L+++      F+            ++ S+E            ESSSS+   NPAY  W++QD LI+AWLLGSM   
Subjt:  SSDILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK------------FLRSSE-------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNS

Query:  LLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKD
        +LS+ML+C +A+E+W +L   F+SR L R+M LK KLE+ KKGNL L++YFLK+KNLVDSLA AGKK+S  DH++HILAGLGPE+D  +SVIT ++
Subjt:  LLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKD

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | e value: 4.6e-37 | identity: 56.39%
Query:  HASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLA
        +  G +SS+T+ PNP Y HW++QD LI+ WLLGSM+  +LS+ML+C   +E+W +L   F+SRNL R+M LKSKLE++KKG++ L+ YFLK+KNLVDSLA
Subjt:  HASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLA

Query:  AAGKKISHTDHVLHILAGLGPEYDPTVSVITEK
         AGK++   DH++HILA LGPE+D  VSVI+ +
Subjt:  AAGKKISHTDHVLHILAGLGPEYDPTVSVITEK

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 9.3e-36 | identity: 41.21%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++   GNKIS VKL+++ F    F+ L + E                   I     S+S+T  PNPAY  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII
         C +A+E+W+ L G FSSR L + M  K+KL ++KKG++ L+EYFLK+   VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI+ + ++P  + ++
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like | e value: 2.5e-33 | identity: 41.45%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++    NKIS VKL ++ F    F+ L + E                   I     S+S+T+ PNP Y  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP
         C +A+E+W  L G FSSR L + M  K+KL ++KK ++ L+EYFLK+++ VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI  + E+P
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 9.3e-36 | identity: 41.21%
Query:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML
        SS   ++   GNKIS VKL+++ F    F+ L + E                   I     S+S+T  PNPAY  W RQD LI++WLLGSM+  +L++ML
Subjt:  SSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSE-------------------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEML

Query:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII
         C +A+E+W+ L G FSSR L + M  K+KL ++KKG++ L+EYFLK+   VD+LA+  K +S  DH+L+ILAGLG +Y   +SVI+ + ++P  + ++
Subjt:  ECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRII

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 5.6e-41 | identity: 47.96%
Query:  SSDILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK------------FLRSSE-------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNS
        +SD     QA K +N G+K+S V+L+++      F+            ++ S+E            ESSSS+   NPAY  W++QD LI+AWLLGSM   
Subjt:  SSDILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK------------FLRSSE-------IHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNS

Query:  LLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKD
        +LS+ML+C +A+E+W +L   F+SR L R+M LK KLE+ KKGNL L++YFLK+KNLVDSLA AGKK+S  DH++HILAGLGPE+D  +SVIT ++
Subjt:  LLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKD

A0A6J1DSS1 uncharacterized protein LOC111023586 | e value: 2.2e-37 | identity: 56.39%
Query:  HASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLA
        +  G +SS+T+ PNP Y HW++QD LI+ WLLGSM+  +LS+ML+C   +E+W +L   F+SRNL R+M LKSKLE++KKG++ L+ YFLK+KNLVDSLA
Subjt:  HASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLA

Query:  AAGKKISHTDHVLHILAGLGPEYDPTVSVITEK
         AGK++   DH++HILA LGPE+D  VSVI+ +
Subjt:  AAGKKISHTDHVLHILAGLGPEYDPTVSVITEK

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.8e-12 | identity: 31.2%
Query:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVL
        NP Y  W RQD LI + +LG+++ S+   +    TA ++W+ L   +++ +   +  L+++L+   KG   +++Y   +    D LA  GK + H + V 
Subjt:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVL

Query:  HILAGLGPEYDPTVSVITEKDETPP
         +L  L  EY P +  I  KD TPP
Subjt:  HILAGLGPEYDPTVSVITEKDETPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.0e-06 | identity: 28%
Query:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVL
        NP Y  W RQD LI + +LG+++ S+   +    TA ++W+ L   +++ +   +  L+                F+      D LA  GK + H + V 
Subjt:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVL

Query:  HILAGLGPEYDPTVSVITEKDETPP
         +L  L  +Y P +  I  KD TPP
Subjt:  HILAGLGPEYDPTVSVITEKDETPP

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 2.0e-06 | identity: 29.27%
Query:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNL
        +P Y  W + + ++  WL+ SMT+ LL  ++  +TA ++W+ L   F      ++  L+ +L +L++G   +EEYF K+  +
Subjt:  NPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 1.0e-10 | identity: 24.86%
Query:  ILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK--FLRSSEIHASGESSSSTKIPNPAYD-HWVRQDNLITAWLLGSMT-NSLLSEMLECDTAQEVWKIL
        ++N  Q   V N+ + I  V LD E   Y +++  FL              T +P  A D +W ++D ++   L G++T        +   T++++W  +
Subjt:  ILNSSQALKVVNLGNKISTVKLDEEIFCYGSFK--FLRSSEIHASGESSSSTKIPNPAYD-HWVRQDNLITAWLLGSMT-NSLLSEMLECDTAQEVWKIL

Query:  NGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP
          +F +    R + L S+L +   G++++ +Y+ K+K L DSL      ++  + V+++L GL P++D  ++VI  +   P
Subjt:  NGRFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETP


Sequences
CDS sequence
ATGGATCTTTCTTCTCCTGAAGAGAGTAGCTCAGATATTCTAAATTCTTCCCAAGCGTTGAAAGTGGTGAATCTTGGAAACAAGATTAGCACAGTAAAGCTCGATGAAGA
AATTTTTTGTTATGGAAGCTTCAAGTTCTTACGTTCCTCCGAAATTCATGCATCAGGTGAGTCATCATCGTCTACAAAGATTCCGAATCCGGCGTATGATCACTGGGTTC
GTCAAGACAACTTGATTACAGCATGGTTGCTGGGTTCCATGACAAATTCACTACTTTCTGAAATGTTAGAATGTGATACTGCGCAAGAAGTCTGGAAGATTCTTAATGGT
CGTTTTTCTTCGAGAAATTTGACTAGATTAATGGATTTGAAATCCAAATTGGAGTCACTCAAGAAAGGTAACCTCAAACTAGAGGAATATTTCTTGAAAGTCAAGAACCT
GGTGGATTCTTTGGCAGCAGCTGGGAAAAAGATCTCCCACACCGATCATGTTCTTCATATTCTAGCAGGATTAGGGCCAGAATATGACCCTACGGTCTCAGTCATCACTG
AGAAGGATGAAACTCCACCTTGCAGAAGAATAATAATGGTCGATCGAATCGAAGGACCTGGAATAATTCTAACAAGCCGCAATGTCAATTGTGTCGGCGCTTTGGGCATA
CTGTACAGAGGTGTTATTACCGTTTTGAGAGGTGGTTTCAGGGTCCAAATACAAATTCCAGTGGTCAATCCTCTTCACAATTTGTTCCTAATCAACAATTTCACTCTGCT
GGACAACAGAATCATCAATCCCGGCAAGTGCCTTCATGTTGCAGCACGATCTGAACAAGGATAA
mRNA sequence
ATGGATCTTTCTTCTCCTGAAGAGAGTAGCTCAGATATTCTAAATTCTTCCCAAGCGTTGAAAGTGGTGAATCTTGGAAACAAGATTAGCACAGTAAAGCTCGATGAAGA
AATTTTTTGTTATGGAAGCTTCAAGTTCTTACGTTCCTCCGAAATTCATGCATCAGGTGAGTCATCATCGTCTACAAAGATTCCGAATCCGGCGTATGATCACTGGGTTC
GTCAAGACAACTTGATTACAGCATGGTTGCTGGGTTCCATGACAAATTCACTACTTTCTGAAATGTTAGAATGTGATACTGCGCAAGAAGTCTGGAAGATTCTTAATGGT
CGTTTTTCTTCGAGAAATTTGACTAGATTAATGGATTTGAAATCCAAATTGGAGTCACTCAAGAAAGGTAACCTCAAACTAGAGGAATATTTCTTGAAAGTCAAGAACCT
GGTGGATTCTTTGGCAGCAGCTGGGAAAAAGATCTCCCACACCGATCATGTTCTTCATATTCTAGCAGGATTAGGGCCAGAATATGACCCTACGGTCTCAGTCATCACTG
AGAAGGATGAAACTCCACCTTGCAGAAGAATAATAATGGTCGATCGAATCGAAGGACCTGGAATAATTCTAACAAGCCGCAATGTCAATTGTGTCGGCGCTTTGGGCATA
CTGTACAGAGGTGTTATTACCGTTTTGAGAGGTGGTTTCAGGGTCCAAATACAAATTCCAGTGGTCAATCCTCTTCACAATTTGTTCCTAATCAACAATTTCACTCTGCT
GGACAACAGAATCATCAATCCCGGCAAGTGCCTTCATGTTGCAGCACGATCTGAACAAGGATAA
Protein sequence
MDLSSPEESSSDILNSSQALKVVNLGNKISTVKLDEEIFCYGSFKFLRSSEIHASGESSSSTKIPNPAYDHWVRQDNLITAWLLGSMTNSLLSEMLECDTAQEVWKILNG
RFSSRNLTRLMDLKSKLESLKKGNLKLEEYFLKVKNLVDSLAAAGKKISHTDHVLHILAGLGPEYDPTVSVITEKDETPPCRRIIMVDRIEGPGIILTSRNVNCVGALGI
LYRGVITVLRGGFRVQIQIPVVNPLHNLFLINNFTLLDNRIINPGKCLHVAARSEQG
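
As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The sketch below assumes the standard nuclear genetic code; the record does not state which table was used for annotation. The `translate` helper is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
# Assumption: the standard nuclear table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last five codons of the CDS shown above.
print(translate("ATGGATCTTTCTTCTCCTGAAGAG"))  # MDLSSPEE
print(translate("TCTGAACAAGGATAA"))           # SEQG*
```

Both fragments agree with the start (MDLSSPEE) and end (SEQG, followed by a stop codon) of the listed protein sequence. Passing the full CDS, with line breaks removed, to `translate` would extend the check to the whole protein.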