; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022657 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022657
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:35078624..35079154
RNA-Seq ExpressionLag0022657
SyntenyLag0022657
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL++D FLLWK Q+ TAL  + L N +  +SE PSK++ + +         PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W+TL   FSSR LA  M  K+KL   KKG + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.5e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL +DNFLLWK Q+ TAL  + L N +  +SE PSK++ +         + PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W TL   FSSR LA  M  K+KL   KK  + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL++D FLLWK Q+ TAL  + L N +  +SE PSK++ + +         PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W+TL   FSSR LA  M  K+KL   KKG + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.5e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL +DNFLLWK Q+ TAL  + L N +  +SE PSK++ +         + PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W TL   FSSR LA  M  K+KL   KK  + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.3e-2941.57Show/hide
Query:  MESSGTEKNSNDS---QQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFI------TNGDLKDPNPEYDQWVRQDSLITAW
        M S  + +NS+ +   Q  K +N  +K++ V+L++DN LLWK Q+ TAL+G+ L ++++ + + P++F+      ++      NP Y +W++QD LI+AW
Subjt:  MESSGTEKNSNDS---QQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFI------TNGDLKDPNPEYDQWVRQDSLITAW

Query:  LLGAMPNSILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVKSRTLL
        LLG+M   ILS+MLDC++A+E+W  L   F+SR LA +M LK KLE  KKG L L++YF+K + L+
Subjt:  LLGAMPNSILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVKSRTLL

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL++D FLLWK Q+ TAL  + L N +  +SE PSK++ + +         PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W+TL   FSSR LA  M  K+KL   KKG + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like4.1e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL +DNFLLWK Q+ TAL  + L N +  +SE PSK++ +         + PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W TL   FSSR LA  M  K+KL   KK  + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL++D FLLWK Q+ TAL  + L N +  +SE PSK++ + +         PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKD------PNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W+TL   FSSR LA  M  K+KL   KKG + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like4.1e-2946.1Show/hide
Query:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN
        G E     S   +I    NKI+ VKL +DNFLLWK Q+ TAL  + L N +  +SE PSK++ +         + PNP Y  W RQD LI++WLLG+M  
Subjt:  GTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITN------GDLKDPNPEYDQWVRQDSLITAWLLGAMPN

Query:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
         IL++ML C++AKE+W TL   FSSR LA  M  K+KL   KK  + L+EYF+K
Subjt:  SILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK

A0A6J1DLT9 uncharacterized protein LOC1110217576.3e-3041.57Show/hide
Query:  MESSGTEKNSNDS---QQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFI------TNGDLKDPNPEYDQWVRQDSLITAW
        M S  + +NS+ +   Q  K +N  +K++ V+L++DN LLWK Q+ TAL+G+ L ++++ + + P++F+      ++      NP Y +W++QD LI+AW
Subjt:  MESSGTEKNSNDS---QQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFI------TNGDLKDPNPEYDQWVRQDSLITAW

Query:  LLGAMPNSILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVKSRTLL
        LLG+M   ILS+MLDC++A+E+W  L   F+SR LA +M LK KLE  KKG L L++YF+K + L+
Subjt:  LLGAMPNSILSEMLDCETAKEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVKSRTLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.5e-1127.21Show/hide
Query:  VNHENKITTVKL--DEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKDPNPEYDQWVRQDSLITAWLLGAMPNSILSEMLDCETAKEVWKT
        ++H +  +  KL  DEDN++ WK++  + LR       ++     P  F         +P Y  W + ++++  WL+ +M + +L  ++  ETA ++W+ 
Subjt:  VNHENKITTVKL--DEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKDPNPEYDQWVRQDSLITAWLLGAMPNSILSEMLDCETAKEVWKT

Query:  LNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK
        L   F       I  L+ +L T ++GG  +EEYF K
Subjt:  LNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCTCTGGTACAGAGAAGAACTCCAATGATTCTCAGCAGTTCAAGATAGTAAACCATGAGAATAAAATCACTACTGTGAAGCTTGATGAGGATAATTTTCTTCT
ATGGAAATTGCAAGTGACCACTGCGTTACGAGGGCACTGTCTGATGAACCATGTGAATGAAGACTCTGAAGCACCTTCGAAATTCATCACCAATGGCGATTTGAAGGATC
CTAATCCTGAGTATGATCAATGGGTGAGGCAAGATAGCTTAATTACGGCCTGGCTTCTTGGTGCCATGCCCAATTCGATTCTTTCTGAGATGCTCGACTGCGAAACAGCC
AAAGAGGTATGGAAAACTCTCAACGCTCGTTTTTCTTCCAGAAATTTAGCAAGTATAATGGATCTGAAATCCAAGTTAGAGACTACAAAGAAAGGTGGCCTCAAACTAGA
AGAGTACTTTGTAAAATCAAGAACATTGTTGATTCTTTGTCTACTGCTGGAAGAAAACTTACTCATGAAGATCACGTTTTACACCTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCCTCTGGTACAGAGAAGAACTCCAATGATTCTCAGCAGTTCAAGATAGTAAACCATGAGAATAAAATCACTACTGTGAAGCTTGATGAGGATAATTTTCTTCT
ATGGAAATTGCAAGTGACCACTGCGTTACGAGGGCACTGTCTGATGAACCATGTGAATGAAGACTCTGAAGCACCTTCGAAATTCATCACCAATGGCGATTTGAAGGATC
CTAATCCTGAGTATGATCAATGGGTGAGGCAAGATAGCTTAATTACGGCCTGGCTTCTTGGTGCCATGCCCAATTCGATTCTTTCTGAGATGCTCGACTGCGAAACAGCC
AAAGAGGTATGGAAAACTCTCAACGCTCGTTTTTCTTCCAGAAATTTAGCAAGTATAATGGATCTGAAATCCAAGTTAGAGACTACAAAGAAAGGTGGCCTCAAACTAGA
AGAGTACTTTGTAAAATCAAGAACATTGTTGATTCTTTGTCTACTGCTGGAAGAAAACTTACTCATGAAGATCACGTTTTACACCTTCTAA
Protein sequenceShow/hide protein sequence
MESSGTEKNSNDSQQFKIVNHENKITTVKLDEDNFLLWKLQVTTALRGHCLMNHVNEDSEAPSKFITNGDLKDPNPEYDQWVRQDSLITAWLLGAMPNSILSEMLDCETA
KEVWKTLNARFSSRNLASIMDLKSKLETTKKGGLKLEEYFVKSRTLLILCLLLEENLLMKITFYTF