; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010078 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010078
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:44444654..44445243
RNA-Seq ExpressionLag0010078
SyntenyLag0010078
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.3e-1434.32Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P       G+K S  KL+++ FL+WKFQ            FL S  E             +S+T  P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KKG S+   ++ L IL  +D
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]3.9e-1434.15Show/hide
Query:  VSERSSSEAITNPK--FINPGSKTSTPKLDEENFLMWKFQ-----------------------FLNSTDES--SSTRIPTPGYDYWIRQDNLITVWLLSS
        +S  SS   + N +      G+K S  KL ++NFL+WKFQ                       +L ST  S  S+TR P P Y  W R + LI+ WLL S
Subjt:  VSERSSSEAITNPK--FINPGSKTSTPKLDEENFLMWKFQ-----------------------FLNSTDES--SSTRIPTPGYDYWIRQDNLITVWLLSS

Query:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        M+  I+ + ++CK++KE+W  L   FS +++A+ ++ K KL   KKG S+S  ++ L I   +D
Subjt:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]8.6e-1432.11Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P        +K S  KL ++NFL+WKFQ            FL S  E             +S+TR P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKK----------------------GHSISHSDHALYILGVL
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KK                         +S  DH LYIL  L
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKK----------------------GHSISHSDHALYILGVL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.3e-1434.32Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P       G+K S  KL+++ FL+WKFQ            FL S  E             +S+T  P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KKG S+   ++ L IL  +D
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.9e-1737.2Show/hide
Query:  SSVSERSSSEAITNPKFINPGSKTSTPKLDEENFLMWKF-----------------------QFLNST-DESSSTRI-PTPGYDYWIRQDNLITVWLLSS
        SS+    ++  I   K INPGSK S  +L+++N L+WKF                       QF+ +T DESSS+ +   P Y  WI+QD LI+ WLL S
Subjt:  SSVSERSSSEAITNPKFINPGSKTSTPKLDEENFLMWKF-----------------------QFLNST-DESSSTRI-PTPGYDYWIRQDNLITVWLLSS

Query:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        M   I+ + L CK+++E+W+ L   F+ + +ARV++LK KL   KKG ++S  D+ L I  ++D
Subjt:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1434.32Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P       G+K S  KL+++ FL+WKFQ            FL S  E             +S+T  P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KKG S+   ++ L IL  +D
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

A0A5A7UB21 Keratin, type II cytoskeletal 1-like1.9e-1434.15Show/hide
Query:  VSERSSSEAITNPK--FINPGSKTSTPKLDEENFLMWKFQ-----------------------FLNSTDES--SSTRIPTPGYDYWIRQDNLITVWLLSS
        +S  SS   + N +      G+K S  KL ++NFL+WKFQ                       +L ST  S  S+TR P P Y  W R + LI+ WLL S
Subjt:  VSERSSSEAITNPK--FINPGSKTSTPKLDEENFLMWKFQ-----------------------FLNSTDES--SSTRIPTPGYDYWIRQDNLITVWLLSS

Query:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        M+  I+ + ++CK++KE+W  L   FS +++A+ ++ K KL   KKG S+S  ++ L I   +D
Subjt:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1434.32Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P       G+K S  KL+++ FL+WKFQ            FL S  E             +S+T  P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KKG S+   ++ L IL  +D
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like4.2e-1432.11Show/hide
Query:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV
        +S  SS   + N +  +P        +K S  KL ++NFL+WKFQ            FL S  E             +S+TR P P Y  W RQD LI+ 
Subjt:  VSERSSSEAITNPKFINP-------GSKTSTPKLDEENFLMWKFQ------------FLNSTDE-------------SSSTRIPTPGYDYWIRQDNLITV

Query:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKK----------------------GHSISHSDHALYILGVL
        WLL SM+  I+ + L+CK++KE+W  L   FS +++A+ ++ K KL   KK                         +S  DH LYIL  L
Subjt:  WLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKK----------------------GHSISHSDHALYILGVL

A0A6J1DLT9 uncharacterized protein LOC1110217572.4e-1737.2Show/hide
Query:  SSVSERSSSEAITNPKFINPGSKTSTPKLDEENFLMWKF-----------------------QFLNST-DESSSTRI-PTPGYDYWIRQDNLITVWLLSS
        SS+    ++  I   K INPGSK S  +L+++N L+WKF                       QF+ +T DESSS+ +   P Y  WI+QD LI+ WLL S
Subjt:  SSVSERSSSEAITNPKFINPGSKTSTPKLDEENFLMWKF-----------------------QFLNST-DESSSTRI-PTPGYDYWIRQDNLITVWLLSS

Query:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD
        M   I+ + L CK+++E+W+ L   F+ + +ARV++LK KL   KKG ++S  D+ L I  ++D
Subjt:  MTTAIIVEFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKGHSISHSDHALYILGVLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.3e-0424.29Show/hide
Query:  MSSSVSERSSSEAITNPKFINPG----SKTSTPKL--DEENFLMWKFQ---FLNSTDESS--STRIPTPG-----YDYWIRQDNLITVWLLSSMTTAIIV
        M+ ++   S +    +P ++ P     S  S  KL  DE+N++ WK +   FL  T +       +P P      Y  W + + ++  WL++SMT  ++ 
Subjt:  MSSSVSERSSSEAITNPKFINPG----SKTSTPKL--DEENFLMWKFQ---FLNSTDESS--STRIPTPG-----YDYWIRQDNLITVWLLSSMTTAIIV

Query:  EFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKG
          +Y +T+ ++W  L   F      ++ +L+ +L T ++G
Subjt:  EFLYCKTSKEVWSHLSARFSLKHMARVLELKTKLGTTKKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTTCAGTTTCGGAAAGAAGCTCTTCAGAAGCAATCACGAATCCAAAATTCATCAATCCTGGGAGCAAGACTTCGACTCCGAAGTTAGATGAAGAGAATTTCTT
GATGTGGAAATTTCAATTCCTGAATTCTACCGATGAATCGTCTTCTACACGAATCCCTACTCCTGGGTACGATTACTGGATTCGACAGGACAATCTCATCACAGTATGGC
TTCTTAGTTCGATGACTACTGCGATTATTGTTGAATTTCTTTATTGCAAAACTTCCAAAGAGGTGTGGTCTCATCTTTCAGCCCGTTTTTCGTTGAAACACATGGCTAGG
GTTCTTGAACTTAAGACAAAACTTGGAACAACCAAGAAAGGACACTCTATTTCACATAGTGATCATGCATTGTATATACTAGGGGTCTTGGACCTGAATATGATCCCACT
GTATCTGTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTTCAGTTTCGGAAAGAAGCTCTTCAGAAGCAATCACGAATCCAAAATTCATCAATCCTGGGAGCAAGACTTCGACTCCGAAGTTAGATGAAGAGAATTTCTT
GATGTGGAAATTTCAATTCCTGAATTCTACCGATGAATCGTCTTCTACACGAATCCCTACTCCTGGGTACGATTACTGGATTCGACAGGACAATCTCATCACAGTATGGC
TTCTTAGTTCGATGACTACTGCGATTATTGTTGAATTTCTTTATTGCAAAACTTCCAAAGAGGTGTGGTCTCATCTTTCAGCCCGTTTTTCGTTGAAACACATGGCTAGG
GTTCTTGAACTTAAGACAAAACTTGGAACAACCAAGAAAGGACACTCTATTTCACATAGTGATCATGCATTGTATATACTAGGGGTCTTGGACCTGAATATGATCCCACT
GTATCTGTCCTGA
Protein sequenceShow/hide protein sequence
MSSSVSERSSSEAITNPKFINPGSKTSTPKLDEENFLMWKFQFLNSTDESSSTRIPTPGYDYWIRQDNLITVWLLSSMTTAIIVEFLYCKTSKEVWSHLSARFSLKHMAR
VLELKTKLGTTKKGHSISHSDHALYILGVLDLNMIPLYLS