CuGenDBv2

Lag0042055 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0042055
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr13:35598385..35598987
RNA-Seq Expression: Lag0042055
Synteny: Lag0042055
Gene Ontology terms:
  GO:0034641 - cellular nitrogen compound metabolic process (biological process)
  GO:0071704 - organic substance metabolic process (biological process)
InterPro domains: NA


Homology

GenBank top hits (e-value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e-value: 2.7e-39 | identity: 45.23%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I      G+K + VKL+++ FLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+   T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W+ L   FSS  +A+ M  K+KL  +KKGS+ L+EYFLKI   VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e-value: 1.3e-38 | identity: 44.72%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I       +K + VKL ++NFLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+  +T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W  L   FSS  +A+ M  K+KL  +KK S+ L+EYFLKI++ VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e-value: 2.7e-39 | identity: 45.23%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I      G+K + VKL+++ FLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+   T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W+ L   FSS  +A+ M  K+KL  +KKGS+ L+EYFLKI   VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e-value: 3.8e-49 | identity: 52.91%
Query:  DLSNFSQSSKIENPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLL
        D +   Q+SK  NPG K + V+L+++N LLWK Q+ T L+G GLE ++D++   P + + +T   +   +L  NP Y  WI+QD LI+ WLLGSM   +L
Subjt:  DLSNFSQSSKIENPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLL

Query:  SEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        S+MLDCK+ARE+W +L   F+S  +ARVM LK KLE  KKG+L L++YFLKIKNLVDSL  AG+K+S EDHIMHILA LGP++D+ +SV
Subjt:  SEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] | e-value: 6.2e-44 | identity: 56.96%
Query:  KLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDL
        K QVLT ++G+GLEQ++D+D   P + I + +  T   T   NP+Y +WI+QD LI+ WLLGSM+  +LS+MLDC+  +E+W +L   F+S N+ARVM L
Subjt:  KLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDL

Query:  KSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        KSKLE MKKGS+ L+ YFLKIKNLVDSL  AG+++  +DHIMHILARLGP++DS VSV
Subjt:  KSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.3e-39 | identity: 45.23%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I      G+K + VKL+++ FLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+   T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W+ L   FSS  +A+ M  K+KL  +KKGS+ L+EYFLKI   VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.3e-39 | identity: 45.23%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I      G+K + VKL+++ FLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+   T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W+ L   FSS  +A+ M  K+KL  +KKGS+ L+EYFLKI   VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like | e-value: 6.5e-39 | identity: 44.72%
Query:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW
        S+  ++  + N   SS I       +K + VKL ++NFLLWK Q+LT L  Y LE FL+++S  P K + ST  ++A+  +T NP Y+ W RQD LI++W
Subjt:  STNKTIPDLSNFSQSSKIE---NPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIAST--NAATFEQTLNPDYQYWIRQDSLITTW

Query:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        LLGSM+  +L++ML CK+A+E+W  L   FSS  +A+ M  K+KL  +KK S+ L+EYFLKI++ VD+L +  + +S +DHI++ILA LG  Y S +SV
Subjt:  LLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

A0A6J1DLT9 uncharacterized protein LOC111021757 | e-value: 1.8e-49 | identity: 52.91%
Query:  DLSNFSQSSKIENPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLL
        D +   Q+SK  NPG K + V+L+++N LLWK Q+ T L+G GLE ++D++   P + + +T   +   +L  NP Y  WI+QD LI+ WLLGSM   +L
Subjt:  DLSNFSQSSKIENPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLL

Query:  SEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        S+MLDCK+ARE+W +L   F+S  +ARVM LK KLE  KKG+L L++YFLKIKNLVDSL  AG+K+S EDHIMHILA LGP++D+ +SV
Subjt:  SEMLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

A0A6J1DSS1 uncharacterized protein LOC111023586 | e-value: 3.0e-44 | identity: 56.96%
Query:  KLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDL
        K QVLT ++G+GLEQ++D+D   P + I + +  T   T   NP+Y +WI+QD LI+ WLLGSM+  +LS+MLDC+  +E+W +L   F+S N+ARVM L
Subjt:  KLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTL--NPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNMARVMDL

Query:  KSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV
        KSKLE MKKGS+ L+ YFLKIKNLVDSL  AG+++  +DHIMHILARLGP++DS VSV
Subjt:  KSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSV

SwissProt top hits (e-value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 8.0e-18 | identity: 29.88%
Query:  KLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSC
        KL   N+L+W  QV     GY L  FLD  + +PP +I  T+AA     +NPDY  W RQD LI + +LG+++ S+   +    TA ++W+ L   +++ 
Subjt:  KLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSC

Query:  NMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTV
        +   V  L+++L+   KG+  +++Y   +    D L   G+ + H++ +  +L  L  +Y   +
Subjt:  NMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 5.5e-11 | identity: 26.83%
Query:  KLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSC
        KL   N+L+W  QV     GY L  FLD  + +PP +I  T+A      +NPDY  W RQD LI + +LG+++ S+   +    TA ++W+ L   +++ 
Subjt:  KLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSC

Query:  NMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTV
        +   V                     L+     D L   G+ + H++ +  +L  L   Y   +
Subjt:  NMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTV

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e-value: 5.7e-11 | identity: 27.69%
Query:  DEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNM
        DE+N++ WK++  + LR      F+D     P                +P YQ W + ++++  WL+ SMT+ LL  ++  +TA ++W+ L   F  C  
Subjt:  DEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSEMLDCKTAREVWKILNARFSSCNM

Query:  ARVMDLKSKLETMKKGSLKLEEYFLKIKNL
         ++  L+ +L T+++G   +EEYF K+  +
Subjt:  ARVMDLKSKLETMKKGSLKLEEYFLKIKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e-value: 3.3e-11 | identity: 23.43%
Query:  HKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMT-NSLLSEMLDCKTAREVWKIL
        H    + ++E N+  W+   LT    + +   +D        ++  TNA             W ++D ++   L G++T        +   T+R++W  +
Subjt:  HKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMT-NSLLSEMLDCKTAREVWKIL

Query:  NARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSVRK
          +F +   AR + L S+L T   G +++ +Y+ K+K L DSL      ++  + +M++L  L PK+D+ ++V K
Subjt:  NARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSVRK


Sequences

CDS sequence
ATGGAGTCTTCAACAAACAAGACGATACCAGATCTCTCAAATTTCTCTCAATCTTCAAAAATAGAAAACCCAGGCCATAAAACTACCTCGGTTAAACTTGATGAAGAGAA
CTTCCTCCTGTGGAAACTTCAGGTGCTTACCAAGCTTCGAGGATATGGATTGGAACAATTTCTTGATAACGATTCTGCAATTCCTCCTAAATCCATTGCTTCTACAAATG
CTGCAACGTTTGAACAAACTTTAAATCCTGATTATCAGTATTGGATTCGCCAAGACAGCTTGATTACGACGTGGCTCTTGGGTTCTATGACGAACTCACTCCTGTCTGAG
ATGTTAGACTGCAAAACTGCTCGTGAGGTATGGAAAATTCTTAATGCCCGTTTTTCTTCGTGCAATATGGCTAGAGTTATGGACCTTAAATCAAAACTTGAGACAATGAA
GAAAGGTAGTTTAAAGCTTGAAGAATATTTTTTGAAAATTAAGAATCTTGTGGACTCACTAACAGCAGCTGGTCGAAAAATATCACATGAGGATCACATAATGCACATTT
TAGCAAGATTGGGGCCGAAATATGATTCTACAGTATCAGTCAGAAAAAGATGA
mRNA sequence
ATGGAGTCTTCAACAAACAAGACGATACCAGATCTCTCAAATTTCTCTCAATCTTCAAAAATAGAAAACCCAGGCCATAAAACTACCTCGGTTAAACTTGATGAAGAGAA
CTTCCTCCTGTGGAAACTTCAGGTGCTTACCAAGCTTCGAGGATATGGATTGGAACAATTTCTTGATAACGATTCTGCAATTCCTCCTAAATCCATTGCTTCTACAAATG
CTGCAACGTTTGAACAAACTTTAAATCCTGATTATCAGTATTGGATTCGCCAAGACAGCTTGATTACGACGTGGCTCTTGGGTTCTATGACGAACTCACTCCTGTCTGAG
ATGTTAGACTGCAAAACTGCTCGTGAGGTATGGAAAATTCTTAATGCCCGTTTTTCTTCGTGCAATATGGCTAGAGTTATGGACCTTAAATCAAAACTTGAGACAATGAA
GAAAGGTAGTTTAAAGCTTGAAGAATATTTTTTGAAAATTAAGAATCTTGTGGACTCACTAACAGCAGCTGGTCGAAAAATATCACATGAGGATCACATAATGCACATTT
TAGCAAGATTGGGGCCGAAATATGATTCTACAGTATCAGTCAGAAAAAGATGA
Protein sequence
MESSTNKTIPDLSNFSQSSKIENPGHKTTSVKLDEENFLLWKLQVLTKLRGYGLEQFLDNDSAIPPKSIASTNAATFEQTLNPDYQYWIRQDSLITTWLLGSMTNSLLSE
MLDCKTAREVWKILNARFSSCNMARVMDLKSKLETMKKGSLKLEEYFLKIKNLVDSLTAAGRKISHEDHIMHILARLGPKYDSTVSVRKR
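
The CDS, protein, and genome location listed in this record can be cross-checked against each other. The following is a minimal Python sketch of such a check, not part of the database itself. The helper names `translate` and `span_length` are hypothetical, the standard nuclear genetic code (NCBI translation table 1) is assumed, and the genome location is assumed to use 1-based inclusive coordinates.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed appropriate for a nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored so a wrapped sequence can be
    pasted in directly. (Hypothetical helper, for illustration only.)
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' span, assuming 1-based inclusive ends."""
    _, coords = location.split(":")
    start, end = coords.split("..")
    return int(end) - int(start) + 1


# First seven codons of the CDS above give the start of the protein.
print(translate("ATGGAGTCTTCAACAAACAAG"))        # MESSTNK
# Span of the gene on chr13 under the inclusive-coordinate assumption.
print(span_length("chr13:35598385..35598987"))   # 603
```

Passing the full CDS to `translate` should reproduce the protein sequence listed above. A span length equal to the CDS length would be consistent with a single-exon gene model, although the record does not state this explicitly.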