; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025552 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025552
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:15009370..15009882
RNA-Seq ExpressionLag0025552
SyntenyLag0025552
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.7e-3045.81Show/hide
Query:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA
        +I   GNKI+ VKL +D FLLWK QI TAL  + L N +    + P K++  T     + T  PNPA+  W +QD LI++WLLG+MS  +L++ML C++A
Subjt:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA

Query:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
         E+W+TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI   VD+L+ + +
Subjt:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]2.2e-2943.21Show/hide
Query:  GANLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNT--TKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSE
        G     + I   GNKI+ VKL +DNFLLWK QI TAL  + L N      + P K++T+    +T  T+ PNP +  W + + LI+ WLLG+MS  +L++
Subjt:  GANLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNT--TKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSE

Query:  MLDCETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
        M+ C++A E+W TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI+  VD+L+ + +
Subjt:  MLDCETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.7e-3045.81Show/hide
Query:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA
        +I   GNKI+ VKL +D FLLWK QI TAL  + L N +    + P K++  T     + T  PNPA+  W +QD LI++WLLG+MS  +L++ML C++A
Subjt:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA

Query:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
         E+W+TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI   VD+L+ + +
Subjt:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]2.2e-4255Show/hide
Query:  LQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI-TTGDPPNTTKL-PNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLD
        +Q+ K +NPG+K++ V+L +DN LLWK QI TAL+G+GL +++  N D P +F+ TT D  +++ L  NPA+ +WIKQD LI+AWLLG+M+  +LS+MLD
Subjt:  LQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI-TTGDPPNTTKL-PNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLD

Query:  CETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK
        C++A E+W  L   F+SR LARVM LKLKL   KKGNL L++YF+KIKNLVDSL++ G+K
Subjt:  CETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]2.5e-3051.49Show/hide
Query:  KLQITTALRGHGLMNHVSDNPDVPPKFITTGD--PPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSRFSSRNLARVMDL
        K Q+ TA++GHGL  ++  + + P +FI  GD    +TT+ PNP +  WIKQD LI+ WLLG+MS  +LS+MLDC    E+W  L   F+SRNLARVM L
Subjt:  KLQITTALRGHGLMNHVSDNPDVPPKFITTGD--PPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSRFSSRNLARVMDL

Query:  KLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK
        K KL   KKG++ L+ YF+KIKNLVDSL+  G++
Subjt:  KLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-3045.81Show/hide
Query:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA
        +I   GNKI+ VKL +D FLLWK QI TAL  + L N +    + P K++  T     + T  PNPA+  W +QD LI++WLLG+MS  +L++ML C++A
Subjt:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA

Query:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
         E+W+TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI   VD+L+ + +
Subjt:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

A0A5A7UB21 Keratin, type II cytoskeletal 1-like1.0e-2943.21Show/hide
Query:  GANLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNT--TKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSE
        G     + I   GNKI+ VKL +DNFLLWK QI TAL  + L N      + P K++T+    +T  T+ PNP +  W + + LI+ WLLG+MS  +L++
Subjt:  GANLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNT--TKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSE

Query:  MLDCETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
        M+ C++A E+W TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI+  VD+L+ + +
Subjt:  MLDCETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-3045.81Show/hide
Query:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA
        +I   GNKI+ VKL +D FLLWK QI TAL  + L N +    + P K++  T     + T  PNPA+  W +QD LI++WLLG+MS  +L++ML C++A
Subjt:  KIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI--TTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETA

Query:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
         E+W+TL   FSSR LA+ M  K KL   KKG++ L+EYF+KI   VD+L+ + +
Subjt:  CEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

A0A6J1DLT9 uncharacterized protein LOC1110217571.1e-4255Show/hide
Query:  LQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI-TTGDPPNTTKL-PNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLD
        +Q+ K +NPG+K++ V+L +DN LLWK QI TAL+G+GL +++  N D P +F+ TT D  +++ L  NPA+ +WIKQD LI+AWLLG+M+  +LS+MLD
Subjt:  LQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFI-TTGDPPNTTKL-PNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLD

Query:  CETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK
        C++A E+W  L   F+SR LARVM LKLKL   KKGNL L++YF+KIKNLVDSL++ G+K
Subjt:  CETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK

A0A6J1DSS1 uncharacterized protein LOC1110235861.2e-3051.49Show/hide
Query:  KLQITTALRGHGLMNHVSDNPDVPPKFITTGD--PPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSRFSSRNLARVMDL
        K Q+ TA++GHGL  ++  + + P +FI  GD    +TT+ PNP +  WIKQD LI+ WLLG+MS  +LS+MLDC    E+W  L   F+SRNLARVM L
Subjt:  KLQITTALRGHGLMNHVSDNPDVPPKFITTGD--PPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSRFSSRNLARVMDL

Query:  KLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK
        K KL   KKG++ L+ YF+KIKNLVDSL+  G++
Subjt:  KLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.6e-1329.11Show/hide
Query:  NLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDC
        N  SI  VN  N     KL   N+L+W  Q+     G+ L   +  +  +PP  I T   P      NP + +W +QD LI + +LGA+S S+   +   
Subjt:  NLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDC

Query:  ETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE
         TA ++W+TL   +++ +   V  L+ +L    KG   +++Y   +    D L++LG+
Subjt:  ETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.8e-0928.7Show/hide
Query:  NKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLN
        N     KL   N+L+W  Q+     G+ L   +  +  +PP  I T   P      NP + +W +QD LI + +LGA+S S+   +    TA ++W+TL 
Subjt:  NKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLN

Query:  SRFSSRNLARVMDLK
          +++ +   V  L+
Subjt:  SRFSSRNLARVMDLK

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.5e-0924.11Show/hide
Query:  ITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSR
        I  +  +EDN++ WK++  + LR       +      P  F             +P +  W + ++++  WL+ +M++ LL  ++  ETA ++W+ L   
Subjt:  ITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLDCETACEVWKTLNSR

Query:  FSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLS
        F      ++  L+ +L T ++G   +EEYF K+  +   LS
Subjt:  FSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLS

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.4e-0724.09Show/hide
Query:  LEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHD-QWIKQDSLITAWLLGAMS-NSLLSEMLDCETACEVWKTLNSRFSS
        +EE N+  W+    T      +M H+                 + T LP  A+D  W K+D ++   L G ++        +   T+ ++W  + ++F +
Subjt:  LEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHD-QWIKQDSLITAWLLGAMS-NSLLSEMLDCETACEVWKTLNSRFSS

Query:  RNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSL
           AR + L  +L T   G++++ +Y+ K+K L DSL
Subjt:  RNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATACTGCGATTACAGAAGGGAATGGTGCTAACCTACAATCGATCAAGATTGTCAATCCTGGAAACAAGATTACAACTGTGAAACTTGAAGAGGATAATTTTCTCTT
ATGGAAATTACAGATCACTACTGCCTTAAGAGGTCATGGGTTGATGAATCATGTAAGTGATAATCCGGATGTTCCTCCGAAGTTCATCACCACAGGCGATCCCCCGAATA
CTACAAAGCTTCCAAATCCTGCACATGATCAATGGATTAAACAAGATAGTTTAATCACAGCGTGGTTGCTTGGAGCTATGTCAAATTCGCTACTCTCTGAGATGCTCGAT
TGCGAAACAGCCTGCGAAGTATGGAAAACGCTAAATTCTCGGTTTTCTTCAAGAAATCTTGCACGGGTAATGGACTTGAAGTTGAAATTAGGAACAACAAAGAAGGGCAA
TTTGAAATTAGAGGAGTATTTTGTTAAAATCAAGAATCTGGTCGATTCTTTATCTGTGTTGGGCGAAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATACTGCGATTACAGAAGGGAATGGTGCTAACCTACAATCGATCAAGATTGTCAATCCTGGAAACAAGATTACAACTGTGAAACTTGAAGAGGATAATTTTCTCTT
ATGGAAATTACAGATCACTACTGCCTTAAGAGGTCATGGGTTGATGAATCATGTAAGTGATAATCCGGATGTTCCTCCGAAGTTCATCACCACAGGCGATCCCCCGAATA
CTACAAAGCTTCCAAATCCTGCACATGATCAATGGATTAAACAAGATAGTTTAATCACAGCGTGGTTGCTTGGAGCTATGTCAAATTCGCTACTCTCTGAGATGCTCGAT
TGCGAAACAGCCTGCGAAGTATGGAAAACGCTAAATTCTCGGTTTTCTTCAAGAAATCTTGCACGGGTAATGGACTTGAAGTTGAAATTAGGAACAACAAAGAAGGGCAA
TTTGAAATTAGAGGAGTATTTTGTTAAAATCAAGAATCTGGTCGATTCTTTATCTGTGTTGGGCGAAAAGTAA
Protein sequenceShow/hide protein sequence
MDTAITEGNGANLQSIKIVNPGNKITTVKLEEDNFLLWKLQITTALRGHGLMNHVSDNPDVPPKFITTGDPPNTTKLPNPAHDQWIKQDSLITAWLLGAMSNSLLSEMLD
CETACEVWKTLNSRFSSRNLARVMDLKLKLGTTKKGNLKLEEYFVKIKNLVDSLSVLGEK