CuGenDBv2

Lag0022757 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022757
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:37295028..37295540
RNA-Seq Expression: Lag0022757
Synteny: Lag0022757
Gene Ontology terms: NA
InterPro domains: NA
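
The genome location can be converted into a feature length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1


chrom, start, end = parse_location("chr7:37295028..37295540")
length = span_length(start, end)
# Report the chromosome, the span in bp, and whether it is a whole number of codons.
print(chrom, length, length % 3 == 0)
```

Under that assumption the span is 513 bp, a multiple of three, which is what one would expect if the locus is a single uninterrupted coding region.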


Homology
GenBank top hits
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 7.4e-30; % identity: 44.08)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I   GNKI+ VKL+++ FLLWK Q+LT L  + L  ++E ++E P  +L S ++S  S+   PNPAY +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W+ L   FSSR +A+ M  K+KL   KKG++ L+EYF ++   VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] (e value: 2.4e-28; % identity: 42.76)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I    NKI+ VKL ++NFLLWK Q+LT L  + L  ++E ++E P  +L S  +S  S+ + PNP Y +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W  L   FSSR +A+ M  K+KL   KK ++ L+EYF ++++ VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 7.4e-30; % identity: 44.08)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I   GNKI+ VKL+++ FLLWK Q+LT L  + L  ++E ++E P  +L S ++S  S+   PNPAY +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W+ L   FSSR +A+ M  K+KL   KKG++ L+EYF ++   VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] (e value: 1.1e-36; % identity: 48.52)
Query:  LSSSKISTSADITQHSKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPS--NDTSSSVKLPNPAYDLWVRQDSLITAWLFG
        LSS + S +A I Q SK INPG+K++ V+L+++N LLWK Q+ T L  + L  YI+ + + P  F+ +  +++SSS    NPAY  W++QD LI+AWL G
Subjt:  LSSSKISTSADITQHSKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPS--NDTSSSVKLPNPAYDLWVRQDSLITAWLFG

Query:  SMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMAVA
        SM+  ++S+MLDC    E+W +L   F+SR +AR+M LK KLE  KKGNL L++YF ++KNLVDS+A+A
Subjt:  SMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMAVA

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] (e value: 3.7e-29; % identity: 51.15)
Query:  KLQVLTVLCEHRLTKYIEEDAEIPDNFLPSND--TSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDL
        K QVLT +  H L +YI+ D E P  F+ + D  TSS+ + PNP Y  W++QD LI+ WL GSMS  ++S+MLDC    E+W +L   F+SRN+AR+M L
Subjt:  KLQVLTVLCEHRLTKYIEEDAEIPDNFLPSND--TSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDL

Query:  KSKLETCKKGNLKLEEYFQRVKNLVDSMAVA
        KSKLE  KKG++ L+ YF ++KNLVDS+A A
Subjt:  KSKLETCKKGNLKLEEYFQRVKNLVDSMAVA

TrEMBL top hits
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.6e-30; % identity: 44.08)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I   GNKI+ VKL+++ FLLWK Q+LT L  + L  ++E ++E P  +L S ++S  S+   PNPAY +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W+ L   FSSR +A+ M  K+KL   KKG++ L+EYF ++   VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.6e-30; % identity: 44.08)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I   GNKI+ VKL+++ FLLWK Q+LT L  + L  ++E ++E P  +L S ++S  S+   PNPAY +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W+ L   FSSR +A+ M  K+KL   KKG++ L+EYF ++   VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like (e value: 1.2e-28; % identity: 42.76)
Query:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK
        ++I    NKI+ VKL ++NFLLWK Q+LT L  + L  ++E ++E P  +L S  +S  S+ + PNP Y +W RQD LI++WL GSMS  ++++ML C  
Subjt:  SKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTS--SSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDK

Query:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA
          E+W  L   FSSR +A+ M  K+KL   KK ++ L+EYF ++++ VD++A
Subjt:  ECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMA

A0A6J1DLT9 uncharacterized protein LOC111021757 (e value: 5.2e-37; % identity: 48.52)
Query:  LSSSKISTSADITQHSKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPS--NDTSSSVKLPNPAYDLWVRQDSLITAWLFG
        LSS + S +A I Q SK INPG+K++ V+L+++N LLWK Q+ T L  + L  YI+ + + P  F+ +  +++SSS    NPAY  W++QD LI+AWL G
Subjt:  LSSSKISTSADITQHSKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPS--NDTSSSVKLPNPAYDLWVRQDSLITAWLFG

Query:  SMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMAVA
        SM+  ++S+MLDC    E+W +L   F+SR +AR+M LK KLE  KKGNL L++YF ++KNLVDS+A+A
Subjt:  SMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMAVA

A0A6J1DSS1 uncharacterized protein LOC111023586 (e value: 1.8e-29; % identity: 51.15)
Query:  KLQVLTVLCEHRLTKYIEEDAEIPDNFLPSND--TSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDL
        K QVLT +  H L +YI+ D E P  F+ + D  TSS+ + PNP Y  W++QD LI+ WL GSMS  ++S+MLDC    E+W +L   F+SRN+AR+M L
Subjt:  KLQVLTVLCEHRLTKYIEEDAEIPDNFLPSND--TSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNHFSSRNMARMMDL

Query:  KSKLETCKKGNLKLEEYFQRVKNLVDSMAVA
        KSKLE  KKG++ L+ YF ++KNLVDS+A A
Subjt:  KSKLETCKKGNLKLEEYFQRVKNLVDSMAVA

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 1.5e-07; % identity: 22.06)
Query:  INTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNH
        I  +  DE+N++ WK++  + L   +   +I+     PD F             +P Y  W + ++++  WL  SM++ L+  ++  +   ++W+ L   
Subjt:  INTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEMLDCDKECEVWKILNNH

Query:  FSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNL
        F      ++  L+ +L T ++G   +EEYF ++  +
Subjt:  FSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 2.3e-05; % identity: 25.56)
Query:  LPNPAYDL-WVRQDSLITAWLFGSMS-NSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSM
        LP  A D+ W ++D ++   L+G+++        +      ++W  + N F +   AR + L S+L T   G++++ +Y++++K L DS+
Subjt:  LPNPAYDL-WVRQDSLITAWLFGSMS-NSLISEMLDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSM


Sequences
CDS sequence
ATGGATCTCTCTTCTTCAAAGATCAGTACTTCTGCAGATATTACGCAACACTCCAAGATTATCAACCCGGGAAACAAGATCAATACCGTCAAGCTTGACGAAGAAAATTT
CCTTCTATGGAAATTACAAGTTCTCACTGTATTATGCGAGCACAGGTTGACGAAATACATCGAGGAAGACGCTGAAATTCCGGACAACTTTCTTCCCTCAAATGATACTT
CTTCTTCTGTGAAATTACCAAATCCAGCATATGATCTTTGGGTTCGTCAAGATAGCTTGATCACAGCTTGGCTTTTTGGCTCTATGTCGAATTCCCTTATCTCCGAAATG
CTTGACTGTGACAAAGAGTGTGAGGTCTGGAAGATTCTTAACAATCATTTCTCCTCAAGGAATATGGCAAGGATGATGGATTTAAAGTCCAAATTGGAGACGTGCAAGAA
AGGAAATCTCAAATTGGAGGAGTATTTTCAGAGAGTCAAGAATCTTGTTGATTCCATGGCAGTCGCCGAATAA
mRNA sequence
ATGGATCTCTCTTCTTCAAAGATCAGTACTTCTGCAGATATTACGCAACACTCCAAGATTATCAACCCGGGAAACAAGATCAATACCGTCAAGCTTGACGAAGAAAATTT
CCTTCTATGGAAATTACAAGTTCTCACTGTATTATGCGAGCACAGGTTGACGAAATACATCGAGGAAGACGCTGAAATTCCGGACAACTTTCTTCCCTCAAATGATACTT
CTTCTTCTGTGAAATTACCAAATCCAGCATATGATCTTTGGGTTCGTCAAGATAGCTTGATCACAGCTTGGCTTTTTGGCTCTATGTCGAATTCCCTTATCTCCGAAATG
CTTGACTGTGACAAAGAGTGTGAGGTCTGGAAGATTCTTAACAATCATTTCTCCTCAAGGAATATGGCAAGGATGATGGATTTAAAGTCCAAATTGGAGACGTGCAAGAA
AGGAAATCTCAAATTGGAGGAGTATTTTCAGAGAGTCAAGAATCTTGTTGATTCCATGGCAGTCGCCGAATAA
Protein sequence
MDLSSSKISTSADITQHSKIINPGNKINTVKLDEENFLLWKLQVLTVLCEHRLTKYIEEDAEIPDNFLPSNDTSSSVKLPNPAYDLWVRQDSLITAWLFGSMSNSLISEM
LDCDKECEVWKILNNHFSSRNMARMMDLKSKLETCKKGNLKLEEYFQRVKNLVDSMAVAE
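
As a sanity check, the CDS can be translated and compared with the protein sequence above. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not say which translation table was used); `translate` is a hypothetical helper, and only the first 36 nt of the CDS are used to keep the example short.

```python
# Build the standard genetic code (NCBI translation table 1) in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nt of the CDS listed on this page.
cds_start = "ATGGATCTCTCTTCTTCAAAGATCAGTACTTCTGCA"
print(translate(cds_start))
```

To check the whole record, join the wrapped CDS lines into one string before calling `translate` and compare the result with the joined protein lines.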