; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031607 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031607
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:10669205..10671401
RNA-Seq ExpressionLag0031607
SyntenyLag0031607
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.6e-1743.64Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLK------IPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++    GNKI+ +KL++D FLLWK QILT L  + L++ L+ +S+ P K + + +         PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLK------IPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]7.2e-1846.36Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++     NKI+ +KL +DNFLLWK QILT L  + L++ L+ +S+ P K LI  G       + PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]7.2e-1846.36Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++     NKI+ +KL +DNFLLWK QILT L  + L++ L+ +S+ P K LI  G       + PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]7.0e-2146.23Show/hide
Query:  QTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKI------PNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC
        Q SK+INPG+K++ ++L++DN LLWK QI T L+G+GL+ ++D + D P + +Q  + +        NP Y  W++QD LI AWLLGSM+  +LS+ML+C
Subjt:  QTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKI------PNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC

Query:  ETAREV
        ++ARE+
Subjt:  ETAREV

XP_022159146.1 uncharacterized protein LOC111025572 [Momordica charantia]6.1e-2549.57Show/hide
Query:  SIEKESYEIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGD----LKIPNPVYDHWVRQDSLIIAWLLGSM
        +I     +  + +Q S  +NPGNKI+T+KL+++NFLLW+LQI T L+GHGL   +D ++ IP + +Q+ D    +  PNP + +W RQD LI +WLLGSM
Subjt:  SIEKESYEIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGD----LKIPNPVYDHWVRQDSLIIAWLLGSM

Query:  SNSLLSEMLECETAREV
        S  +LS+MLECETA+EV
Subjt:  SNSLLSEMLECETAREV

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-1743.64Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLK------IPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++    GNKI+ +KL++D FLLWK QILT L  + L++ L+ +S+ P K + + +         PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLK------IPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like3.5e-1846.36Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++     NKI+ +KL +DNFLLWK QILT L  + L++ L+ +S+ P K LI  G       + PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

A0A5D3D5T2 Keratin, type II cytoskeletal 1-like3.5e-1846.36Show/hide
Query:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE
        E SS  ++     NKI+ +KL +DNFLLWK QILT L  + L++ L+ +S+ P K LI  G       + PNP Y  W RQD LI +WLLGSMS  +L++
Subjt:  EISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPK-LIQNGD-----LKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSE

Query:  MLECETAREV
        ML C++A+E+
Subjt:  MLECETAREV

A0A6J1DLT9 uncharacterized protein LOC1110217573.4e-2146.23Show/hide
Query:  QTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKI------PNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC
        Q SK+INPG+K++ ++L++DN LLWK QI T L+G+GL+ ++D + D P + +Q  + +        NP Y  W++QD LI AWLLGSM+  +LS+ML+C
Subjt:  QTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKI------PNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC

Query:  ETAREV
        ++ARE+
Subjt:  ETAREV

A0A6J1E314 uncharacterized protein LOC1110255722.9e-2549.57Show/hide
Query:  SIEKESYEIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGD----LKIPNPVYDHWVRQDSLIIAWLLGSM
        +I     +  + +Q S  +NPGNKI+T+KL+++NFLLW+LQI T L+GHGL   +D ++ IP + +Q+ D    +  PNP + +W RQD LI +WLLGSM
Subjt:  SIEKESYEIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGD----LKIPNPVYDHWVRQDSLIIAWLLGSM

Query:  SNSLLSEMLECETAREV
        S  +LS+MLECETA+EV
Subjt:  SNSLLSEMLECETAREV

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.9e-0729.25Show/hide
Query:  EIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC
        E+ +++ +  ++N  N     KL   N+L+W  Q+     G+ L   LD  + +PP  I        NP Y  W RQD LI + +LG++S S+   +   
Subjt:  EIEISSQTSKSINPGNKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLEC

Query:  ETAREV
         TA ++
Subjt:  ETAREV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.9e-0731.87Show/hide
Query:  NKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREV
        N     KL   N+L+W  Q+     G+ L   LD  + +PP  I    +   NP Y  W RQD LI + +LG++S S+   +    TA ++
Subjt:  NKITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREV

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.5e-0525.84Show/hide
Query:  ITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREV
        I  +  DEDN++ WK++  + LR       +D     P            +P+Y  W + +++++ WL+ SM++ LL  ++  ETA ++
Subjt:  ITTMKLDEDNFLLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCGACCATTCAGCCCGCGCCGCGGGCTGAGCCCAGTGACCTCTTTTCGGTCCCTGATGCCCCGGACCGCCCCGGTTCCGCCTGCTTCTCCTCAGTTTTCTGACTT
AGGCATCGGAGGTGGTGTGGCCTACACCACGCCGGGCAGTGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCTCCAGTTTCTACAAATTCACTGTTGGTGTCACGTGAAGT
ATCACTCAATAGAGAAAGAGAGCTATGAGATTGAAATCTCATCTCAGACCTCGAAGAGCATCAATCCAGGCAACAAGATCACCACAATGAAGCTTGATGAAGACAACTTT
CTACTCTGGAAGTTACAAATTCTTACTACTCTACGAGGACATGGATTAAAGCATCATCTTGATAAAGATTCTGATATTCCACCCAAGCTCATTCAGAATGGTGATTTGAA
AATCCCTAATCCTGTGTATGACCATTGGGTTCGACAGGATAGCTTGATCATTGCCTGGCTACTCGGTTCAATGTCCAATTCCCTCCTTTCAGAAATGCTGGAATGCGAAA
CTGCTCGAGAAGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCGACCATTCAGCCCGCGCCGCGGGCTGAGCCCAGTGACCTCTTTTCGGTCCCTGATGCCCCGGACCGCCCCGGTTCCGCCTGCTTCTCCTCAGTTTTCTGACTT
AGGCATCGGAGGTGGTGTGGCCTACACCACGCCGGGCAGTGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCTCCAGTTTCTACAAATTCACTGTTGGTGTCACGTGAAGT
ATCACTCAATAGAGAAAGAGAGCTATGAGATTGAAATCTCATCTCAGACCTCGAAGAGCATCAATCCAGGCAACAAGATCACCACAATGAAGCTTGATGAAGACAACTTT
CTACTCTGGAAGTTACAAATTCTTACTACTCTACGAGGACATGGATTAAAGCATCATCTTGATAAAGATTCTGATATTCCACCCAAGCTCATTCAGAATGGTGATTTGAA
AATCCCTAATCCTGTGTATGACCATTGGGTTCGACAGGATAGCTTGATCATTGCCTGGCTACTCGGTTCAATGTCCAATTCCCTCCTTTCAGAAATGCTGGAATGCGAAA
CTGCTCGAGAAGTGTGA
Protein sequenceShow/hide protein sequence
MARPFSPRRGLSPVTSFRSLMPRTAPVPPASPQFSDLGIGGGVAYTTPGSGFCWSCRSRLLQFLQIHCWCHVKYHSIEKESYEIEISSQTSKSINPGNKITTMKLDEDNF
LLWKLQILTTLRGHGLKHHLDKDSDIPPKLIQNGDLKIPNPVYDHWVRQDSLIIAWLLGSMSNSLLSEMLECETAREV