; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012057 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012057
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:36925984..36929763
RNA-Seq ExpressionLag0012057
SyntenyLag0012057
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142327.1 uncharacterized protein LOC111012468 [Momordica charantia]2.3e-1851.4Show/hide
Query:  PSEIPVVNPPSQVVASSSNQ-----SVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVT
        PS IP     + V +SS+       S+L  Y++PY+LHHSD+TS+VLVS+ L E NYTSW ++M I LTVKNK  FVDGS++RP+  ++ SW ICN+VV 
Subjt:  PSEIPVVNPPSQVVASSSNQ-----SVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVT

Query:  ATSISSL
        A  ++SL
Subjt:  ATSISSL

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]1.6e-1964.94Show/hide
Query:  YADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL
        Y +PYFLHHSD+TS+VLVS+PLT  NYTSW ++M I LTVKNK  FVDGS+ RPT D L SW+ICN+VV +  ++SL
Subjt:  YADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]1.2e-1960.71Show/hide
Query:  NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL
        N  V+  +A+PYFLHHSD+TS+VLVS+ LT+ NYTSW +++ I LTVKNK  FVDGS++RPTD RL SW+ICN+VV +   +SL
Subjt:  NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL

XP_038887186.1 uncharacterized protein LOC120077373 [Benincasa hispida]1.1e-1754Show/hide
Query:  NPPSQVVASSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL-KSVF
        N  S     +S  S L  Y +PYFLH SDSTS+VLVSN LTE+NY SW QAM IGLTVKNK  F++G + RP+ + L SW+I N +VT   ++SL K +F
Subjt:  NPPSQVVASSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL-KSVF

XP_038902375.1 uncharacterized protein LOC120089012 [Benincasa hispida]1.4e-2052.59Show/hide
Query:  MAGPSEIPVVNPPSQVVASSS-----------NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGS
        MAG   +P    P+    SS+           N +VL  Y + YFLHHSDST++V+VSN LTETNYTSW Q M IGL VKNK  FVDGS+ R T D L S
Subjt:  MAGPSEIPVVNPPSQVVASSS-----------NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGS

Query:  WLICNSVVTATSISSL
        W+IC+ VVTA  ++SL
Subjt:  WLICNSVVTATSISSL

TrEMBL top hitse value%identityAlignment
A0A5D3BXU2 Retrotran_gag_3 domain-containing protein1.8e-1345.28Show/hide
Query:  SEIPVVNPPSQVVASSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISS
        S IP  N  S +  + +  SV+ LY +  +LHHS++T+++LVS+ L E+NYTSW  A  +GLTVK K  F+D +LT  TD    SW+ICNS+VTA  ++S
Subjt:  SEIPVVNPPSQVVASSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISS

Query:  LKSVFL
        +  + L
Subjt:  LKSVFL

A0A6J1CMF8 uncharacterized protein LOC1110124681.1e-1851.4Show/hide
Query:  PSEIPVVNPPSQVVASSSNQ-----SVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVT
        PS IP     + V +SS+       S+L  Y++PY+LHHSD+TS+VLVS+ L E NYTSW ++M I LTVKNK  FVDGS++RP+  ++ SW ICN+VV 
Subjt:  PSEIPVVNPPSQVVASSSNQ-----SVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVT

Query:  ATSISSL
        A  ++SL
Subjt:  ATSISSL

A0A6J1DIP8 uncharacterized protein LOC1110203997.7e-2064.94Show/hide
Query:  YADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL
        Y +PYFLHHSD+TS+VLVS+PLT  NYTSW ++M I LTVKNK  FVDGS+ RPT D L SW+ICN+VV +  ++SL
Subjt:  YADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL

A0A6J1DKR8 uncharacterized protein LOC1110218316.3e-1447.13Show/hide
Query:  SSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL
        SSS+ S +    +PY+LHH+D+T +VLV+ PLTE NY+SW ++M I L++KNK  F+DGS++RP  + L +W+  N VV A  ++S+
Subjt:  SSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL

A0A6J1DNP7 uncharacterized protein LOC1110220655.9e-2060.71Show/hide
Query:  NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL
        N  V+  +A+PYFLHHSD+TS+VLVS+ LT+ NYTSW +++ I LTVKNK  FVDGS++RPTD RL SW+ICN+VV +   +SL
Subjt:  NQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTCCTTCTGAAATTCCCGTGGTTAATCCACCTTCTCAGGTCGTCGCCTCTTCCTCAAATCAGTCAGTTTTGCACCTCTATGCTGATCCCTATTTTCTCCATCA
CTCTGATAGTACGAGCATTGTTCTTGTCTCCAATCCGCTAACAGAGACCAATTACACTTCGTGGTGTCAAGCCATGACCATCGGTTTAACTGTAAAGAATAAGTTCTGCT
TTGTTGACGGCTCGCTTACTCGCCCGACTGACGATCGTCTTGGTTCCTGGCTCATCTGCAATAGTGTGGTCACAGCGACATCAATATCTTCCCTTAAAAGTGTCTTCCTT
CTTCCTCCCTTCCCAGCCCTCTTTCAGTCTGTTCGTGCTCTTCGCGCCGGTAAGAAACCCACAGATCTGGAACCAGACCGTTCATGCTCCTCTGTCGGTAAGAAACCCAC
AAATTTGGGCGGAACGAAACCCACAAATTTGTTCGCGCTCGCGTCGGCAAGAAACCCACTGATCTGGAATCAGTCCGTTCGTGCTCCTCCGTCGGCAAGAAACCCACAAA
TTTGGGCTGAACGAAACCCACAAATCTGTTTGTGCTCCTCGTGCCGGCAAGAAACCCATTTCGATTCCAGCTTGAAACAGGGAAACGAACGGCTGGGCGGAACGATTTGT
GGCGGCGCAAGGACGGACGGTTGGGTAGAACAAACAGCGCGAGGACGGAGTTGCGGTACGCAGACGGTTCCAAATCTCACCGGCATGGGTTGTTTTCTTCTCCTCTACAC
AAAATCGCGACTGACGGGCTGGCAGGGGAAGGCAAAATCCGATGGTAGCGGCGGCGGACGTGGACAGAGGTCGGCGATGGTGTTGGTTGCCGATGGTGGGCTGATTGCGA
TTAGGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTCCTTCTGAAATTCCCGTGGTTAATCCACCTTCTCAGGTCGTCGCCTCTTCCTCAAATCAGTCAGTTTTGCACCTCTATGCTGATCCCTATTTTCTCCATCA
CTCTGATAGTACGAGCATTGTTCTTGTCTCCAATCCGCTAACAGAGACCAATTACACTTCGTGGTGTCAAGCCATGACCATCGGTTTAACTGTAAAGAATAAGTTCTGCT
TTGTTGACGGCTCGCTTACTCGCCCGACTGACGATCGTCTTGGTTCCTGGCTCATCTGCAATAGTGTGGTCACAGCGACATCAATATCTTCCCTTAAAAGTGTCTTCCTT
CTTCCTCCCTTCCCAGCCCTCTTTCAGTCTGTTCGTGCTCTTCGCGCCGGTAAGAAACCCACAGATCTGGAACCAGACCGTTCATGCTCCTCTGTCGGTAAGAAACCCAC
AAATTTGGGCGGAACGAAACCCACAAATTTGTTCGCGCTCGCGTCGGCAAGAAACCCACTGATCTGGAATCAGTCCGTTCGTGCTCCTCCGTCGGCAAGAAACCCACAAA
TTTGGGCTGAACGAAACCCACAAATCTGTTTGTGCTCCTCGTGCCGGCAAGAAACCCATTTCGATTCCAGCTTGAAACAGGGAAACGAACGGCTGGGCGGAACGATTTGT
GGCGGCGCAAGGACGGACGGTTGGGTAGAACAAACAGCGCGAGGACGGAGTTGCGGTACGCAGACGGTTCCAAATCTCACCGGCATGGGTTGTTTTCTTCTCCTCTACAC
AAAATCGCGACTGACGGGCTGGCAGGGGAAGGCAAAATCCGATGGTAGCGGCGGCGGACGTGGACAGAGGTCGGCGATGGTGTTGGTTGCCGATGGTGGGCTGATTGCGA
TTAGGGTTTGA
Protein sequenceShow/hide protein sequence
MAGPSEIPVVNPPSQVVASSSNQSVLHLYADPYFLHHSDSTSIVLVSNPLTETNYTSWCQAMTIGLTVKNKFCFVDGSLTRPTDDRLGSWLICNSVVTATSISSLKSVFL
LPPFPALFQSVRALRAGKKPTDLEPDRSCSSVGKKPTNLGGTKPTNLFALASARNPLIWNQSVRAPPSARNPQIWAERNPQICLCSSCRQETHFDSSLKQGNERLGGTIC
GGARTDGWVEQTARGRSCGTQTVPNLTGMGCFLLLYTKSRLTGWQGKAKSDGSGGGRGQRSAMVLVADGGLIAIRV