; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027698 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027698
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr8:3697719..3698445
RNA-Seq ExpressionLag0027698
SyntenyLag0027698
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142327.1 uncharacterized protein LOC111012468 [Momordica charantia]1.9e-2654.95Show/hide
Query:  SSSSMSH-PTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEISA
        SSSS +H     S+++ Y+NPY+LHH D T+ +LVSD L E+NYTSWS+ M++ L VKNK G ++G+I  P G  ++SWK CN VV AW++N+LSKEISA
Subjt:  SSSSMSH-PTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEISA

Query:  SINFFDTAREI
        S+ F D+AR+I
Subjt:  SINFFDTAREI

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]2.5e-3440.98Show/hide
Query:  MAEENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINAL
        MA++ +N ++S S       I+QYTNPYFLHH D T+ +LVSDPLT  NYTSWS+ ML+ L VKNK G ++G+I  P G+LLHSW  CN VV +WI+N+L
Subjt:  MAEENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINAL

Query:  SKEISASINFFDTAREI--------------RNF----------------------------HLSSRSTLC-----------------------------
        SKEISASI F D+AREI              R F                             L+S S  C                             
Subjt:  SKEISASINFFDTAREI--------------RNF----------------------------HLSSRSTLC-----------------------------

Query:  ----NDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQT
            + LR QLLLMEPE TI + FSL++QE +QRA +T ++PQT
Subjt:  ----NDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQT

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]5.5e-2635.71Show/hide
Query:  EENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSK
        +++LN ++    P N  V++Q+ NPYFLHH D T+ +LVSD LT+ NYTSWS+ +++ L VKNK G ++G+I  P    LHSW  CN VV +WI N+LSK
Subjt:  EENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSK

Query:  EISASINFFDTAREI--------------RNFHLSSR-STLCND--------------------------------------------------------
        +ISAS+ F D+A EI              R F L    S L  D                                                        
Subjt:  EISASINFFDTAREI--------------RNFHLSSR-STLCND--------------------------------------------------------

Query:  ----LRTQLLLMEPERTITKAFSLIAQEVEQRASITPS
            +R QLLLMEP  TI +AF+L+AQE++QR+   PS
Subjt:  ----LRTQLLLMEPERTITKAFSLIAQEVEQRASITPS

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida]5.0e-2738.36Show/hide
Query:  ENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKE
        EN NS  + +  +  S+++QY NPYFLH  D T+ + +S+ LTESNY SWSQ M + L VKNK G IN  I HP GELL SW  CNGVV AWI+N+LSKE
Subjt:  ENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKE

Query:  ISASINFFDTAREI--------------RNFHL------------------SSRSTLCNDL---------------------------------------
        IS SINF ++ +EI              R F L                  +    L N+L                                       
Subjt:  ISASINFFDTAREI--------------RNFHL------------------SSRSTLCNDL---------------------------------------

Query:  ----RTQLLLMEPERTITKAFSLIAQEVEQRA
            R+QLLLMEPE +I +AFSL+ QE++Q+A
Subjt:  ----RTQLLLMEPERTITKAFSLIAQEVEQRA

XP_038902375.1 uncharacterized protein LOC120089012 [Benincasa hispida]2.6e-2859.29Show/hide
Query:  NLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEI
        +L+   SMS   N +V++QYTN YFLHH D TN ++VS+ LTE+NYTSWSQVM++GL+VKNK G ++G+I    G+LL SW  C+GVV AWI+N+L KEI
Subjt:  NLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEI

Query:  SASINFFDTAREI
        S SINF D+AR+I
Subjt:  SASINFFDTAREI

TrEMBL top hitse value%identityAlignment
A0A5J4ZL66 Uncharacterized protein3.6e-2340.85Show/hide
Query:  SVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHP---IGELLHSWKKCNGVVKAWIINALSKEISASINFFDTAR
        ++ID  ++PYFLHH D    +LVS PLT  NY++WS+ M + L  KNK G I+G+I  P     +L  +W +CN +V +WI+N++SK+++ASI F DTA 
Subjt:  SVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHP---IGELLHSWKKCNGVVKAWIINALSKEISASINFFDTAR

Query:  EI--------------RNFHLSSRSTLCNDLRTQLLLMEPERTITKAFSLIAQEVEQR--ASIT
         +              R F L    +  +  +  +LLM+P   I K FSLI QE  QR  ASIT
Subjt:  EI--------------RNFHLSSRSTLCNDLRTQLLLMEPERTITKAFSLIAQEVEQR--ASIT

A0A6J1CMF8 uncharacterized protein LOC1110124689.1e-2754.95Show/hide
Query:  SSSSMSH-PTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEISA
        SSSS +H     S+++ Y+NPY+LHH D T+ +LVSD L E+NYTSWS+ M++ L VKNK G ++G+I  P G  ++SWK CN VV AW++N+LSKEISA
Subjt:  SSSSMSH-PTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEISA

Query:  SINFFDTAREI
        S+ F D+AR+I
Subjt:  SINFFDTAREI

A0A6J1DIP8 uncharacterized protein LOC1110203991.2e-3440.98Show/hide
Query:  MAEENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINAL
        MA++ +N ++S S       I+QYTNPYFLHH D T+ +LVSDPLT  NYTSWS+ ML+ L VKNK G ++G+I  P G+LLHSW  CN VV +WI+N+L
Subjt:  MAEENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINAL

Query:  SKEISASINFFDTAREI--------------RNF----------------------------HLSSRSTLC-----------------------------
        SKEISASI F D+AREI              R F                             L+S S  C                             
Subjt:  SKEISASINFFDTAREI--------------RNF----------------------------HLSSRSTLC-----------------------------

Query:  ----NDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQT
            + LR QLLLMEPE TI + FSL++QE +QRA +T ++PQT
Subjt:  ----NDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQT

A0A6J1DKR8 uncharacterized protein LOC1110218313.2e-2439.66Show/hide
Query:  LNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEIS
        L+ SSS +  ++ S +D   NPY+LHH D T  +LV+ PLTE NY+SWS+ ML+ L +KNK G I+G+I  PIGELL +W   N VV AWI+N++SKEIS
Subjt:  LNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEIS

Query:  ASINFFDTAREI-----RNFHLSSRSTLCNDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQTCSSDAVSIPP
        +SI F ++AR+I       F  S+   +    R    L + +++++  F+ +    ++     P     CS   VS  P
Subjt:  ASINFFDTAREI-----RNFHLSSRSTLCNDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQTCSSDAVSIPP

A0A6J1DNP7 uncharacterized protein LOC1110220652.6e-2635.71Show/hide
Query:  EENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSK
        +++LN ++    P N  V++Q+ NPYFLHH D T+ +LVSD LT+ NYTSWS+ +++ L VKNK G ++G+I  P    LHSW  CN VV +WI N+LSK
Subjt:  EENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSK

Query:  EISASINFFDTAREI--------------RNFHLSSR-STLCND--------------------------------------------------------
        +ISAS+ F D+A EI              R F L    S L  D                                                        
Subjt:  EISASINFFDTAREI--------------RNFHLSSR-STLCND--------------------------------------------------------

Query:  ----LRTQLLLMEPERTITKAFSLIAQEVEQRASITPS
            +R QLLLMEP  TI +AF+L+AQE++QR+   PS
Subjt:  ----LRTQLLLMEPERTITKAFSLIAQEVEQRASITPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAGAAAATCTCAACTCGTCTTCTTCAATGAGTCATCCCACGAATCAATCTGTAATAGATCAGTATACCAACCCATATTTTCTCCATCATTTCGATGGAACTAA
CCGCATCCTTGTATCTGATCCTCTCACTGAATCGAATTACACATCCTGGAGTCAAGTCATGCTTCTCGGACTGATGGTGAAAAATAAAGAGGGTCTCATCAACGGAACAA
TTGAACATCCTATCGGGGAATTGCTTCATTCTTGGAAGAAATGCAATGGAGTTGTCAAAGCTTGGATCATTAATGCTTTGTCCAAAGAAATTTCCGCCAGCATCAATTTC
TTCGACACCGCCAGAGAAATAAGAAATTTCCACCTTAGTTCAAGATCAACTCTCTGTAACGACCTTCGCACTCAACTCTTATTGATGGAACCTGAACGTACGATCACCAA
AGCCTTCTCTCTAATAGCTCAAGAAGTAGAGCAACGAGCTTCTATAACCCCATCAAATCCGCAGACTTGTTCCAGCGATGCTGTGTCAATACCTCCCGATTACCTTAACC
ACAACCCATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAAGAAAATCTCAACTCGTCTTCTTCAATGAGTCATCCCACGAATCAATCTGTAATAGATCAGTATACCAACCCATATTTTCTCCATCATTTCGATGGAACTAA
CCGCATCCTTGTATCTGATCCTCTCACTGAATCGAATTACACATCCTGGAGTCAAGTCATGCTTCTCGGACTGATGGTGAAAAATAAAGAGGGTCTCATCAACGGAACAA
TTGAACATCCTATCGGGGAATTGCTTCATTCTTGGAAGAAATGCAATGGAGTTGTCAAAGCTTGGATCATTAATGCTTTGTCCAAAGAAATTTCCGCCAGCATCAATTTC
TTCGACACCGCCAGAGAAATAAGAAATTTCCACCTTAGTTCAAGATCAACTCTCTGTAACGACCTTCGCACTCAACTCTTATTGATGGAACCTGAACGTACGATCACCAA
AGCCTTCTCTCTAATAGCTCAAGAAGTAGAGCAACGAGCTTCTATAACCCCATCAAATCCGCAGACTTGTTCCAGCGATGCTGTGTCAATACCTCCCGATTACCTTAACC
ACAACCCATTATAG
Protein sequenceShow/hide protein sequence
MAEENLNSSSSMSHPTNQSVIDQYTNPYFLHHFDGTNRILVSDPLTESNYTSWSQVMLLGLMVKNKEGLINGTIEHPIGELLHSWKKCNGVVKAWIINALSKEISASINF
FDTAREIRNFHLSSRSTLCNDLRTQLLLMEPERTITKAFSLIAQEVEQRASITPSNPQTCSSDAVSIPPDYLNHNPL