CuGenDBv2

Lag0021970 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021970
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr7:15061744..15062333
RNA-Seq Expression: Lag0021970
Synteny: Lag0021970
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (description; e value; % identity; alignment)
RVW64278.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e value 4.4e-13; identity 40.35%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH------GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLN
        M +II LDTAS IW++L K + + + ARIM L+ QLQ        LG+EYN+FV ++ +  ++ SLE++ S+LL +E RLE+Q + ++ N+ QAN++++N
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH------GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLN

Query:  IQQIHRRPSTKPNF
        IQ  +++      F
Subjt:  IQQIHRRPSTKPNF

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]; e value 1.4e-14; identity 52.68%
Query:  GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHP------PNIL
        GLG EYNAFVTSIQNRSD  +LEDVR+LLLAY+ RLEKQ SVDQLN+ QAN+++L   Q++R+     N            VPSQ S  P      P +L
Subjt:  GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHP------PNIL

Query:  GKP--QSVTKWP
        GKP   S   WP
Subjt:  GKP--QSVTKWP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e value 1.8e-35; identity 52.04%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL
        MGE+++L+T   IWSSLT+ YDSKTTARIMGLKT+LQ+                                        GLGSEYNAFVTSI NR+D+PSL
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFP-SPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSK
        EDVRSLLLAYEARL+KQ +VDQLNIAQANL +L++Q   +RP  K +F    K  FP SP+ + QS    +ILGKPQSV KWP      PK S+SK
Subjt:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFP-SPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSK

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]; e value 2.0e-13; identity 34.91%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQ----------------------------------------HGLGSEYNAFVTSIQNRSDNPSL
        MGEI+  ++A  IW +L   Y+S + A IMG  +QLQ                                         GLGSEYN FV+SI NR++ PS+
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQ----------------------------------------HGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQ--------QIHRRPSTKPNFTYVTKFPFPSPVP
         DVR+LL+ Y++RLEKQ + D L + QAN++ L+I         Q H R S + +   V  FP   P P
Subjt:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQ--------QIHRRPSTKPNFTYVTKFPFPSPVP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]; e value 6.8e-14; identity 46.88%
Query:  TQLQHGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSP-----VPSQQSGHP
        T +  GLGSEYNAFVTSIQN  DN S+EDV SLLL+YEA+LEKQ ++D LNIAQA LS L+ Q   +R + +P F   +    PSP     +PS  +   
Subjt:  TQLQHGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSP-----VPSQQSGHP

Query:  PNILGKPQSVTKWPSLGTFTPKSSNSKP
         ++  +P    KWP       K  +SKP
Subjt:  PNILGKPQSVTKWPSLGTFTPKSSNSKP

TrEMBL top hits (description; e value; % identity; alignment)
A0A2P5BGF8 Uncharacterized protein; e value 2.0e-11; identity 35.52%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL
        MG+I+   +A  IW +L + Y S + A+I  L+ +LQ+                                        GL  EYNAFVTSI  R DN  L
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHPPN-------ILGKPQ
        E++ SLLL+YE RLE Q +  QL+  QANL+ LNI +   RP+      + T+  F +     QS HPPN       ILGKPQ
Subjt:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHPPN-------ILGKPQ

A0A438FWG3 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.1e-13; identity 40.35%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH------GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLN
        M +II LDTAS IW++L K + + + ARIM L+ QLQ        LG+EYN+FV ++ +  ++ SLE++ S+LL +E RLE+Q + ++ N+ QAN++++N
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH------GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLN

Query:  IQQIHRRPSTKPNF
        IQ  +++      F
Subjt:  IQQIHRRPSTKPNF

A0A6J1D6N7 uncharacterized protein LOC111017438; e value 6.7e-15; identity 52.68%
Query:  GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHP------PNIL
        GLG EYNAFVTSIQNRSD  +LEDVR+LLLAY+ RLEKQ SVDQLN+ QAN+++L   Q++R+     N            VPSQ S  P      P +L
Subjt:  GLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHP------PNIL

Query:  GKP--QSVTKWP
        GKP   S   WP
Subjt:  GKP--QSVTKWP

A0A6J1DQX7 uncharacterized protein LOC111022315; e value 9.0e-36; identity 52.04%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL
        MGE+++L+T   IWSSLT+ YDSKTTARIMGLKT+LQ+                                        GLGSEYNAFVTSI NR+D+PSL
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQH----------------------------------------GLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFP-SPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSK
        EDVRSLLLAYEARL+KQ +VDQLNIAQANL +L++Q   +RP  K +F    K  FP SP+ + QS    +ILGKPQSV KWP      PK S+SK
Subjt:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFP-SPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSK

A0A7J6FPX2 Uncharacterized protein; e value 2.0e-11; identity 30.22%
Query:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQ----------------------------------------HGLGSEYNAFVTSIQNRSDNPSL
        +G+I+   TA+ IWSSL + Y + + AR+   +T LQ                                        +GLG  YNAFVT I  RS  PS+
Subjt:  MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQ----------------------------------------HGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHPPNILGKPQSVTKWP
        E+V SLLL+Y+ARL++Q +   L+  QAN ++L++ + + +P  +P  ++ + +P+ S  P  +    PN    P+  T +P
Subjt:  EDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTYVTKFPFPSPVPSQQSGHPPNILGKPQSVTKWP

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGGTGAGATTATTAACCTTGATACTGCCTCTGCTATTTGGTCTTCTCTTACTAAGTCTTATGATTCGAAAACGACTGCTAGAATTATGGGTTTAAAGACACAACTCCA
ACATGGGTTGGGTAGTGAATACAATGCCTTCGTTACTTCTATTCAGAATCGGTCTGACAATCCCTCTTTGGAAGATGTTAGGAGCTTATTGTTGGCTTATGAAGCTAGGT
TGGAAAAACAAGTGTCTGTGGATCAACTCAATATTGCACAAGCTAACTTAAGTAGTCTCAACATTCAGCAGATTCATCGTCGTCCTTCTACCAAACCAAACTTTACCTAT
GTTACCAAATTCCCTTTTCCTTCCCCTGTACCTTCACAACAATCTGGCCATCCTCCTAATATTTTAGGCAAACCACAGTCTGTTACAAAATGGCCTTCCCTTGGAACCTT
TACCCCTAAATCCTCTAATTCCAAACCTTAA
mRNA sequence
ATGGGTGAGATTATTAACCTTGATACTGCCTCTGCTATTTGGTCTTCTCTTACTAAGTCTTATGATTCGAAAACGACTGCTAGAATTATGGGTTTAAAGACACAACTCCA
ACATGGGTTGGGTAGTGAATACAATGCCTTCGTTACTTCTATTCAGAATCGGTCTGACAATCCCTCTTTGGAAGATGTTAGGAGCTTATTGTTGGCTTATGAAGCTAGGT
TGGAAAAACAAGTGTCTGTGGATCAACTCAATATTGCACAAGCTAACTTAAGTAGTCTCAACATTCAGCAGATTCATCGTCGTCCTTCTACCAAACCAAACTTTACCTAT
GTTACCAAATTCCCTTTTCCTTCCCCTGTACCTTCACAACAATCTGGCCATCCTCCTAATATTTTAGGCAAACCACAGTCTGTTACAAAATGGCCTTCCCTTGGAACCTT
TACCCCTAAATCCTCTAATTCCAAACCTTAA
Protein sequence
MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQHGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTY
VTKFPFPSPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSKP
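
The record lists a CDS and a protein sequence without showing how one maps to the other. The following is a minimal sketch of a consistency check, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The names `translate`, `CODON_TABLE`, `CDS`, and `PROTEIN` are illustrative and are not part of CuGenDB.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# CDS and protein copied from this record (Lag0021970).
CDS = (
    "ATGGGTGAGATTATTAACCTTGATACTGCCTCTGCTATTTGGTCTTCTCTTACTAAGTCTTATGATTCGAAAACGACTGCTAGAATTATGGGTTTAAAGACACAACTCCA"
    "ACATGGGTTGGGTAGTGAATACAATGCCTTCGTTACTTCTATTCAGAATCGGTCTGACAATCCCTCTTTGGAAGATGTTAGGAGCTTATTGTTGGCTTATGAAGCTAGGT"
    "TGGAAAAACAAGTGTCTGTGGATCAACTCAATATTGCACAAGCTAACTTAAGTAGTCTCAACATTCAGCAGATTCATCGTCGTCCTTCTACCAAACCAAACTTTACCTAT"
    "GTTACCAAATTCCCTTTTCCTTCCCCTGTACCTTCACAACAATCTGGCCATCCTCCTAATATTTTAGGCAAACCACAGTCTGTTACAAAATGGCCTTCCCTTGGAACCTT"
    "TACCCCTAAATCCTCTAATTCCAAACCTTAA"
)
PROTEIN = (
    "MGEIINLDTASAIWSSLTKSYDSKTTARIMGLKTQLQHGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYEARLEKQVSVDQLNIAQANLSSLNIQQIHRRPSTKPNFTY"
    "VTKFPFPSPVPSQQSGHPPNILGKPQSVTKWPSLGTFTPKSSNSKP"
)

if __name__ == "__main__":
    print("CDS length (nt):", len(CDS))
    print("Protein length (aa):", len(PROTEIN))
    print("Translation matches listed protein:", translate(CDS) == PROTEIN)
```

Under that assumption, the 471-nucleotide CDS translates to the 156-residue protein listed above, with the final `TAA` codon acting as the stop.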