; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028178 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028178
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:15012014..15012831
RNA-Seq ExpressionLag0028178
SyntenyLag0028178
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera]2.7e-2340.76Show/hide
Query:  STDCPNTICS-----KSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWER--------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEP
        +T  P T+ +        P+ +   F++G    P KFLD  Q Q NP F+ WER        I    T   +IKK+G+++S+YL++IK++ DK+SA+GEP
Subjt:  STDCPNTICS-----KSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWER--------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEP

Query:  ISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        +SYRD L + L+GL  EY+ FVTSI NR+D  +L++V SLL  Y   LE +     L
Subjt:  ISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]3.6e-2362.92Show/hide
Query:  QLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        ++Q++KKDGLSVSQYL++IK+IT K S+IGEPIS +DH+++I++GLG EYNAFVTSI+NR+D   LEDVR+LLLAY+ RLE +  ++ L
Subjt:  QLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]5.1e-3844.16Show/hide
Query:  FPSYNPHHFTHPQLEF----FNLITLPLSHALHFLNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE----------
        FP   P+    P   F    F  +  PL+  L+  N    K      N + +  L       +LDG+I  PP+FLD  Q QPNP + +WE          
Subjt:  FPSYNPHHFTHPQLEF----FNLITLPLSHALHFLNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE----------

Query:  -----------------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIK
                                           RIMGLKT+LQ ++KDG SVSQYL++IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI 
Subjt:  -----------------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIK

Query:  NRADSPALEDVRSLLLAYEARLESKTVLNNL
        NRADSP+LEDVRSLLLAYEARL+ +  ++ L
Subjt:  NRADSPALEDVRSLLLAYEARLESKTVLNNL

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]5.3e-2764.21Show/hide
Query:  IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        IMG  +QLQ+IKKDGL+VSQYL+QIKD+ D F+AIGEP+SYRDHL++IL+GLGSEYN FV+SI NR + P++ DVR+LL+ Y++RLE +T  ++L
Subjt:  IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]1.8e-2768.09Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        M LK +LQ+I+KD LS+SQYLSQIKD+ DKFS +GE ISYRDHL HILDGLGSEYNAFVTSI+N  D+ ++EDV SLLL+YEA+LE +  +++L
Subjt:  MGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

TrEMBL top hitse value%identityAlignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE14.3e-2234.02Show/hide
Query:  TQPAFPSYNPHHFTH------PQLEFFNLITLPLSHALHF----LNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE
        T P  P+ N +  T+      PQ+    L +  LS +L       N+  RK  +   N I +  L       F+D    +PPK+LDA   Q NP+F+ W+
Subjt:  TQPAFPSYNPHHFTH------PQLEFFNLITLPLSHALHF----LNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE

Query:  R---------------------------------------------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGS
        R                                             +M L +QLQRIKK  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  
Subjt:  R---------------------------------------------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGS

Query:  EYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        EY+ FVTSI NR+D P+L++V SLL  YE RL  +++  NL
Subjt:  EYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

A0A5C7IHH0 Uncharacterized protein8.6e-2359.77Show/hide
Query:  KTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTV
        ++QL  +KK+G +++QYL Q K+I DKF+AIGEP+SYRDHL ++L+GLG EY+AFVTSI+NR D P++EDV SLLL++E RL  +T+
Subjt:  KTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTV

A0A6J1D6N7 uncharacterized protein LOC1110174381.7e-2362.92Show/hide
Query:  QLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        ++Q++KKDGLSVSQYL++IK+IT K S+IGEPIS +DH+++I++GLG EYNAFVTSI+NR+D   LEDVR+LLLAY+ RLE +  ++ L
Subjt:  QLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

A0A6J1DQX7 uncharacterized protein LOC1110223152.5e-3844.16Show/hide
Query:  FPSYNPHHFTHPQLEF----FNLITLPLSHALHFLNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE----------
        FP   P+    P   F    F  +  PL+  L+  N    K      N + +  L       +LDG+I  PP+FLD  Q QPNP + +WE          
Subjt:  FPSYNPHHFTHPQLEF----FNLITLPLSHALHFLNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWE----------

Query:  -----------------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIK
                                           RIMGLKT+LQ ++KDG SVSQYL++IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI 
Subjt:  -----------------------------------RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIK

Query:  NRADSPALEDVRSLLLAYEARLESKTVLNNL
        NRADSP+LEDVRSLLLAYEARL+ +  ++ L
Subjt:  NRADSPALEDVRSLLLAYEARLESKTVLNNL

A5BPS3 Uncharacterized protein1.3e-2340.76Show/hide
Query:  STDCPNTICS-----KSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWER--------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEP
        +T  P T+ +        P+ +   F++G    P KFLD  Q Q NP F+ WER        I    T   +IKK+G+++S+YL++IK++ DK+SA+GEP
Subjt:  STDCPNTICS-----KSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWER--------IMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEP

Query:  ISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL
        +SYRD L + L+GL  EY+ FVTSI NR+D  +L++V SLL  Y   LE +     L
Subjt:  ISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-0423.86Show/hide
Query:  RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLE
        R + L ++L+      + V+ Y  ++K + D    +  P++ R+ + ++L+GL  +++  +  IK+R   P+ +D  ++L   E RL+
Subjt:  RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.4e-0931.87Show/hide
Query:  RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKT
        R +  + +L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  IK+++  P+  + RS+LL  E+RL +K+
Subjt:  RIMGLKTQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCCCACCTTTTTTCGAAATTTGCCCAAATCCTATCAAACTCAGCCAGCTTTTCCATCGTATAACCCTCACCATTTTACCCACCCTCAACTGGAGTTTTTCAACCT
TATTACCCTGCCTCTTTCCCACGCCCTACACTTCCTTAATATCCAATACCGCAAGCTCTCCACCGACTGTCCCAACACCATTTGCTCCAAATCCCTACCCGTCCTTACCT
CAGCCTCTTTTCTTGATGGGTCGATCCAAGCTCCTCCAAAATTTCTTGATGCTCAACAATCTCAGCCGAATCCGGATTTTCTCTCTTGGGAAAGGATAATGGGCCTTAAA
ACACAACTTCAACGAATTAAGAAAGATGGTCTCTCTGTCAGTCAATACTTGTCTCAAATTAAGGATATTACTGATAAGTTTTCAGCTATAGGGGAGCCCATCTCTTATCG
AGATCATTTGGCTCATATCTTAGATGGTCTTGGCAGTGAGTACAATGCCTTTGTCACATCTATTAAGAATCGTGCTGATAGCCCTGCTTTAGAGGATGTTCGTAGCCTTC
TTCTTGCTTATGAGGCTCGTTTAGAAAGCAAAACAGTGTTGAACAACTTAACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCCCACCTTTTTTCGAAATTTGCCCAAATCCTATCAAACTCAGCCAGCTTTTCCATCGTATAACCCTCACCATTTTACCCACCCTCAACTGGAGTTTTTCAACCT
TATTACCCTGCCTCTTTCCCACGCCCTACACTTCCTTAATATCCAATACCGCAAGCTCTCCACCGACTGTCCCAACACCATTTGCTCCAAATCCCTACCCGTCCTTACCT
CAGCCTCTTTTCTTGATGGGTCGATCCAAGCTCCTCCAAAATTTCTTGATGCTCAACAATCTCAGCCGAATCCGGATTTTCTCTCTTGGGAAAGGATAATGGGCCTTAAA
ACACAACTTCAACGAATTAAGAAAGATGGTCTCTCTGTCAGTCAATACTTGTCTCAAATTAAGGATATTACTGATAAGTTTTCAGCTATAGGGGAGCCCATCTCTTATCG
AGATCATTTGGCTCATATCTTAGATGGTCTTGGCAGTGAGTACAATGCCTTTGTCACATCTATTAAGAATCGTGCTGATAGCCCTGCTTTAGAGGATGTTCGTAGCCTTC
TTCTTGCTTATGAGGCTCGTTTAGAAAGCAAAACAGTGTTGAACAACTTAACTTAG
Protein sequenceShow/hide protein sequence
MLPTFFRNLPKSYQTQPAFPSYNPHHFTHPQLEFFNLITLPLSHALHFLNIQYRKLSTDCPNTICSKSLPVLTSASFLDGSIQAPPKFLDAQQSQPNPDFLSWERIMGLK
TQLQRIKKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIKNRADSPALEDVRSLLLAYEARLESKTVLNNLT