; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004261 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004261
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:2376948..2379996
RNA-Seq ExpressionLag0004261
SyntenyLag0004261
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4372688.1 hypothetical protein F8388_000855 [Cannabis sativa]3.2e-1546.39Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPRSPSS
        I  I +  ++ G+P+S +DHL  +L+GLG  YNAFVT I  RS  P++E+V SLL +Y+ARL++Q +   L+  QANF +L++  +N +P PR PSS
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPRSPSS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]5.5e-2363.37Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-----HNTNRR-PSPRS
        IK+IT K S+IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD   LEDVR+LL AY+ RLEKQNSVD LN+ QAN A+L +     H  N R PS  S
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-----HNTNRR-PSPRS

Query:  P
        P
Subjt:  P

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]9.3e-3172.04Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPR
        IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+DSP+LEDVRSLL AYEARL+KQN+VD LN+AQAN  +L++ + ++RP P+
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPR

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]5.0e-2454.78Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRP--SPRSPSSL
        IKD+ D F+AIGEP+SYRDHL++IL+GLGSEYN FV+SI NR++ P++ DVR+LL  Y++RLEKQ + DHL L QAN A L+I++ NR P     + SS+
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRP--SPRSPSSL

Query:  FLELPSI-HYPFLLP
            PS+  +P +LP
Subjt:  FLELPSI-HYPFLLP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]5.3e-2657.5Show/hide
Query:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-HNTNRRPSP
        L +    IKD+ DKFS +GE ISYRDHL HILDGLGSEYNAFVTSIQN  D+ ++EDV SLL +YEA+LEKQN++DHLN+AQA  + L+  HN+ R    
Subjt:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-HNTNRRPSP

Query:  RSPSSLFLELPSIHYPFLLP
          P+   L LPS ++  +LP
Subjt:  RSPSSLFLELPSIHYPFLLP

TrEMBL top hitse value%identityAlignment
A0A438FTV3 Uncharacterized protein4.5e-1548.48Show/hide
Query:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSP
        +G+    +K I +  +AIGEP+S +DHL ++  GL  EYN FVTSIQNRSD P +E + SLL +Y+ RLE+QN V  LN AQ + A LN       P P
Subjt:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSP

A0A5C7IHH0 Uncharacterized protein1.6e-1548.96Show/hide
Query:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRR
        + +     K+I DKF+AIGEP+SYRDHL ++L+GLG EY+AFVTSI+NR D P++EDV SLL ++E RL K+      +L +    +LN H+   R
Subjt:  LGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRR

A0A6J1D6N7 uncharacterized protein LOC1110174382.7e-2363.37Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-----HNTNRR-PSPRS
        IK+IT K S+IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD   LEDVR+LL AY+ RLEKQNSVD LN+ QAN A+L +     H  N R PS  S
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNI-----HNTNRR-PSPRS

Query:  P
        P
Subjt:  P

A0A6J1DQX7 uncharacterized protein LOC1110223154.5e-3172.04Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPR
        IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+DSP+LEDVRSLL AYEARL+KQN+VD LN+AQAN  +L++ + ++RP P+
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPR

A0A7J6FPX2 Uncharacterized protein1.6e-1546.39Show/hide
Query:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPRSPSS
        I  I +  ++ G+P+S +DHL  +L+GLG  YNAFVT I  RS  P++E+V SLL +Y+ARL++Q +   L+  QANF +L++  +N +P PR PSS
Subjt:  IKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVDHLNLAQANFASLNIHNTNRRPSPRSPSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCTTAGAAACCCATGTATCAGGAGGACATCCAAATCTTCTTACATGCCTCTCGTGTGGGCATCCTAGGATGTCCGGGCATCCCGAGATGCCTAGGCATCTTAC
ATCACCACAAAACAAACGGGGTAATCCAGATGCCCATTCCATGGGAATCTTGGATTACCGCAAAACAAACGGCCCCTTAGGAATTCCAACCTCTAATTTTGAAGTGATTT
TGGACCACACAGAGGGACAAGGAGCTGAGGAGGACCATCGGACAGAGGTAGGACCAAAAGCCCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCTAAAATGGCCCGAC
CCATATGGTCGGCCTCGGCATATATGGTCGGCCTCGGCCGACCATTCGGCCCGTTTGCGCGGGCCGAGCCCAGTGACCTCTTTTCGGTCCCTGATGCCCCGAATCGCCCC
GGTTCCGCCTAGTTCGTCCCGAAACACCACCGAATTCCTAAAAATCCTAGGAGGACAAGCAGCTTCTCCTCAGTTTTCTGACTTAGGCATCAGAGGCGGTGTGGCCTACA
CCACGCCGGTGTGCAGCGGTTTTTGCTGGTCAGGTGAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGCTGAACATATTTTGATTTATGCATCACAGTACCTTTTCCCC
TGCTTTGTTCTTCTTTCCCGCTTATGGTATCAGAGACACAAGGTTCAAATCATGTCGTCTGAAGCCTCTACCTCATCATCTTCTTCTTTGTCTGCGCCGATTACCCCATC
GATCGTCACTCCCTCCACTCCAACAACTACTCCAGTGGTTTCCCCTATTGCTTCTCAGCCGCATCCCACGGTTTATCAAAATCGACCCAATGTCCCTCAAACTCAACCTC
CTTCAATCCTTATCAACAACCTTTTATCCATCCTCAACCTTCTTCCAGCCGTTTTACCCATCTTCTTTCCACGCCCTCAACCATTTTTTCAACCACCACAGCTTCTAAAT
GCAGTGTTGGCTAATGGTCTCCATGGTTTCTCGATGGATCAATCCCTGCTCCACCGAAGTTTCTTGATGCTCAGCAATCTCAACCGAATCCGGATTTTCTTACTTGGGAA
AGTATTTATCTCAATTAAGGATATAACGGATAAATTTTCTGCTATTGGGGAACCCATCTCATATAGAGATCATTTGGCTCATATATTGGATGGTCTTGGAAGTGAATATA
ATGCCTTCGTGACTTCGATACAAAATCGTTCTGATAGTCCAGCCTTAGAGGATGTTCGCAGTCTCCTTTCTGCTTATGAGGCACGTTTAGAGAAGCAAAATAGTGTTGAT
CACCTTAACTTAGCTCAAGCGAATTTTGCTAGTCTCAACATTCACAATACTAATCGTCGTCCTTCACCTCGTTCTCCTTCATCCCTTTTCCTAGAACTCCCTTCAATCCA
TTATCCTTTTCTTCTTCCCCTGCTGCTTCAAATAGTTTTTCTCCAAGCCTCCTTGGTAAGCCTCAATCTCAACCCCTTCACAAATGGCCTTCCCGCCCCAATTCCAACCG
ACCACAATGTCAAATCTGTGGCAAATTTGGTCACACGGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATCTTAGAAACCCATGTATCAGGAGGACATCCAAATCTTCTTACATGCCTCTCGTGTGGGCATCCTAGGATGTCCGGGCATCCCGAGATGCCTAGGCATCTTAC
ATCACCACAAAACAAACGGGGTAATCCAGATGCCCATTCCATGGGAATCTTGGATTACCGCAAAACAAACGGCCCCTTAGGAATTCCAACCTCTAATTTTGAAGTGATTT
TGGACCACACAGAGGGACAAGGAGCTGAGGAGGACCATCGGACAGAGGTAGGACCAAAAGCCCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCTAAAATGGCCCGAC
CCATATGGTCGGCCTCGGCATATATGGTCGGCCTCGGCCGACCATTCGGCCCGTTTGCGCGGGCCGAGCCCAGTGACCTCTTTTCGGTCCCTGATGCCCCGAATCGCCCC
GGTTCCGCCTAGTTCGTCCCGAAACACCACCGAATTCCTAAAAATCCTAGGAGGACAAGCAGCTTCTCCTCAGTTTTCTGACTTAGGCATCAGAGGCGGTGTGGCCTACA
CCACGCCGGTGTGCAGCGGTTTTTGCTGGTCAGGTGAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGCTGAACATATTTTGATTTATGCATCACAGTACCTTTTCCCC
TGCTTTGTTCTTCTTTCCCGCTTATGGTATCAGAGACACAAGGTTCAAATCATGTCGTCTGAAGCCTCTACCTCATCATCTTCTTCTTTGTCTGCGCCGATTACCCCATC
GATCGTCACTCCCTCCACTCCAACAACTACTCCAGTGGTTTCCCCTATTGCTTCTCAGCCGCATCCCACGGTTTATCAAAATCGACCCAATGTCCCTCAAACTCAACCTC
CTTCAATCCTTATCAACAACCTTTTATCCATCCTCAACCTTCTTCCAGCCGTTTTACCCATCTTCTTTCCACGCCCTCAACCATTTTTTCAACCACCACAGCTTCTAAAT
GCAGTGTTGGCTAATGGTCTCCATGGTTTCTCGATGGATCAATCCCTGCTCCACCGAAGTTTCTTGATGCTCAGCAATCTCAACCGAATCCGGATTTTCTTACTTGGGAA
AGTATTTATCTCAATTAAGGATATAACGGATAAATTTTCTGCTATTGGGGAACCCATCTCATATAGAGATCATTTGGCTCATATATTGGATGGTCTTGGAAGTGAATATA
ATGCCTTCGTGACTTCGATACAAAATCGTTCTGATAGTCCAGCCTTAGAGGATGTTCGCAGTCTCCTTTCTGCTTATGAGGCACGTTTAGAGAAGCAAAATAGTGTTGAT
CACCTTAACTTAGCTCAAGCGAATTTTGCTAGTCTCAACATTCACAATACTAATCGTCGTCCTTCACCTCGTTCTCCTTCATCCCTTTTCCTAGAACTCCCTTCAATCCA
TTATCCTTTTCTTCTTCCCCTGCTGCTTCAAATAGTTTTTCTCCAAGCCTCCTTGGTAAGCCTCAATCTCAACCCCTTCACAAATGGCCTTCCCGCCCCAATTCCAACCG
ACCACAATGTCAAATCTGTGGCAAATTTGGTCACACGGCACTGA
Protein sequenceShow/hide protein sequence
MGILETHVSGGHPNLLTCLSCGHPRMSGHPEMPRHLTSPQNKRGNPDAHSMGILDYRKTNGPLGIPTSNFEVILDHTEGQGAEEDHRTEVGPKARPRGRPDQRVGLKWPD
PYGRPRHIWSASADHSARLRGPSPVTSFRSLMPRIAPVPPSSSRNTTEFLKILGGQAASPQFSDLGIRGGVAYTTPVCSGFCWSGESSVRILASTVAEHILIYASQYLFP
CFVLLSRLWYQRHKVQIMSSEASTSSSSSLSAPITPSIVTPSTPTTTPVVSPIASQPHPTVYQNRPNVPQTQPPSILINNLLSILNLLPAVLPIFFPRPQPFFQPPQLLN
AVLANGLHGFSMDQSLLHRSFLMLSNLNRIRIFLLGKVFISIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDSPALEDVRSLLSAYEARLEKQNSVD
HLNLAQANFASLNIHNTNRRPSPRSPSSLFLELPSIHYPFLLPLLLQIVFLQASLVSLNLNPFTNGLPAPIPTDHNVKSVANLVTRH