; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022159 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022159
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:19870832..19871697
RNA-Seq ExpressionLag0022159
SyntenyLag0022159
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa]1.8e-2642.2Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP
         +V SLLL+Y ARLE++++ DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP

GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa]1.8e-2642.2Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP
         +V SLLL+Y ARLE++++ DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]4.1e-3450Show/hide
Query:  RLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHS
        ++GE++   + ++IW +L  VY+S + A IM L+++LQ +RKD  S++QYL  IK++ DKF+A+ EPLSY DH  ++L+GL SEYN FVTSI N  D  S
Subjt:  RLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHS

Query:  LADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKF-------FSRPSSPFS
        L DVRSLLLAY ARL+K+ +VD LN+ QANL NLS+       QH S+   PKF        S P+SP S
Subjt:  LADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKF-------FSRPSSPFS

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]2.5e-3957.83Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        +GEI+ Y SA++IWE LR VYESSSIA IM   SQLQKI+KD ++++QYL  IKDV D F+AI EPLSY DH  YILEGL SEYNPFV+SI N T+R S+
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSFPFP
        ADVR+LL+ Y +RLEK+T+ D L ++QAN+A+LSI+S   Q +HP      +   R S+P    FP
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSFPFP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]8.6e-2457.14Show/hide
Query:  MVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSVDTLNMVQAN
        M L+++LQKIRKD +S++QYL+ IKDV DKFS + E +SY DH  +IL+GL SEYN FVTSIQN  D  S+ DV SLLL+Y A+LEK+ ++D LN+ QA 
Subjt:  MVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSVDTLNMVQAN

Query:  LANLSISSNQKQ
        L+ LS   N K+
Subjt:  LANLSISSNQKQ

TrEMBL top hitse value%identityAlignment
A0A2P5F9H3 Uncharacterized protein2.1e-2345.24Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        +G I+EY+ A++IW +L  V+ES SIA +M L SQL +I+K  IS+++YL  +K + DK++ I EPLSY D     LEGL  EY+ FVTSI N +DR SL
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLS---ISSNQKQFQHPSQNLKPKFFSRPSSPFSFPF
          V SLL AY  RL +      +N  QANLA  +     +N        +N KP F S P  P +FPF
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLS---ISSNQKQFQHPSQNLKPKFFSRPSSPFSFPF

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE13.5e-2351.16Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        +G+I+EYS+A +IW +L   YES SIA +M L SQLQ+I+K  I +++YL+ +K V D+F+ I EPLSY D    ILEGL  EY+ FVTSI N +DR SL
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQAN
         +V SLL  Y  RL +++    LN  QAN
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQAN

A0A6J1DQX7 uncharacterized protein LOC1110223152.0e-3450Show/hide
Query:  RLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHS
        ++GE++   + ++IW +L  VY+S + A IM L+++LQ +RKD  S++QYL  IK++ DKF+A+ EPLSY DH  ++L+GL SEYN FVTSI N  D  S
Subjt:  RLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHS

Query:  LADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKF-------FSRPSSPFS
        L DVRSLLLAY ARL+K+ +VD LN+ QANL NLS+       QH S+   PKF        S P+SP S
Subjt:  LADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKF-------FSRPSSPFS

A0A7J0DER3 Uncharacterized protein8.9e-2742.2Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP
         +V SLLL+Y ARLE++++ DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP

A0A7J0E8R3 Uncharacterized protein8.9e-2742.2Show/hide
Query:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL
        LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+
Subjt:  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSL

Query:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP
         +V SLLL+Y ARLE++++ DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Subjt:  ADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.4e-0623.38Show/hide
Query:  SSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLL
        ++A +IWE LR +Y + S  H+  LR+QL++  K   +I  Y+  +    D+ + + +P+ + +    +LE L  EY P +  I       +L ++   L
Subjt:  SSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLL

Query:  LAYAARLEKKTSVDTL----------NMVQANLANLSISSNQKQFQHPSQNLKP
        L + +++   +S   +          N    N  N    +N+   ++ + N KP
Subjt:  LAYAARLEKKTSVDTL----------NMVQANLANLSISSNQKQFQHPSQNLKP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCATCCCTATTCCTCGACTACGACTCGTTTTCGATACCCTCCACAGACTTGGTGAGATAATTGAGTATTCCTCTGCTTATGAAATTTGGGAGAATTTGCGTGTTGT
CTATGAATCATCTTCTATAGCTCATATAATGGTTCTTAGATCTCAACTACAGAAAATTAGAAAGGATGTTATCTCAATTACACAATACTTGACTCATATCAAAGACGTTG
ACGACAAGTTCTCAGCCATCGATGAGCCTCTTTCCTATATGGACCATCATGGTTACATTCTTGAAGGACTTGATTCGGAATACAATCCTTTCGTTACCTCCATTCAAAAT
TGCACTGATCGCCACTCCCTTGCTGATGTTCGCAGTCTTCTTCTTGCATATGCAGCTCGTCTGGAAAAGAAAACCTCTGTTGATACGTTAAATATGGTGCAAGCCAATCT
CGCCAATCTTTCGATAAGTTCTAATCAAAAGCAGTTCCAACACCCTTCCCAAAATCTCAAACCAAAATTCTTTTCTAGACCTTCTTCCCCTTTTTCATTTCCATTTCCCT
AG
mRNA sequenceShow/hide mRNA sequence
ATGTTCATCCCTATTCCTCGACTACGACTCGTTTTCGATACCCTCCACAGACTTGGTGAGATAATTGAGTATTCCTCTGCTTATGAAATTTGGGAGAATTTGCGTGTTGT
CTATGAATCATCTTCTATAGCTCATATAATGGTTCTTAGATCTCAACTACAGAAAATTAGAAAGGATGTTATCTCAATTACACAATACTTGACTCATATCAAAGACGTTG
ACGACAAGTTCTCAGCCATCGATGAGCCTCTTTCCTATATGGACCATCATGGTTACATTCTTGAAGGACTTGATTCGGAATACAATCCTTTCGTTACCTCCATTCAAAAT
TGCACTGATCGCCACTCCCTTGCTGATGTTCGCAGTCTTCTTCTTGCATATGCAGCTCGTCTGGAAAAGAAAACCTCTGTTGATACGTTAAATATGGTGCAAGCCAATCT
CGCCAATCTTTCGATAAGTTCTAATCAAAAGCAGTTCCAACACCCTTCCCAAAATCTCAAACCAAAATTCTTTTCTAGACCTTCTTCCCCTTTTTCATTTCCATTTCCCT
AG
Protein sequenceShow/hide protein sequence
MFIPIPRLRLVFDTLHRLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQN
CTDRHSLADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSFPFP