; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021142 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021142
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:5014038..5014766
RNA-Seq ExpressionLag0021142
SyntenyLag0021142
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]4.3e-3945.88Show/hide
Query:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA
        +E  ++G+   PPRFLD  + Q NP F  WQ+YNR +MSWIY+S+    +G+I+G ++A +IWE L+ +Y ++S A +  LR+ LQ I+K+GLT   ++ 
Subjt:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA

Query:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVD
        + + + +  ++IGEP++Y DHL Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++Y+ RLE+Q++ D
Subjt:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVD

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]3.2e-4246.19Show/hide
Query:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA
        +E  I+G+ P PPRF D A   VN  +  WQ++NR +MSWIY+SLT   +G+I+G ++A+EIWE L  +Y SSS A++  LR++LQ +RKDGLT  +++ 
Subjt:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA

Query:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSN---NQNETNPI
        + K+I +  +A+GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLL++YE RLE Q +  QL+ +QANLA+ + +      N +NP+
Subjt:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSN---NQNETNPI

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.0e-4051.41Show/hide
Query:  APPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFS
        +PP++LD A  QVNP F  W + N+ +MSWIYSSLT   +G+I+  STA +IW  L   YES S A VM L SQLQ+I+K  + +S++L+++K + D+F+
Subjt:  APPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFS

Query:  AIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNN
         IGEPLSYRD L  ILEGL  EY+ FVTSI NR+DRPSL +V SLL  YE RL +++    LN  QAN     ++N+
Subjt:  AIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNN

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.7e-5959.89Show/hide
Query:  PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSA
        PP+FLD  + Q NP +  W++YNR LM WIYSSL+ +K+GE++   T ++IW  L  VY+S +TAR+MGL+++LQ +RKDG +VSQ+LA+IK+IADKF+A
Subjt:  PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSA

Query:  IGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNP
        +GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PSL DVRSLL+AYE RL+KQ +VDQLN+ QANL N S  +N     P
Subjt:  IGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNP

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]1.9e-4769.29Show/hide
Query:  IGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL
        +GEI+G  +A++IWE L+ VYESSS A +MG  SQLQKI+KDGLTVSQ+LAQIKD+ D F+AIGEPLSYRDHL YILEGLG+EYNPFV+SI NRT+RPS+
Subjt:  IGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL

Query:  ADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQN
        ADVR+LLI Y++RLEKQT+ D L ++QAN+A+ S  N+QN
Subjt:  ADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQN

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein1.5e-4246.19Show/hide
Query:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA
        +E  I+G+ P PPRF D A   VN  +  WQ++NR +MSWIY+SLT   +G+I+G ++A+EIWE L  +Y SSS A++  LR++LQ +RKDGLT  +++ 
Subjt:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA

Query:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSN---NQNETNPI
        + K+I +  +A+GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLL++YE RLE Q +  QL+ +QANLA+ + +      N +NP+
Subjt:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSN---NQNETNPI

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE14.9e-4151.41Show/hide
Query:  APPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFS
        +PP++LD A  QVNP F  W + N+ +MSWIYSSLT   +G+I+  STA +IW  L   YES S A VM L SQLQ+I+K  + +S++L+++K + D+F+
Subjt:  APPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFS

Query:  AIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNN
         IGEPLSYRD L  ILEGL  EY+ FVTSI NR+DRPSL +V SLL  YE RL +++    LN  QAN     ++N+
Subjt:  AIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNN

A0A6J1DQX7 uncharacterized protein LOC1110223151.8e-5959.89Show/hide
Query:  PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSA
        PP+FLD  + Q NP +  W++YNR LM WIYSSL+ +K+GE++   T ++IW  L  VY+S +TAR+MGL+++LQ +RKDG +VSQ+LA+IK+IADKF+A
Subjt:  PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSA

Query:  IGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNP
        +GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PSL DVRSLL+AYE RL+KQ +VDQLN+ QANL N S  +N     P
Subjt:  IGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNP

A0A7J0EGI5 Uncharacterized protein2.1e-3945.88Show/hide
Query:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA
        +E  ++G+   PPRFLD  + Q NP F  WQ+YNR +MSWIY+S+    +G+I+G ++A +IWE L+ +Y ++S A +  LR+ LQ I+K+GLT   ++ 
Subjt:  MESLINGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA

Query:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVD
        + + + +  ++IGEP++Y DHL Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++Y+ RLE+Q++ D
Subjt:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVD

A0A803NL56 Uncharacterized protein1.1e-3739.32Show/hide
Query:  MESLINGTPA-PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA
        +E  I+GT A   +F ++  +QV+P F  W +YN+ LMSW+Y+SL+   +G+I+G +TA EIW  L+  Y ++S AR    R  LQ ++KD L  S +L 
Subjt:  MESLINGTPA-PPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLA

Query:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNPIVPPICP
        ++K + +  +++G+P+S ++HL Y+L GLG EYN FVT I  R  +P++ +V +LL++YE RLE+Q +    + +QAN AN SF   + +++   P   P
Subjt:  QIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNPIVPPICP

Query:  SIPLLP
          P  P
Subjt:  SIPLLP

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-1424.08Show/hide
Query:  LINGTPAPPRFLDT-AETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIK
        L   T  PP  + T A  +VNP +  W++ ++ + S +  +++      +   +TA +IWE L+ +Y + S   V  LR+QL++  K   T+  ++  + 
Subjt:  LINGTPAPPRFLDT-AETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIK

Query:  DIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLA--NFSFSNNQNETN
           D+ + +G+P+ + + +  +LE L  EY P +  I  +   P+L ++   L+ +E+++   +S   + +    ++  N + +NN N  N
Subjt:  DIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLA--NFSFSNNQNETN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.7e-1123.81Show/hide
Query:  LINGTPAPPRFLDT-AETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIK
        L   TP PP  + T A  +VNP +  W++ ++ + S I  +++      +   +TA +IWE L+ +Y + S   V  LR               F+ +  
Subjt:  LINGTPAPPRFLDT-AETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIK

Query:  DIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETN
           D+ + +G+P+ + + +  +LE L  +Y P +  I  +   PSL ++   LI  E++L    S + + +    + + + + N+N+ N
Subjt:  DIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETN

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.8e-0725.74Show/hide
Query:  INGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKD
        I+GT P P  F        +PL+  W++ N  +M W+ +S+T   +  ++   TA+++WE L+ V+      ++  LR +L  +R+ G +V ++  ++  
Subjt:  INGT-PAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKD

Query:  I
        +
Subjt:  I

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.1e-1126.28Show/hide
Query:  WQKYNRTLMSWIYSSLTGDKI-GEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEG
        WQK +  +   +Y +LT  +  G  +  ST+ +IW  +K  + ++  AR + L S+L+      + V+ +  ++K +AD    +  P++ R+ + Y+L G
Subjt:  WQKYNRTLMSWIYSSLTGDKI-GEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEG

Query:  LGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEK
        L  +++  +  I++R   PS  D  ++L   E RL++
Subjt:  LGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGTCTAATCAACGGTACGCCTGCTCCTCCTAGATTTCTGGATACTGCAGAAACTCAGGTAAACCCTCTTTTTCCTGTTTGGCAGAAGTATAATCGCACGTTAAT
GAGCTGGATTTACTCTTCACTGACTGGGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTATGAAATTTGGGAGCATCTTAAAATTGTTTATGAATCGTCTTCTA
CTGCTCGTGTTATGGGGTTAAGGTCTCAATTACAAAAAATTCGTAAAGATGGGCTCACTGTGTCTCAGTTCCTAGCCCAAATAAAGGATATAGCGGATAAGTTCTCAGCC
ATTGGCGAGCCATTGTCATATAGGGACCATCTAGGCTATATTCTCGAAGGTTTAGGAACCGAATATAACCCGTTTGTAACATCAATACAAAACCGCACTGATCGCCCATC
TCTTGCGGATGTCCGTAGTCTGTTGATTGCGTATGAAACCAGGCTCGAAAAACAAACATCTGTCGACCAGTTGAACATGGTACAAGCTAATCTAGCTAATTTCTCCTTTT
CAAATAACCAAAACGAAACCAATCCAATCGTTCCTCCAATATGCCCAAGCATTCCCCTGCTCCCAGACTGCCGCAATCTTTCCCTTCCCTTCAATGCCTTCCTCTCTCAA
TCCCAGTCTTCTTGGCCGACCACAATTCTCACCTCGTCCATATACCAACTCAAATCGCTGGCCTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGTCTAATCAACGGTACGCCTGCTCCTCCTAGATTTCTGGATACTGCAGAAACTCAGGTAAACCCTCTTTTTCCTGTTTGGCAGAAGTATAATCGCACGTTAAT
GAGCTGGATTTACTCTTCACTGACTGGGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTATGAAATTTGGGAGCATCTTAAAATTGTTTATGAATCGTCTTCTA
CTGCTCGTGTTATGGGGTTAAGGTCTCAATTACAAAAAATTCGTAAAGATGGGCTCACTGTGTCTCAGTTCCTAGCCCAAATAAAGGATATAGCGGATAAGTTCTCAGCC
ATTGGCGAGCCATTGTCATATAGGGACCATCTAGGCTATATTCTCGAAGGTTTAGGAACCGAATATAACCCGTTTGTAACATCAATACAAAACCGCACTGATCGCCCATC
TCTTGCGGATGTCCGTAGTCTGTTGATTGCGTATGAAACCAGGCTCGAAAAACAAACATCTGTCGACCAGTTGAACATGGTACAAGCTAATCTAGCTAATTTCTCCTTTT
CAAATAACCAAAACGAAACCAATCCAATCGTTCCTCCAATATGCCCAAGCATTCCCCTGCTCCCAGACTGCCGCAATCTTTCCCTTCCCTTCAATGCCTTCCTCTCTCAA
TCCCAGTCTTCTTGGCCGACCACAATTCTCACCTCGTCCATATACCAACTCAAATCGCTGGCCTACTAA
Protein sequenceShow/hide protein sequence
MESLINGTPAPPRFLDTAETQVNPLFPVWQKYNRTLMSWIYSSLTGDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQLQKIRKDGLTVSQFLAQIKDIADKFSA
IGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYETRLEKQTSVDQLNMVQANLANFSFSNNQNETNPIVPPICPSIPLLPDCRNLSLPFNAFLSQ
SQSSWPTTILTSSIYQLKSLAY