; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027262 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027262
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr10:46303380..46307825
RNA-Seq ExpressionLag0027262
SyntenyLag0027262
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63649.1 hypothetical protein VITISV_037657 [Vitis vinifera]4.1e-2128.41Show/hide
Query:  FGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------------------------
        F  ++ P+L     V+L+ +N+LLW+ Q+LN ++ANGL   + G I AP +FL   +  +NPE+ IW+                                
Subjt:  FGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------------------------

Query:  --------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RS
                +EYN+FV +  +R++  +LE++ S+LL +E  LE+++  E+ N+ QAN++++++Q  N+++          R + NQ             R 
Subjt:  --------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RS

Query:  PFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRS-------SSNKPQCQICGKFGHTALICHHRTNLAYQ
         +N   G+F   P       G+    + H +S R+       S+ KPQCQ+CGK+GH A+ C+HR +  YQ
Subjt:  PFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRS-------SSNKPQCQICGKFGHTALICHHRTNLAYQ

CAN73380.1 hypothetical protein VITISV_032547 [Vitis vinifera]1.0e-2743.54Show/hide
Query:  HPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWESEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQL
        +PL +KL+ +N+L+WKNQLLN V+ NGL G LD +   PPKFLD QQL +NPE+ +W   YN  + S    +     E+V + ++ Y    E    + Q+
Subjt:  HPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWESEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQL

Query:  --NIAQANLSSL--HLQSANRRSSPRPSSNQPIRSP--FNPPTGSFPPNPPFSPSILGKPQA-PSTHKW--SPRSSSN--KPQCQICGKFGHTALICHHR
          + + A L  L   LQS  ++   +  S+    +P  F PPT +      F PSILG+PQ  P    W   P +S N  KP+CQICGKFGHTALICHHR
Subjt:  --NIAQANLSSL--HLQSANRRSSPRPSSNQPIRSP--FNPPTGSFPPNPPFSPSILGKPQA-PSTHKW--SPRSSSN--KPQCQICGKFGHTALICHHR

Query:  TNLAYQTPS
         NL YQ PS
Subjt:  TNLAYQTPS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]3.5e-2047.74Show/hide
Query:  EYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHL--QSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPFSPSILGKPQAP
        EYNAFVTS QNR D   LEDVR+LLLAY+  LEK+N V+QLN+ QAN+++L L  QS + R+   PS + P   PFN  T          P +LGKP   
Subjt:  EYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHL--QSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPFSPSILGKPQAP

Query:  STHKWSPRSSS---NKPQCQICGKFGHTALICHHRTNLAYQ-------TPSPQAL
        S   W P   S    K QCQIC K GHT   C+HR NL Y+        P+P AL
Subjt:  STHKWSPRSSS---NKPQCQICGKFGHTALICHHRTNLAYQ-------TPSPQAL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.2e-5242.65Show/hide
Query:  FPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------
        FP PT      PP   NPF  N +PTLP PL VKLNDNNFLLWKNQLLNAV+ANGL G+LDG+I  PP+FLD  QLQ NP +  WE              
Subjt:  FPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------

Query:  -------------------------------------------------------------------------------------SEYNAFVTSKQNRYD
                                                                                             SEYNAFVTS  NR D
Subjt:  -------------------------------------------------------------------------------------SEYNAFVTSKQNRYD

Query:  NPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPF----SPSILGKPQAPSTHKWSPRSSSN
        +P+LEDVRSLLLAYEA L+K+N V+QLNIAQANL +L LQ  ++R  P+ S     +  F        PN P     S SILGKPQ  S HKW P+ SS+
Subjt:  NPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPF----SPSILGKPQAPSTHKWSPRSSSN

Query:  KPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQPT
        K QCQICGK GH+A +C+HRTN+AY   SPQAL    QP+
Subjt:  KPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQPT

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]4.6e-2048.03Show/hide
Query:  SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPP-NPPFSPSILGKPQAP
        SEYNAFVTS QN  DN ++EDV SLLL+YEA LEK+N ++ LNIAQA LS L  Q  ++R++ RP  N    S    P+ +F P  P  S S+  +P   
Subjt:  SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPP-NPPFSPSILGKPQAP

Query:  STHKWSP-RSSSNKPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQP
           KW P +  S+KPQCQI  KFGH    CH   + AYQ  +PQA +++ QP
Subjt:  STHKWSP-RSSSNKPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQP

TrEMBL top hitse value%identityAlignment
A0A438FWG3 Retrovirus-related Pol polyprotein from transposon RE14.6e-1825.56Show/hide
Query:  AVNPSVFPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------
        + NP  + Q  Q +P+I     + F  ++ P+L     V+L+ +N+LLW+ Q+LN ++ANGL   + G IPAP +FL   +  +NPE+ IW+        
Subjt:  AVNPSVFPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------

Query:  ---------------------------------------------------------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLN
                                                                 +EYN+FV +  + +++ +LE++ S+LL +E  LE+++  E+ N
Subjt:  ---------------------------------------------------------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLN

Query:  IAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RSPFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRSSSNKPQCQICGKFGHT
        + QAN++++++Q  N+++          R + NQ             R  +N   G+F  + P   S      +          S+ KPQCQ+CGK+GH 
Subjt:  IAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RSPFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRSSSNKPQCQICGKFGHT

Query:  ALICHHRTNLAYQ
        A+ C+HR +  YQ
Subjt:  ALICHHRTNLAYQ

A0A6J1D6N7 uncharacterized protein LOC1110174381.7e-2047.74Show/hide
Query:  EYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHL--QSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPFSPSILGKPQAP
        EYNAFVTS QNR D   LEDVR+LLLAY+  LEK+N V+QLN+ QAN+++L L  QS + R+   PS + P   PFN  T          P +LGKP   
Subjt:  EYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHL--QSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPFSPSILGKPQAP

Query:  STHKWSPRSSS---NKPQCQICGKFGHTALICHHRTNLAYQ-------TPSPQAL
        S   W P   S    K QCQIC K GHT   C+HR NL Y+        P+P AL
Subjt:  STHKWSPRSSS---NKPQCQICGKFGHTALICHHRTNLAYQ-------TPSPQAL

A0A6J1DQX7 uncharacterized protein LOC1110223155.8e-5342.65Show/hide
Query:  FPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------
        FP PT      PP   NPF  N +PTLP PL VKLNDNNFLLWKNQLLNAV+ANGL G+LDG+I  PP+FLD  QLQ NP +  WE              
Subjt:  FPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------

Query:  -------------------------------------------------------------------------------------SEYNAFVTSKQNRYD
                                                                                             SEYNAFVTS  NR D
Subjt:  -------------------------------------------------------------------------------------SEYNAFVTSKQNRYD

Query:  NPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPF----SPSILGKPQAPSTHKWSPRSSSN
        +P+LEDVRSLLLAYEA L+K+N V+QLNIAQANL +L LQ  ++R  P+ S     +  F        PN P     S SILGKPQ  S HKW P+ SS+
Subjt:  NPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPF----SPSILGKPQAPSTHKWSPRSSSN

Query:  KPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQPT
        K QCQICGK GH+A +C+HRTN+AY   SPQAL    QP+
Subjt:  KPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQPT

A5AG90 Uncharacterized protein4.9e-2843.54Show/hide
Query:  HPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWESEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQL
        +PL +KL+ +N+L+WKNQLLN V+ NGL G LD +   PPKFLD QQL +NPE+ +W   YN  + S    +     E+V + ++ Y    E    + Q+
Subjt:  HPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWESEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQL

Query:  --NIAQANLSSL--HLQSANRRSSPRPSSNQPIRSP--FNPPTGSFPPNPPFSPSILGKPQA-PSTHKW--SPRSSSN--KPQCQICGKFGHTALICHHR
          + + A L  L   LQS  ++   +  S+    +P  F PPT +      F PSILG+PQ  P    W   P +S N  KP+CQICGKFGHTALICHHR
Subjt:  --NIAQANLSSL--HLQSANRRSSPRPSSNQPIRSP--FNPPTGSFPPNPPFSPSILGKPQA-PSTHKW--SPRSSSN--KPQCQICGKFGHTALICHHR

Query:  TNLAYQTPS
         NL YQ PS
Subjt:  TNLAYQTPS

A5BMF5 Uncharacterized protein2.0e-2128.41Show/hide
Query:  FGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------------------------
        F  ++ P+L     V+L+ +N+LLW+ Q+LN ++ANGL   + G I AP +FL   +  +NPE+ IW+                                
Subjt:  FGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIPAPPKFLDAQQLQLNPEFLIWE--------------------------------

Query:  --------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RS
                +EYN+FV +  +R++  +LE++ S+LL +E  LE+++  E+ N+ QAN++++++Q  N+++          R + NQ             R 
Subjt:  --------SEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSP--------RPSSNQPI-----------RS

Query:  PFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRS-------SSNKPQCQICGKFGHTALICHHRTNLAYQ
         +N   G+F   P       G+    + H +S R+       S+ KPQCQ+CGK+GH A+ C+HR +  YQ
Subjt:  PFNPPTGSFPPNPPFSPSILGKPQAPSTHKWSPRS-------SSNKPQCQICGKFGHTALICHHRTNLAYQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACAAACCAGGCAGAACCGGAGCGGCCAGAGGCGGTAGGGACCAAACAGAGTCGGACAGGCTCGGCCCGCGC
GAGCCGGCCGAGGGTCGACCTCGATCATGAGCTTGGCCTCGGCGGGGGGTCGGGCCAAAAGCCCACCCCTTCGGTCTTGGCCCGTCCCACTTGTCGGTTTCGCCTCTTGG
GTCCATCTCCTAGTCCGATTTCTTCCCGGTTGTCCTCGTCAGCTCCTTGTACATCGGGGTGGTCTAAAATTGCCTATAACAATAAAACTTTTCTCACATCAATATCACAG
ACTTTATTCTTCTTCTTCTCTGGTGTTTTCCCTACCGTTTATGGTATCAGAGCTAAACCTCCATCCCTTCAATCAAGCGCCTCCATTTTACCCAAATTTTTCTCGCGCCC
TCCTCCCGCTGTTAATCCCTCTGTTTTTCCCCAACCCACTCAACAAAACCCACAAATCCCTCCTCCTGCTGCTAATCCTTTTGGTCCGAATTCCTACCCTACTCTCCCTC
ATCCCTTAGCCGTCAAGCTGAATGATAATAACTTTCTTTTGTGGAAAAACCAGCTTCTAAATGCAGTGCTCGCTAATGGTCTTCATGGTTTTTTGGATGGTTCAATCCCG
GCTCCTCCCAAATTTCTTGATGCTCAACAACTACAGCTGAATCCTGAGTTTCTGATATGGGAAAGTGAGTATAACGCTTTTGTGACCTCTAAACAAAATCGATATGATAA
TCCAGCTTTAGAGGATGTTCGAAGTTTATTGTTGGCTTATGAAGCTTGTCTTGAAAAACGGAATGTCGTTGAACAATTAAATATAGCTCAAGCAAATCTTAGTTCTCTAC
ACCTTCAATCTGCTAATCGTCGCTCTTCTCCTCGTCCATCCTCCAACCAACCTATTAGATCTCCCTTCAATCCACCTACTGGTTCTTTTCCCCCCAACCCACCGTTTTCT
CCTAGCATCTTAGGCAAACCGCAAGCTCCTTCCACTCATAAATGGTCTCCTCGGTCTAGTTCCAACAAACCACAATGCCAAATTTGTGGGAAGTTTGGTCATACTGCCCT
TATTTGCCATCATCGCACTAATTTAGCATACCAAACACCATCACCTCAAGCTTTACTAACTACAACTCAACCCACTGCTATTCCCATTTTAATGACTCTTTATCCACTTT
ATCTACTGATTCCTATCACCCTGATGAACGTTGGTATCTTGATTCGGGAGCTACTCATCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACAAACCAGGCAGAACCGGAGCGGCCAGAGGCGGTAGGGACCAAACAGAGTCGGACAGGCTCGGCCCGCGC
GAGCCGGCCGAGGGTCGACCTCGATCATGAGCTTGGCCTCGGCGGGGGGTCGGGCCAAAAGCCCACCCCTTCGGTCTTGGCCCGTCCCACTTGTCGGTTTCGCCTCTTGG
GTCCATCTCCTAGTCCGATTTCTTCCCGGTTGTCCTCGTCAGCTCCTTGTACATCGGGGTGGTCTAAAATTGCCTATAACAATAAAACTTTTCTCACATCAATATCACAG
ACTTTATTCTTCTTCTTCTCTGGTGTTTTCCCTACCGTTTATGGTATCAGAGCTAAACCTCCATCCCTTCAATCAAGCGCCTCCATTTTACCCAAATTTTTCTCGCGCCC
TCCTCCCGCTGTTAATCCCTCTGTTTTTCCCCAACCCACTCAACAAAACCCACAAATCCCTCCTCCTGCTGCTAATCCTTTTGGTCCGAATTCCTACCCTACTCTCCCTC
ATCCCTTAGCCGTCAAGCTGAATGATAATAACTTTCTTTTGTGGAAAAACCAGCTTCTAAATGCAGTGCTCGCTAATGGTCTTCATGGTTTTTTGGATGGTTCAATCCCG
GCTCCTCCCAAATTTCTTGATGCTCAACAACTACAGCTGAATCCTGAGTTTCTGATATGGGAAAGTGAGTATAACGCTTTTGTGACCTCTAAACAAAATCGATATGATAA
TCCAGCTTTAGAGGATGTTCGAAGTTTATTGTTGGCTTATGAAGCTTGTCTTGAAAAACGGAATGTCGTTGAACAATTAAATATAGCTCAAGCAAATCTTAGTTCTCTAC
ACCTTCAATCTGCTAATCGTCGCTCTTCTCCTCGTCCATCCTCCAACCAACCTATTAGATCTCCCTTCAATCCACCTACTGGTTCTTTTCCCCCCAACCCACCGTTTTCT
CCTAGCATCTTAGGCAAACCGCAAGCTCCTTCCACTCATAAATGGTCTCCTCGGTCTAGTTCCAACAAACCACAATGCCAAATTTGTGGGAAGTTTGGTCATACTGCCCT
TATTTGCCATCATCGCACTAATTTAGCATACCAAACACCATCACCTCAAGCTTTACTAACTACAACTCAACCCACTGCTATTCCCATTTTAATGACTCTTTATCCACTTT
ATCTACTGATTCCTATCACCCTGATGAACGTTGGTATCTTGATTCGGGAGCTACTCATCACATGA
Protein sequenceShow/hide protein sequence
MLLGFLGIRRRFGTNQAEPERPEAVGTKQSRTGSARASRPRVDLDHELGLGGGSGQKPTPSVLARPTCRFRLLGPSPSPISSRLSSSAPCTSGWSKIAYNNKTFLTSISQ
TLFFFFSGVFPTVYGIRAKPPSLQSSASILPKFFSRPPPAVNPSVFPQPTQQNPQIPPPAANPFGPNSYPTLPHPLAVKLNDNNFLLWKNQLLNAVLANGLHGFLDGSIP
APPKFLDAQQLQLNPEFLIWESEYNAFVTSKQNRYDNPALEDVRSLLLAYEACLEKRNVVEQLNIAQANLSSLHLQSANRRSSPRPSSNQPIRSPFNPPTGSFPPNPPFS
PSILGKPQAPSTHKWSPRSSSNKPQCQICGKFGHTALICHHRTNLAYQTPSPQALLTTTQPTAIPILMTLYPLYLLIPITLMNVGILIRELLIT