; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011127 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011127
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:15282806..15283737
RNA-Seq ExpressionLag0011127
SyntenyLag0011127
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]2.5e-4444.39Show/hide
Query:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPN--FFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT
        +SS++    TP   P+ P   P + P P  S P+PN      +P   +P   QPLAVKL+D+N+I+W  QLLN+V+ANGL  FLDGS    P+FLD Q  
Subjt:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPN--FFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT

Query:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH
        Q NP++  W+RYNR +M W+Y+S+ E  +G+IV  +SAS IW +L R Y + + A +  L++ LQ I+K+GL+   Y+ K + + +   +IGEP++Y DH
Subjt:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH

Query:  LAHILDGLGSEYNAFVTSIQNRS
        L + L GLG +YN FVTSIQ+++
Subjt:  LAHILDGLGSEYNAFVTSIQNRS

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]2.5e-4443.44Show/hide
Query:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQP
        +SS++    TP  +P  P + P+ +   P+  P+P     +P   +P   QPLAVKL+D+N+I+W  QLLN+V+ANGL  FLDGS    P+FLD Q  Q 
Subjt:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQP

Query:  NPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLA
        NP++  W+RYNR +M W+Y+S+ E  +G+IV  +SAS IW +L R Y + + A +  L++ LQ I+K+GL+   Y+ K + + +   +IGEP++Y DHL 
Subjt:  NPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLA

Query:  HILDGLGSEYNAFVTSIQNRS
        + L GLG +YN FVTSIQ+++
Subjt:  HILDGLGSEYNAFVTSIQNRS

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]1.5e-4144.69Show/hide
Query:  GSSSSSQKQQTPIIS-PST---PVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT
        GSSS++     P I  P T    V   +   QPP  PP P   + N      QP  +KL+ +N+++W NQLLNV++ANGL  F+DGS P  P+F D    
Subjt:  GSSSSSQKQQTPIIS-PST---PVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT

Query:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH
          N +YI W+R+NR IM W+Y+SL +  MG+IV  +SA +IW +LN+ Y S++ A+I  L+++LQ +RKDGL+  +Y+ K K I +   A+GEP+S +DH
Subjt:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH

Query:  LAHILDGLGSEYNAFVTSIQNRSYNL
        L ++  GL  EYNAFVTSI  R  NL
Subjt:  LAHILDGLGSEYNAFVTSIQNRSYNL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.2e-4044.61Show/hide
Query:  PSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCW
        P TP +    T    Q+P         P  +L Q L++KL++ N +L  +QLLNV++ANGL  F+D    + PK+LD    Q NP+++ W+R N+ +M W
Subjt:  PSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCW

Query:  LYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSI
        +YSSL    +G+IV  S+A DIW+SLN  Y+S + A +M+L SQLQ+I+K  + + +YL+++K + D+F  IGEP+SYRD L  IL+GL  EY+ FVTSI
Subjt:  LYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSI

Query:  QNRS
         NRS
Subjt:  QNRS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.9e-7269.54Show/hide
Query:  PVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWE
        P  TP     PP+P  F+ NP+ TLPQPL VKLNDNNF+LW NQLLN V+ANGL G+LDG+I   P+FLD    QPNP Y  WERYNR +MCW+YSSL E
Subjt:  PVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWE

Query:  EKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS
        EKMGE+V L +  DIWSSL R YDS TTARIM LK++LQ +RKDG SV QYLAKIKEI DKF A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+
Subjt:  EKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein7.4e-4244.69Show/hide
Query:  GSSSSSQKQQTPIIS-PST---PVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT
        GSSS++     P I  P T    V   +   QPP  PP P   + N      QP  +KL+ +N+++W NQLLNV++ANGL  F+DGS P  P+F D    
Subjt:  GSSSSSQKQQTPIIS-PST---PVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT

Query:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH
          N +YI W+R+NR IM W+Y+SL +  MG+IV  +SA +IW +LN+ Y S++ A+I  L+++LQ +RKDGL+  +Y+ K K I +   A+GEP+S +DH
Subjt:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH

Query:  LAHILDGLGSEYNAFVTSIQNRSYNL
        L ++  GL  EYNAFVTSI  R  NL
Subjt:  LAHILDGLGSEYNAFVTSIQNRSYNL

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.1e-4044.61Show/hide
Query:  PSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCW
        P TP +    T    Q+P         P  +L Q L++KL++ N +L  +QLLNV++ANGL  F+D    + PK+LD    Q NP+++ W+R N+ +M W
Subjt:  PSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCW

Query:  LYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSI
        +YSSL    +G+IV  S+A DIW+SLN  Y+S + A +M+L SQLQ+I+K  + + +YL+++K + D+F  IGEP+SYRD L  IL+GL  EY+ FVTSI
Subjt:  LYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSI

Query:  QNRS
         NRS
Subjt:  QNRS

A0A6J1DQX7 uncharacterized protein LOC1110223151.4e-7269.54Show/hide
Query:  PVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWE
        P  TP     PP+P  F+ NP+ TLPQPL VKLNDNNF+LW NQLLN V+ANGL G+LDG+I   P+FLD    QPNP Y  WERYNR +MCW+YSSL E
Subjt:  PVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWE

Query:  EKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS
        EKMGE+V L +  DIWSSL R YDS TTARIM LK++LQ +RKDG SV QYLAKIKEI DKF A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+
Subjt:  EKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS

A0A7J0EGI5 Uncharacterized protein1.2e-4444.39Show/hide
Query:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPN--FFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT
        +SS++    TP   P+ P   P + P P  S P+PN      +P   +P   QPLAVKL+D+N+I+W  QLLN+V+ANGL  FLDGS    P+FLD Q  
Subjt:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPN--FFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNT

Query:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH
        Q NP++  W+RYNR +M W+Y+S+ E  +G+IV  +SAS IW +L R Y + + A +  L++ LQ I+K+GL+   Y+ K + + +   +IGEP++Y DH
Subjt:  QPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDH

Query:  LAHILDGLGSEYNAFVTSIQNRS
        L + L GLG +YN FVTSIQ+++
Subjt:  LAHILDGLGSEYNAFVTSIQNRS

A0A7J0GPN0 UBX domain-containing protein1.2e-4443.44Show/hide
Query:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQP
        +SS++    TP  +P  P + P+ +   P+  P+P     +P   +P   QPLAVKL+D+N+I+W  QLLN+V+ANGL  FLDGS    P+FLD Q  Q 
Subjt:  SSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLP---QPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQP

Query:  NPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLA
        NP++  W+RYNR +M W+Y+S+ E  +G+IV  +SAS IW +L R Y + + A +  L++ LQ I+K+GL+   Y+ K + + +   +IGEP++Y DHL 
Subjt:  NPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLA

Query:  HILDGLGSEYNAFVTSIQNRS
        + L GLG +YN FVTSIQ+++
Subjt:  HILDGLGSEYNAFVTSIQNRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.3e-1425.88Show/hide
Query:  LAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLS-SASDIWSSLNRSYDSNT
        + + LN  N+ +W      + L+ G+ G +DGS   +P       T+       W+  +  +  W+Y ++ +  +  I+ +  +A D+W SL   +  N 
Subjt:  LAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGWERYNRFIMCWLYSSLWEEKMGEIVCLS-SASDIWSSLNRSYDSNT

Query:  TARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS
         AR +  +++L+    D LSV +Y  K+K ++D    +  PIS R  + H+L+GL  +Y+  +  I+++S
Subjt:  TARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGAAGAAGGTTCTTCCTCCTCTTCTCAAAAGCAACAAACTCCCATCATCTCTCCCAGCACGCCGGTGACAACTCCAGTAGCTACTCCCCAACCTCCACAGTC
CCCCCCCCACCCTAATTTCTTTACCCCTAATCCATATTCCACTTTACCTCAACCCCTAGCTGTTAAACTCAATGACAACAACTTCATACTCTGGAATAATCAACTTCTCA
ATGTTGTGCTCGCGAATGGTTTGTCTGGATTTCTCGATGGCTCCATTCCTGCTTCGCCCAAGTTTCTCGATCAACAAAACACTCAACCCAATCCAGATTACATTGGCTGG
GAAAGGTACAACCGTTTTATTATGTGTTGGCTATATTCATCATTGTGGGAAGAAAAGATGGGTGAAATCGTATGTTTGTCCTCTGCCTCTGATATTTGGTCTTCTCTCAA
TCGATCATATGATTCAAATACAACTGCTAGGATAATGGCCTTGAAATCTCAGTTACAGAAAATAAGAAAGGATGGTTTATCTGTTGGTCAGTATTTAGCAAAAATTAAAG
AGATTACTGATAAATTTGTTGCTATTGGTGAACCAATATCTTATAGGGATCATCTTGCTCATATACTTGATGGTTTAGGTAGTGAGTATAATGCCTTCGTTACATCTATT
CAAAATCGTTCGTATAACCTACTTTGGAAGATGTTCGAAGCCTTTTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGAAGAAGGTTCTTCCTCCTCTTCTCAAAAGCAACAAACTCCCATCATCTCTCCCAGCACGCCGGTGACAACTCCAGTAGCTACTCCCCAACCTCCACAGTC
CCCCCCCCACCCTAATTTCTTTACCCCTAATCCATATTCCACTTTACCTCAACCCCTAGCTGTTAAACTCAATGACAACAACTTCATACTCTGGAATAATCAACTTCTCA
ATGTTGTGCTCGCGAATGGTTTGTCTGGATTTCTCGATGGCTCCATTCCTGCTTCGCCCAAGTTTCTCGATCAACAAAACACTCAACCCAATCCAGATTACATTGGCTGG
GAAAGGTACAACCGTTTTATTATGTGTTGGCTATATTCATCATTGTGGGAAGAAAAGATGGGTGAAATCGTATGTTTGTCCTCTGCCTCTGATATTTGGTCTTCTCTCAA
TCGATCATATGATTCAAATACAACTGCTAGGATAATGGCCTTGAAATCTCAGTTACAGAAAATAAGAAAGGATGGTTTATCTGTTGGTCAGTATTTAGCAAAAATTAAAG
AGATTACTGATAAATTTGTTGCTATTGGTGAACCAATATCTTATAGGGATCATCTTGCTCATATACTTGATGGTTTAGGTAGTGAGTATAATGCCTTCGTTACATCTATT
CAAAATCGTTCGTATAACCTACTTTGGAAGATGTTCGAAGCCTTTTATTAG
Protein sequenceShow/hide protein sequence
MTTEEGSSSSSQKQQTPIISPSTPVTTPVATPQPPQSPPHPNFFTPNPYSTLPQPLAVKLNDNNFILWNNQLLNVVLANGLSGFLDGSIPASPKFLDQQNTQPNPDYIGW
ERYNRFIMCWLYSSLWEEKMGEIVCLSSASDIWSSLNRSYDSNTTARIMALKSQLQKIRKDGLSVGQYLAKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSI
QNRSYNLLWKMFEAFY