; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011514 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011514
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag protease polyprotein
Genome locationchr1:26870115..26878678
RNA-Seq ExpressionLag0011514
SyntenyLag0011514
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa]2.9e-4445.67Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M CPEDQKV   VF+L D    WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++  EA +V++F+ GL+ +IQG V A  P  +A A+R+   +  +    SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

KAA0050008.1 gag protease polyprotein [Cucumis melo var. makuwa]1.7e-4446.15Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M CPEDQKV   VF+L D    WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++V+EA + ++F+ GL+ +IQG V A  P  +A A+R+   +  +    SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]7.1e-5153.03Show/hide
Query:  SRQAQAPQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRG
        S Q      N  ++S EA+ LRDF+K+DP  FDG S DP +AE WLS +ET+FR+M C E+QKV   VF+L+D+A +WW+S ER I VS GP+TW QF+ 
Subjt:  SRQAQAPQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRG

Query:  EFFRKYYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSKP
         FF++YYPA + ++KQ EF+ L Q NR+VEEY+ +F +LSRFAP LV  EA K ERFI  LK+E +G VA   P DYATA+R    ID R    S  P
Subjt:  EFFRKYYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSKP

XP_038884794.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120075457 [Benincasa hispida]2.3e-4648.51Show/hide
Query:  AALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRKYYPAPS
        + LS EA+ LRDFRK++P  F+G+ +DPT  ELW+S IET+FR+M C EDQKV Y V +L   A IWWQS ER +GV   P+TW QF+  F+ KY+ A  
Subjt:  AALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRKYYPAPS

Query:  RFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRR-----PLVVSSKPSSGHKRRY
        R+ KQ EF+ L QG+R+VE+Y+ +F  LS FAP L   EA + ERF+ GLK+ IQG V A +P  +  A+R+   +D +       V    PS G KR+ 
Subjt:  RFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRR-----PLVVSSKPSSGHKRRY

Query:  DQ
        DQ
Subjt:  DQ

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]2.1e-5051.49Show/hide
Query:  AALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRKYYPAPS
        +  S EA+ LRDF+K++P  F+G+ +DPT AELW+S IET+FR+M CPEDQKV   VF+L D A IWWQ  ER +GV   P+TW QF+  F+ KY+ A  
Subjt:  AALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRKYYPAPS

Query:  RFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRR-----PLVVSSKPSSGHKRRY
        R+ KQ EF+ L QG+R+VEEY+ +F  LSRFAP LV  EA + ERFI GLKE I+G V A +P  +  A+R+   +D +      LV  + PSSG KR+ 
Subjt:  RFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRR-----PLVVSSKPSSGHKRRY

Query:  DQ
        DQ
Subjt:  DQ

TrEMBL top hitse value%identityAlignment
A0A5A7TD13 Reverse transcriptase1.4e-4445.67Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M CPEDQKV   VF+L D   +WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++  EA + ++F+ GL+ +IQG V A  P  +A A+R+   +  +    SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

A0A5A7TSQ8 Reverse transcriptase1.4e-4445.67Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M CPEDQKV   VF+L D    WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++  EA +V++F+ GL+ +IQG V A  P  +A A+R+   +  +    SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

A0A5A7U417 Gag protease polyprotein8.2e-4546.15Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M CPEDQKV   VF+L D    WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++V+EA + ++F+ GL+ +IQG V A  P  +A A+R+   +  +    SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

A0A5A7UAA8 Reverse transcriptase1.8e-4445.19Show/hide
Query:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK
        APQ     LS EA+ LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M CPEDQKV   VF+L D    WW++ ER +G     ITW QF+  F+ K
Subjt:  APQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRGEFFRK

Query:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS
        ++ A  R  K+ EF+ L QG+ TVE+Y+ +F  LSRFAP ++  EA + ++F+ GL+ +IQG V A  P  +A A+R+   +  + +  SSK      +S
Subjt:  YYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSK-----PSS

Query:  GHKRRYDQ
        G KR+ +Q
Subjt:  GHKRRYDQ

A0A6J1DSJ6 uncharacterized protein LOC1110235123.4e-5153.03Show/hide
Query:  SRQAQAPQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRG
        S Q      N  ++S EA+ LRDF+K+DP  FDG S DP +AE WLS +ET+FR+M C E+QKV   VF+L+D+A +WW+S ER I VS GP+TW QF+ 
Subjt:  SRQAQAPQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPITWTQFRG

Query:  EFFRKYYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSKP
         FF++YYPA + ++KQ EF+ L Q NR+VEEY+ +F +LSRFAP LV  EA K ERFI  LK+E +G VA   P DYATA+R    ID R    S  P
Subjt:  EFFRKYYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCTTGCACGAGGACTCGACAAACTCATTCGACCAGACATGCTTAAGTTTACCGCCGATACTTTTCATAAAAAACTCTTCTCAAGGAACGAACTAGGCGTTGCGAC
ATTTGAGACTTTTGGATTGGGTACTAATGCGAGTATGGTTCAAGAAGTTGCCACAGGCGTTGCGACATTTGAGACTTTTGGATTGGGTACTAATGCGAGTATGGTTCTAG
AAGTTGCCACAGGTGTGGAAGAAAAAGGCATGCTCACCGTCGATGTCGACGCCGCCGCCGCACTGCACTGCTTTTCGTCGTCCTCCGGGAGCCGCCGCCGCCTGCCGCGC
GACGCCTGCCGCTCACCTCCCGTCGCCGTGAACGGCCGCCATTCGCCGTCTTCTGAAGTGTTGTCGCCACCACGTTCAGCGAACCAGTCGTGCACGCCGCCGTCACCGCG
TCCAGATCTGTTTGTTCTTTGTCGCTCGGATTTGATTACCGCCGTCACTGCCTTAAGACAAATAGCCACACGTCTCTTGAACGTCGACAATTACAACTGGCGGTGTGAAA
TTGGAGATTGGACAGCAAGCAGTCGTCTGACTTCGACGATTGTTTCGCATCACGTGGTTGGCGTTGCATCAAGTGTTTGGGTGATTTCTATCAAGCTTAGTTACCTCGAG
GTTTACATCATAAACCTTAGGTATATCCATGGCTTGTCAGCAAGCATCCGGTCACGCCAGGCCCAAGCGCCTCAGAACAACGAAGCCGCCTTATCACGAGAGGCTAGGTG
CTTAAGAGATTTTAGGAAATGGGACCCCTGTCCGTTCGATGGAGCCTCAAGGGACCCCACGGTAGCAGAGTTGTGGCTGTCCTCCATCGAGACTGTTTTTCGCCACATGA
ACTGCCCGGAGGACCAGAAAGTGTACTACGTCGTGTTCCTGTTACGAGATAACGCCCTGATTTGGTGGCAATCGATCGAAAGAACTATAGGCGTCAGCAACGGACCTATC
ACATGGACTCAGTTCAGGGGGGAGTTCTTTAGGAAGTATTATCCTGCACCCTCCCGATTCAAGAAGCAAGCAGAGTTTGTGGCTCTTACGCAGGGAAACCGAACTGTAGA
GGAGTACGAGACAAAGTTTGTCAGACTGTCTCGGTTTGCTCCTACACTGGTAGTTGTAGAGGCTAATAAGGTAGAACGTTTCATCACAGGCTTAAAGGAAGAGATTCAGG
GCAGCGTGGCAGCCCATGAACCCCAAGACTATGCCACGGCAGTCAGGGTGACAGAGCCGATTGATCGACGACCTTTAGTTGTCTCTTCGAAACCTTCCTCAGGTCATAAG
CGAAGGTACGATCAGATGAGCTCGAACACACAGAGGGGTCACCGTTACCCTAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCCTTGCACGAGGACTCGACAAACTCATTCGACCAGACATGCTTAAGTTTACCGCCGATACTTTTCATAAAAAACTCTTCTCAAGGAACGAACTAGGCGTTGCGAC
ATTTGAGACTTTTGGATTGGGTACTAATGCGAGTATGGTTCAAGAAGTTGCCACAGGCGTTGCGACATTTGAGACTTTTGGATTGGGTACTAATGCGAGTATGGTTCTAG
AAGTTGCCACAGGTGTGGAAGAAAAAGGCATGCTCACCGTCGATGTCGACGCCGCCGCCGCACTGCACTGCTTTTCGTCGTCCTCCGGGAGCCGCCGCCGCCTGCCGCGC
GACGCCTGCCGCTCACCTCCCGTCGCCGTGAACGGCCGCCATTCGCCGTCTTCTGAAGTGTTGTCGCCACCACGTTCAGCGAACCAGTCGTGCACGCCGCCGTCACCGCG
TCCAGATCTGTTTGTTCTTTGTCGCTCGGATTTGATTACCGCCGTCACTGCCTTAAGACAAATAGCCACACGTCTCTTGAACGTCGACAATTACAACTGGCGGTGTGAAA
TTGGAGATTGGACAGCAAGCAGTCGTCTGACTTCGACGATTGTTTCGCATCACGTGGTTGGCGTTGCATCAAGTGTTTGGGTGATTTCTATCAAGCTTAGTTACCTCGAG
GTTTACATCATAAACCTTAGGTATATCCATGGCTTGTCAGCAAGCATCCGGTCACGCCAGGCCCAAGCGCCTCAGAACAACGAAGCCGCCTTATCACGAGAGGCTAGGTG
CTTAAGAGATTTTAGGAAATGGGACCCCTGTCCGTTCGATGGAGCCTCAAGGGACCCCACGGTAGCAGAGTTGTGGCTGTCCTCCATCGAGACTGTTTTTCGCCACATGA
ACTGCCCGGAGGACCAGAAAGTGTACTACGTCGTGTTCCTGTTACGAGATAACGCCCTGATTTGGTGGCAATCGATCGAAAGAACTATAGGCGTCAGCAACGGACCTATC
ACATGGACTCAGTTCAGGGGGGAGTTCTTTAGGAAGTATTATCCTGCACCCTCCCGATTCAAGAAGCAAGCAGAGTTTGTGGCTCTTACGCAGGGAAACCGAACTGTAGA
GGAGTACGAGACAAAGTTTGTCAGACTGTCTCGGTTTGCTCCTACACTGGTAGTTGTAGAGGCTAATAAGGTAGAACGTTTCATCACAGGCTTAAAGGAAGAGATTCAGG
GCAGCGTGGCAGCCCATGAACCCCAAGACTATGCCACGGCAGTCAGGGTGACAGAGCCGATTGATCGACGACCTTTAGTTGTCTCTTCGAAACCTTCCTCAGGTCATAAG
CGAAGGTACGATCAGATGAGCTCGAACACACAGAGGGGTCACCGTTACCCTAGATGA
Protein sequenceShow/hide protein sequence
MTLARGLDKLIRPDMLKFTADTFHKKLFSRNELGVATFETFGLGTNASMVQEVATGVATFETFGLGTNASMVLEVATGVEEKGMLTVDVDAAAALHCFSSSSGSRRRLPR
DACRSPPVAVNGRHSPSSEVLSPPRSANQSCTPPSPRPDLFVLCRSDLITAVTALRQIATRLLNVDNYNWRCEIGDWTASSRLTSTIVSHHVVGVASSVWVISIKLSYLE
VYIINLRYIHGLSASIRSRQAQAPQNNEAALSREARCLRDFRKWDPCPFDGASRDPTVAELWLSSIETVFRHMNCPEDQKVYYVVFLLRDNALIWWQSIERTIGVSNGPI
TWTQFRGEFFRKYYPAPSRFKKQAEFVALTQGNRTVEEYETKFVRLSRFAPTLVVVEANKVERFITGLKEEIQGSVAAHEPQDYATAVRVTEPIDRRPLVVSSKPSSGHK
RRYDQMSSNTQRGHRYPR