; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008308 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008308
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:17223510..17224139
RNA-Seq ExpressionLag0008308
SyntenyLag0008308
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.8e-4952.33Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLE
        MRR +  ++ P+D EIERTLR++ R K LA A   +E  P+ ++D+++ V+  + S I+  PI A NFELK  LI M +   F G   +DP+ HL  FLE
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLE

Query:  ICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCP
        IC TVK+NGV  + IRLRLFPFSL+DKA+  L+S++ GSI +W ++A+ FL KFFP AKT +LR+EIG F+Q + E LYEAWERYK+++RRCP
Subjt:  ICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCP

RWR83368.1 hypothetical protein CKAN_01212200 [Cinnamomum micranthum f. kanehirae]2.2e-4450.51Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLA---KAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQM-ARDNSFKGHLSEDPHSHLR
        MRRN+ +NL PLD EIERTLR + +EK+     +    +E+A +++ D+   ++    S I    IQA NFE+K  +IQM A    F G   +DP++H+ 
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLA---KAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQM-ARDNSFKGHLSEDPHSHLR

Query:  SFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN
        +FLE+C T K NGV  +A+RLRL PFSL+DKAK  L S+   +I+TWDELA+ FL KFFP  KT K+R +I TF Q E E LYEAWERYKE+LR+CP+
Subjt:  SFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.2e-6065.05Show/hide
Query:  NLFPLDIEIERTLRTIHREKRLAKAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLEICGTVKM
        NL PLD EI+RT R  +    L +     EE PKAIRD+ Q  LP    GI+  PI   NFELK  LIQMAR+ +F+G  +EDPH HLRSFLEICGTVKM
Subjt:  NLFPLDIEIERTLRTIHREKRLAKAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLEICGTVKM

Query:  NGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCP
        NGV  +AI+LRLFPFSLQD+AKD LE++   SI+TW+ LAQAFL K+FP AK+ +LRTEIGTFRQLEDEQLYEAWERYK++LRRCP
Subjt:  NGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCP

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]3.2e-4348.37Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S
        M+R   +NL P D +IERT R  HR + L  A  +Q                  EA +A+RD++  ++   +  I    I A NFE+K   IQM + +  
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S

Query:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW
        F G  S+DP+SHL +FLEIC T K NGV  +AIRLRLFPFSL+DKAK  L S+  GSI+TW++LAQ FL KFFP AKT K+R +I +F Q + E LYEAW
Subjt:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW

Query:  ERYKEMLRRCPNMDI
        ER+KE+LRRCP+  I
Subjt:  ERYKEMLRRCPNMDI

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]1.8e-4653.23Show/hide
Query:  MRRNKVVNLFPLDIEIERT---LRTIHREKRLAKA-----MAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPH
        MRR + ++L  +D E ERT   LR I R +R A A      A+++   +AIRD+++ V+  + SGI    I A NFELK  LI M + N F G   EDP+
Subjt:  MRRNKVVNLFPLDIEIERT---LRTIHREKRLAKA-----MAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPH

Query:  SHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRC
        +HL SFLEIC TVKMNGV  +AIRLRLF FSL+DKAK   +S+  GSI+TWD+LAQ FLTK+FP +K+ +LR EI  F+QL+ E  YEAWER+K++LRRC
Subjt:  SHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRC

Query:  P
        P
Subjt:  P

TrEMBL top hitse value%identityAlignment
A0A1S3UKD4 uncharacterized protein LOC1067662671.1e-4148.51Show/hide
Query:  DIEIERTLRTIH-------REKRLAKAMAHQEEAP----------KAIRDFLQLVLPTDN---SGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPH
        D  IERT R+         RE+R  +    QEE            K IRD+    +P  N     IV  PIQA NFE+K  L+Q+ + N F G +SEDP+
Subjt:  DIEIERTLRTIH-------REKRLAKAMAHQEEAP----------KAIRDFLQLVLPTDN---SGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPH

Query:  SHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRC
        SHL +FL IC T+K NGV  +AI LRLFPFSL+DKAK+ L+S+  GSISTW+++A  F+TK+FP +K+ K+R EI +F Q ++E LYEAWERYKE++R+C
Subjt:  SHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRC

Query:  PN
        P+
Subjt:  PN

A0A3S3N117 Retrotrans_gag domain-containing protein1.1e-4450.51Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLA---KAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQM-ARDNSFKGHLSEDPHSHLR
        MRRN+ +NL PLD EIERTLR + +EK+     +    +E+A +++ D+   ++    S I    IQA NFE+K  +IQM A    F G   +DP++H+ 
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLA---KAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQM-ARDNSFKGHLSEDPHSHLR

Query:  SFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN
        +FLE+C T K NGV  +A+RLRL PFSL+DKAK  L S+   +I+TWDELA+ FL KFFP  KT K+R +I TF Q E E LYEAWERYKE+LR+CP+
Subjt:  SFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.6e-4348.37Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S
        M+R   +NL P D +IERT R  HR + L  A  +Q                  EA +A+RD++  ++   +  I    I A NFE+K   IQM + +  
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S

Query:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW
        F G  S+DP+SHL +FLEIC T K NGV  +AIRLRLFPFSL+DKAK  L S+  GSI+TW++LAQ FL KFFP AKT K+R +I +F Q + E LYEAW
Subjt:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW

Query:  ERYKEMLRRCPNMDI
        ER+KE+LRRCP+  I
Subjt:  ERYKEMLRRCPNMDI

A0A6J0ZYV0 uncharacterized protein LOC1104134132.7e-4348.37Show/hide
Query:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S
        M+R   +NL P D +IERT R  HR + L  A  +Q                  EA +A+RD+   ++   +  I    I A NFE+K   IQM + +  
Subjt:  MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQ-----------------EEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDN-S

Query:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW
        F G  S+DP+SHL +FLEIC T K NGV  +AIRLRLFPFSL+DKAK  L S+  GSI+TW++LAQ FL KFFP AKT K+R +I +F Q + E LYEAW
Subjt:  FKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAW

Query:  ERYKEMLRRCPNMDI
        ER+KE+LRRCP+  I
Subjt:  ERYKEMLRRCPNMDI

A0A6P6XAQ1 Reverse transcriptase7.3e-4153.09Show/hide
Query:  MAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLL
        MA  E   + +RDF         + IV   + A NFE+K  LIQM + + + G+ +EDP+SHL +FLEIC T+K NGV  +AI+LRLFPFSL+DKAK  L
Subjt:  MAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLEICGTVKMNGVPTNAIRLRLFPFSLQDKAKDLL

Query:  ESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN
        +S    + +TWDELA+AFL KFFP  KT KLR +I +F Q E E LYEAWERY+E+ RRCP+
Subjt:  ESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGGAACAAGGTGGTTAATTTGTTTCCGCTAGATATTGAAATTGAGAGGACTCTTAGAACCATTCACAGAGAGAAAAGATTAGCAAAAGCAATGGCCCATCAAGA
GGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCTAGTTCTTCCCACCGACAATTCTGGAATTGTCTACGCCCCAATCCAAGCTACCAATTTTGAGTTAAAGACAAGAT
TGATTCAGATGGCGCGCGATAACTCTTTTAAGGGACATCTTTCTGAGGACCCCCACTCACATCTGCGATCATTCTTGGAAATTTGTGGGACGGTGAAGATGAACGGAGTT
CCGACAAACGCGATAAGATTGAGGTTGTTTCCATTTTCTCTACAGGATAAAGCAAAGGATTTGCTCGAATCAGTCGAGACGGGCAGCATTAGTACATGGGACGAGCTTGC
CCAGGCTTTTCTGACGAAATTTTTCCCGTCTGCCAAGACTACCAAGCTCCGGACTGAGATCGGAACGTTTAGGCAGCTTGAAGATGAGCAGTTGTACGAGGCGTGGGAGA
GATACAAGGAAATGCTTAGGCGGTGCCCCAACATGGATATCCTGATTGGCTTCAACTGCAATTATTTTACAATGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGGAACAAGGTGGTTAATTTGTTTCCGCTAGATATTGAAATTGAGAGGACTCTTAGAACCATTCACAGAGAGAAAAGATTAGCAAAAGCAATGGCCCATCAAGA
GGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCTAGTTCTTCCCACCGACAATTCTGGAATTGTCTACGCCCCAATCCAAGCTACCAATTTTGAGTTAAAGACAAGAT
TGATTCAGATGGCGCGCGATAACTCTTTTAAGGGACATCTTTCTGAGGACCCCCACTCACATCTGCGATCATTCTTGGAAATTTGTGGGACGGTGAAGATGAACGGAGTT
CCGACAAACGCGATAAGATTGAGGTTGTTTCCATTTTCTCTACAGGATAAAGCAAAGGATTTGCTCGAATCAGTCGAGACGGGCAGCATTAGTACATGGGACGAGCTTGC
CCAGGCTTTTCTGACGAAATTTTTCCCGTCTGCCAAGACTACCAAGCTCCGGACTGAGATCGGAACGTTTAGGCAGCTTGAAGATGAGCAGTTGTACGAGGCGTGGGAGA
GATACAAGGAAATGCTTAGGCGGTGCCCCAACATGGATATCCTGATTGGCTTCAACTGCAATTATTTTACAATGGATTGA
Protein sequenceShow/hide protein sequence
MRRNKVVNLFPLDIEIERTLRTIHREKRLAKAMAHQEEAPKAIRDFLQLVLPTDNSGIVYAPIQATNFELKTRLIQMARDNSFKGHLSEDPHSHLRSFLEICGTVKMNGV
PTNAIRLRLFPFSLQDKAKDLLESVETGSISTWDELAQAFLTKFFPSAKTTKLRTEIGTFRQLEDEQLYEAWERYKEMLRRCPNMDILIGFNCNYFTMD