; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028544 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028544
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 reverse transcriptase-like
Genome locationchr8:24704280..24705929
RNA-Seq ExpressionLag0028544
SyntenyLag0028544
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145869.1 uncharacterized protein LOC111015219 [Momordica charantia]6.3e-4378.29Show/hide
Query:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKV--YSYDDQNPTALSDLDDLSENGVVYKK
        MISGK+YP YST  I PPT LHLHL SP PLLKISS L+LISSES+SLSFPT  ASKP A S+RFSNSVAKV  Y Y+ QN T  SDL+DLSENGVVYKK
Subjt:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKV--YSYDDQNPTALSDLDDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALERW
        TLAMVECSMFAAL+GLVYFLSNSLALE +
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALERW

XP_022929150.1 uncharacterized protein LOC111435817 [Cucurbita moschata]8.2e-4377.44Show/hide
Query:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV
        MISG LYPS STSCIFPP      T  H+HL  P PLLKISS+L+LIS ESVSLSFPT  ASK S  S RFSNSVAKVYS++ QNPT+LSDL+DLSENGV
Subjt:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV

Query:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        VYKKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW

XP_022969819.1 uncharacterized protein LOC111468904 [Cucurbita maxima]9.1e-4277.1Show/hide
Query:  MISGKLYPSYSTSCIFPP----THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVY
        MISG LYPS STSCIFPP    T  H+HL    PLLKIS++L+LIS ESVSLSFPT  ASK S  S RFSNSVAKVYS++ QNPT+LSDL+DLSENGVVY
Subjt:  MISGKLYPSYSTSCIFPP----THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALERW
        KKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALERW

XP_023549519.1 uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo]2.4e-4276.69Show/hide
Query:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV
        MISGKLYPS STSCIFPP      T +H+HL    PLLKISS+L+LIS +SVSLSFPT  ASK S  S RFSNSVAKVYS++ QNPT+LSDL+DLSENGV
Subjt:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV

Query:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        VYKKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW

XP_038885169.1 uncharacterized protein LOC120075651 [Benincasa hispida]1.6e-4177.17Show/hide
Query:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVYKKTL
        MISGKLYPSYS SCIFPP   +LHL    PLL+ISS L+LIS +SVSLSFP+ FASK SA S RFSNSV KVYSY+ QNP  LSDL+DLSE+G VYKKTL
Subjt:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVYKKTL

Query:  AMVECSMFAALNGLVYFLSNSLALERW
        AMVECSMFAALNGLVYFLSNSLALE +
Subjt:  AMVECSMFAALNGLVYFLSNSLALERW

TrEMBL top hitse value%identityAlignment
A0A1S3BEA4 uncharacterized protein LOC103488678 isoform X21.5e-3771.64Show/hide
Query:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENG
        MISGKLY S S+SCIFPPT      P+P P        LKISS L+LIS +SVSLS P+SFASK SA S RFSNS+ +VYSY+ QN   LSDLDDLSENG
Subjt:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENG

Query:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        VVYKKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALERW

A0A1S4DVX7 uncharacterized protein LOC103488678 isoform X51.5e-3771.64Show/hide
Query:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENG
        MISGKLY S S+SCIFPPT      P+P P        LKISS L+LIS +SVSLS P+SFASK SA S RFSNS+ +VYSY+ QN   LSDLDDLSENG
Subjt:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFP-------LLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENG

Query:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        VVYKKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALERW

A0A6J1CVR0 uncharacterized protein LOC1110152193.0e-4378.29Show/hide
Query:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKV--YSYDDQNPTALSDLDDLSENGVVYKK
        MISGK+YP YST  I PPT LHLHL SP PLLKISS L+LISSES+SLSFPT  ASKP A S+RFSNSVAKV  Y Y+ QN T  SDL+DLSENGVVYKK
Subjt:  MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKV--YSYDDQNPTALSDLDDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALERW
        TLAMVECSMFAAL+GLVYFLSNSLALE +
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALERW

A0A6J1ETG3 uncharacterized protein LOC1114358174.0e-4377.44Show/hide
Query:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV
        MISG LYPS STSCIFPP      T  H+HL  P PLLKISS+L+LIS ESVSLSFPT  ASK S  S RFSNSVAKVYS++ QNPT+LSDL+DLSENGV
Subjt:  MISGKLYPSYSTSCIFPP------THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGV

Query:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        VYKKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  VYKKTLAMVECSMFAALNGLVYFLSNSLALERW

A0A6J1I3S1 uncharacterized protein LOC1114689044.4e-4277.1Show/hide
Query:  MISGKLYPSYSTSCIFPP----THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVY
        MISG LYPS STSCIFPP    T  H+HL    PLLKIS++L+LIS ESVSLSFPT  ASK S  S RFSNSVAKVYS++ QNPT+LSDL+DLSENGVVY
Subjt:  MISGKLYPSYSTSCIFPP----THLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALERW
        KKTLAMVECSMFAALNGLVYFLSNSLALE +
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALERW

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.1e-0526.29Show/hide
Query:  NAWKEIT-KASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCVFPNLYRISFSKAFSIKDLQ-KDSTWDLR----FHRNLLNRELQEWDTLASLIGSFN
        + W+ I     D V     +  G G +   W DRW   +PL     N  R +       KDL      WD      +  N  N  L+    +  L+    
Subjt:  NAWKEIT-KASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCVFPNLYRISFSKAFSIKDLQ-KDSTWDLR----FHRNLLNRELQEWDTLASLIGSFN

Query:  PSGREDVLTWRLDKSGMFSVKSALEEIQTKRRILEEDLSS---QIWEGNIPKKVKFFLWSSALNSINTMDRIQRR
         +G  D L+W+  + G FSV+SA E + T   +   +++S    +W+  +P++VK FLW     ++ T +   RR
Subjt:  PSGREDVLTWRLDKSGMFSVKSALEEIQTKRRILEEDLSS---QIWEGNIPKKVKFFLWSSALNSINTMDRIQRR

Arabidopsis top hitse value%identityAlignment
AT1G26180.1 unknown protein1.0e-0637.5Show/hide
Query:  LKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNGLVYFLSNSLALERW
        + +S Q  +IS  S+S +    FA    + +  ++N         ++N  +  + D+     VVY+KTL +VEC+MFAA+ GLVYFLSNSLA+E +
Subjt:  LKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAALNGLVYFLSNSLALERW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTCCCACCAACCCATCTTCATCTTCATCTTCCTTCCCCATTTCCTCTTCTCAAAATCTCTTCCCA
ACTCAAATTGATCAGCTCTGAATCCGTCTCCCTCTCTTTTCCAACCTCATTTGCTTCTAAACCCAGTGCCAACTCCATTAGATTTTCGAATTCAGTGGCCAAAGTTTATA
GCTATGACGACCAAAACCCCACTGCTTTGTCGGATTTGGATGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTCGAGTGCTCCATGTTCGCTGCA
CTTAATGGCTTGGTCTACTTCTTGAGTAATTCACTTGCTCTTGAGAGATGGAAAAATTTGTATAATGCTTGGAAAGAGATTACTAAAGCCTCTGATTTTGTGTGGGGAAA
TTGTTCCTTCAACGTGGGTTTGGGTGATAAGGCCCTTTCTTGGGAAGATAGATGGAGGGGTGATCAACCTCTCAAGTGTGTTTTTCCTAATTTGTACAGAATCTCCTTCA
GCAAAGCGTTCAGTATTAAGGACCTTCAGAAAGATAGCACCTGGGATCTTCGTTTCCATAGAAATTTGCTGAATAGAGAGCTTCAAGAGTGGGATACCTTGGCTTCTTTG
ATAGGCAGCTTTAATCCTTCAGGAAGGGAGGATGTTCTGACTTGGAGGTTGGATAAATCAGGGATGTTTTCTGTCAAGTCAGCCTTGGAAGAGATTCAAACTAAAAGAAG
AATTCTGGAGGAAGATCTCAGCAGTCAAATCTGGGAAGGCAACATTCCTAAGAAAGTTAAGTTTTTCTTGTGGTCTTCGGCCCTTAATAGTATTAACACCATGGATAGAA
TCCAGAGGAGGTTTCCTAGTCTTAATCTTTCCCGGCTGGCGTGTGATGTGTGGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTCCCACCAACCCATCTTCATCTTCATCTTCCTTCCCCATTTCCTCTTCTCAAAATCTCTTCCCA
ACTCAAATTGATCAGCTCTGAATCCGTCTCCCTCTCTTTTCCAACCTCATTTGCTTCTAAACCCAGTGCCAACTCCATTAGATTTTCGAATTCAGTGGCCAAAGTTTATA
GCTATGACGACCAAAACCCCACTGCTTTGTCGGATTTGGATGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACATTGGCGATGGTCGAGTGCTCCATGTTCGCTGCA
CTTAATGGCTTGGTCTACTTCTTGAGTAATTCACTTGCTCTTGAGAGATGGAAAAATTTGTATAATGCTTGGAAAGAGATTACTAAAGCCTCTGATTTTGTGTGGGGAAA
TTGTTCCTTCAACGTGGGTTTGGGTGATAAGGCCCTTTCTTGGGAAGATAGATGGAGGGGTGATCAACCTCTCAAGTGTGTTTTTCCTAATTTGTACAGAATCTCCTTCA
GCAAAGCGTTCAGTATTAAGGACCTTCAGAAAGATAGCACCTGGGATCTTCGTTTCCATAGAAATTTGCTGAATAGAGAGCTTCAAGAGTGGGATACCTTGGCTTCTTTG
ATAGGCAGCTTTAATCCTTCAGGAAGGGAGGATGTTCTGACTTGGAGGTTGGATAAATCAGGGATGTTTTCTGTCAAGTCAGCCTTGGAAGAGATTCAAACTAAAAGAAG
AATTCTGGAGGAAGATCTCAGCAGTCAAATCTGGGAAGGCAACATTCCTAAGAAAGTTAAGTTTTTCTTGTGGTCTTCGGCCCTTAATAGTATTAACACCATGGATAGAA
TCCAGAGGAGGTTTCCTAGTCTTAATCTTTCCCGGCTGGCGTGTGATGTGTGGCAATAA
Protein sequenceShow/hide protein sequence
MISGKLYPSYSTSCIFPPTHLHLHLPSPFPLLKISSQLKLISSESVSLSFPTSFASKPSANSIRFSNSVAKVYSYDDQNPTALSDLDDLSENGVVYKKTLAMVECSMFAA
LNGLVYFLSNSLALERWKNLYNAWKEITKASDFVWGNCSFNVGLGDKALSWEDRWRGDQPLKCVFPNLYRISFSKAFSIKDLQKDSTWDLRFHRNLLNRELQEWDTLASL
IGSFNPSGREDVLTWRLDKSGMFSVKSALEEIQTKRRILEEDLSSQIWEGNIPKKVKFFLWSSALNSINTMDRIQRRFPSLNLSRLACDVWQ