CuGenDBv2

Lag0010769 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010769
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon Tf2-2 polyprotein
Genome location: chr1:5977446..5979087
RNA-Seq Expression: Lag0010769
Synteny: Lag0010769
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
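
The genome location field packs the sequence name and coordinates into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the genomic span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr1:5977446..5979087")
print(seqid, start, end, span_length(start, end))  # chr1 5977446 5979087 1642
```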


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585985.1 hypothetical protein SDJN03_18718, partial [Cucurbita argyrosperma subsp. sororia] (e value 1.7e-10, identity 77.78%)
Query:  WARHELRWQNVSDLASSIIAAERMVDDMEKPTHSHKQGETSSRQQDGWSGEQKK
        WAR ELRWQN  DLA +IIAAERMVDDMEKPT ++K  ETSSRQQDG S EQKK
Subjt:  WARHELRWQNVSDLASSIIAAERMVDDMEKPTHSHKQGETSSRQQDGWSGEQKK
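
The 77.78 identity reported for the KAG6585985.1 hit can be checked against the alignment itself. The sketch below assumes the usual BLAST convention that the middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise, and that percent identity is identical columns divided by alignment length. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query, midline):
    """Percent identity from a BLAST-style query line and its match line."""
    if len(query) != len(midline):
        raise ValueError("query and match line must cover the same columns")
    # Letters in the match line mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


query = "WARHELRWQNVSDLASSIIAAERMVDDMEKPTHSHKQGETSSRQQDGWSGEQKK"
midline = "WAR ELRWQN  DLA +IIAAERMVDDMEKPT ++K  ETSSRQQDG S EQKK"
print(percent_identity(query, midline))  # 77.78 (42 identical columns out of 54)
```

For alignments that wrap over several blocks, the query and match lines of each block would need to be concatenated before calling the function.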

RVW30894.1 Transposon Tf2-12 polyprotein [Vitis vinifera] (e value 1.6e-08, identity 34.62%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP--THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFICGG
        F S    WA+ ELR Q V DL +++ AA+ +VD         M++P      K       ++ GW  + KK         +T+ V    T+  GCFIC G
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP--THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFICGG

Query:  PQRARDCPNKDKLSAFIADDRQGEKFPSYP
        P RA+DCP ++KLSA +  D +G+  P+ P
Subjt:  PQRARDCPNKDKLSAFIADDRQGEKFPSYP

RWR91972.1 hypothetical protein CKAN_02116000 [Cinnamomum micranthum f. kanehirae] (e value 1.9e-09, identity 36.13%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR
        F S    WA+ EL  Q V DL S+I AAE +VD         D  KP +++K G    +++ G +G +KK   +   + + +  + +    CFIC GP R
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR

Query:  ARDCPNKDKLSAFIADDRQ
        ARDCP K+KL+A +A++ +
Subjt:  ARDCPNKDKLSAFIADDRQ

RWR91976.1 hypothetical protein CKAN_02116400 [Cinnamomum micranthum f. kanehirae] (e value 3.2e-09, identity 36.61%)
Query:  WARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQRARDCPNK
        WA+ EL  Q V DL S+I AAE +VD         D  KP +++K G    +++ G +G +KK  ++   + + +  + +    CFIC GP RARDCP K
Subjt:  WARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQRARDCPNK

Query:  DKLSAFIADDRQ
        +KL+A +A++ +
Subjt:  DKLSAFIADDRQ

RWR95040.1 gag-asp_proteas domain-containing protein [Cinnamomum micranthum f. kanehirae] (e value 6.4e-10, identity 35.2%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR
        F S    WA+ ELR Q V DL S++ AAE ++D         D+ KP +++K G    +++ G SG ++K  ++   + + +  +     GCFIC GP R
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR

Query:  ARDCPNKDKLSAFIADDRQGEKFPS
        ARD P K+KL+A +A++ + +  PS
Subjt:  ARDCPNKDKLSAFIADDRQGEKFPS

TrEMBL top hits (e value, % identity, alignment)
A0A3S3QX45 Reverse transcriptase domain-containing protein (e value 9.1e-10, identity 36.13%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR
        F S    WA+ EL  Q V DL S+I AAE +VD         D  KP +++K G    +++ G +G +KK   +   + + +  + +    CFIC GP R
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR

Query:  ARDCPNKDKLSAFIADDRQ
        ARDCP K+KL+A +A++ +
Subjt:  ARDCPNKDKLSAFIADDRQ

A0A438D626 Transposon Tf2-12 polyprotein (e value 7.7e-09, identity 34.62%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP--THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFICGG
        F S    WA+ ELR Q V DL +++ AA+ +VD         M++P      K       ++ GW  + KK         +T+ V    T+  GCFIC G
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP--THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFICGG

Query:  PQRARDCPNKDKLSAFIADDRQGEKFPSYP
        P RA+DCP ++KLSA +  D +G+  P+ P
Subjt:  PQRARDCPNKDKLSAFIADDRQGEKFPSYP

A0A438F5W3 Transposon Tf2-2 polyprotein (e value 1.7e-08, identity 35.34%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP-----THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFI
        F +    WA+ ELR Q V DL +++ AA+ +VD          +KP       +  +G+TS  Q+ GW  + KK         +T+ V    T+  GCFI
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD--------DMEKP-----THSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLV--NLTKGWGCFI

Query:  CGGPQRARDCPNKDKLSAFIADDRQGEKFPSYP
        C GP RA+DCP ++KLSA +  + +GE  P  P
Subjt:  CGGPQRARDCPNKDKLSAFIADDRQGEKFPSYP

A0A443PME6 Uncharacterized protein (e value 1.5e-09, identity 36.61%)
Query:  WARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQRARDCPNK
        WA+ EL  Q V DL S+I AAE +VD         D  KP +++K G    +++ G +G +KK  ++   + + +  + +    CFIC GP RARDCP K
Subjt:  WARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQRARDCPNK

Query:  DKLSAFIADDRQ
        +KL+A +A++ +
Subjt:  DKLSAFIADDRQ

A0A443PWA7 Gag-asp_proteas domain-containing protein (e value 3.1e-10, identity 35.2%)
Query:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR
        F S    WA+ ELR Q V DL S++ AAE ++D         D+ KP +++K G    +++ G SG ++K  ++   + + +  +     GCFIC GP R
Subjt:  FRSRALAWARHELRWQNVSDLASSIIAAERMVD---------DMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGGPQR

Query:  ARDCPNKDKLSAFIADDRQGEKFPS
        ARD P K+KL+A +A++ + +  PS
Subjt:  ARDCPNKDKLSAFIADDRQGEKFPS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTTGCTATCTGTTGCATTATGTTACTGTGTTCCGCGAGCCTCTTTGCTTCCTTAAGCCACCATTCAGGAGTCGAGCCTTGGCTTGGGCTAGACATGAACTTCGTTG
GCAAAATGTATCGGATTTGGCTTCTTCGATCATTGCTGCAGAAAGAATGGTAGATGACATGGAAAAACCTACTCATTCACACAAACAAGGGGAAACTTCTTCGAGGCAAC
AAGACGGGTGGAGTGGAGAGCAAAAGAAAGGGGAAGAACATGCTACCAACGAGCTGGAAACAAGCCTGGTAAACCTAACAAAGGGTTGGGGTTGTTTCATTTGTGGAGGA
CCTCAAAGAGCCCGTGACTGCCCAAATAAAGATAAATTGAGTGCATTCATCGCAGATGACAGACAGGGGGAGAAGTTCCCTTCTTACCCGTATGCTTCACACATTCAGTC
TTTCTTCTCCCTTATGTGCCACACCAACCCTCTAAAGCCAAGTGCTTGCTTCAGTTATAGCCCTGAATTGAATGGGGTGGGAAAGAAAGAAACAAAGCCATTAGTCTCAC
TCAAGAAGTCAGCAGGCTCTCTTCTTCTTCTTCCTCCGCTTCAGGCGAGAGCAGAGCTTTTTATGTCTACCCTATTTGGGTTATTGATTCCGAGACGAACTCCATCGGAT
CTTTCTCATGTCGTTTCTCCTTCTCAAAACTATTTCCCCACTGTGTGTCCTTCTGGAGATGGGATTGCGTTTTTGATCACCTCTAATACGGATTCTTTCCGCCCGCCCTG
GGTATATGGGTCTCCCAGAACTGGCCTTAGATGGGGAAGATTCATTCATCGCTGTGGAAGTGGATACCAGTTTCGACCCTTCGCTTGTTGA
mRNA sequence
ATGTCTTGCTATCTGTTGCATTATGTTACTGTGTTCCGCGAGCCTCTTTGCTTCCTTAAGCCACCATTCAGGAGTCGAGCCTTGGCTTGGGCTAGACATGAACTTCGTTG
GCAAAATGTATCGGATTTGGCTTCTTCGATCATTGCTGCAGAAAGAATGGTAGATGACATGGAAAAACCTACTCATTCACACAAACAAGGGGAAACTTCTTCGAGGCAAC
AAGACGGGTGGAGTGGAGAGCAAAAGAAAGGGGAAGAACATGCTACCAACGAGCTGGAAACAAGCCTGGTAAACCTAACAAAGGGTTGGGGTTGTTTCATTTGTGGAGGA
CCTCAAAGAGCCCGTGACTGCCCAAATAAAGATAAATTGAGTGCATTCATCGCAGATGACAGACAGGGGGAGAAGTTCCCTTCTTACCCGTATGCTTCACACATTCAGTC
TTTCTTCTCCCTTATGTGCCACACCAACCCTCTAAAGCCAAGTGCTTGCTTCAGTTATAGCCCTGAATTGAATGGGGTGGGAAAGAAAGAAACAAAGCCATTAGTCTCAC
TCAAGAAGTCAGCAGGCTCTCTTCTTCTTCTTCCTCCGCTTCAGGCGAGAGCAGAGCTTTTTATGTCTACCCTATTTGGGTTATTGATTCCGAGACGAACTCCATCGGAT
CTTTCTCATGTCGTTTCTCCTTCTCAAAACTATTTCCCCACTGTGTGTCCTTCTGGAGATGGGATTGCGTTTTTGATCACCTCTAATACGGATTCTTTCCGCCCGCCCTG
GGTATATGGGTCTCCCAGAACTGGCCTTAGATGGGGAAGATTCATTCATCGCTGTGGAAGTGGATACCAGTTTCGACCCTTCGCTTGTTGA
Protein sequence
MSCYLLHYVTVFREPLCFLKPPFRSRALAWARHELRWQNVSDLASSIIAAERMVDDMEKPTHSHKQGETSSRQQDGWSGEQKKGEEHATNELETSLVNLTKGWGCFICGG
PQRARDCPNKDKLSAFIADDRQGEKFPSYPYASHIQSFFSLMCHTNPLKPSACFSYSPELNGVGKKETKPLVSLKKSAGSLLLLPPLQARAELFMSTLFGLLIPRRTPSD
LSHVVSPSQNYFPTVCPSGDGIAFLITSNTDSFRPPWVYGSPRTGLRWGRFIHRCGSGYQFRPFAC