; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022899 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022899
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:40412429..40415886
RNA-Seq ExpressionLag0022899
SyntenyLag0022899
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG9442207.1 hypothetical protein H6P81_018061 [Aristolochia fimbriata]8.3e-1947.79Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKVALQQIGVTSFLSPVV
        CSA ++  +  KL DPGSFTIPC FG F   + LCDLGASIN++PLS+CKKLN+G++K T++ LQ AD S   P G++E++L++V         F+ P  
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKVALQQIGVTSFLSPVV

Query:  RFCSPHRCDFVLL
               CDFV+L
Subjt:  RFCSPHRCDFVLL

KAG9450476.1 hypothetical protein H6P81_010441 [Aristolochia fimbriata]6.3e-1945.61Show/hide
Query:  TCSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKVALQQIGVTSFLSPV
        +CSA ++  +  KL DPGSFTIPC FG F   + LCDLGASIN++PLS+C+KLN+G++K T++ LQ AD+S   P G++E++L+++         F+ P 
Subjt:  TCSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKVALQQIGVTSFLSPV

Query:  VRFCSPHRCDFVLL
                CDFV+L
Subjt:  VRFCSPHRCDFVLL

KYP58627.1 Retrovirus-related Pol polyprotein from transposon opus [Cajanus cajan]8.3e-1958.82Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +QQ +  KL DPGSF IPC  G     +ALCDLGASIN++PLS+ K+L IG++K T + LQLAD+SV  PYG+VE++L+KV
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

XP_010906452.1 uncharacterized protein LOC105033389 [Elaeis guineensis]3.7e-1961.18Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +Q  +  KL DPGSFTIPCN G     +ALCDLGASIN++PLS+ +KL +GD+K TSV LQLAD+SV  P GIVE++L+K+
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

XP_014506502.2 uncharacterized protein LOC106766276 [Vigna radiata var. radiata]6.3e-1960Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +QQ +  KL DPGSF IPC  G     +ALCDLGASIN++PLS+ K+L IGD+K T + LQLAD+S+  PYGIVE++L+KV
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

TrEMBL top hitse value%identityAlignment
A0A0S3QWS7 Uncharacterized protein5.2e-1958.82Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +QQ +  KL DPGSF IPC  G  +  +ALCDLGASIN++PLS+ K+L IG++K T + LQLAD+S+  PYGIVE++L+KV
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

A0A151SV10 Retrovirus-related Pol polyprotein from transposon opus4.0e-1958.82Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +QQ +  KL DPGSF IPC  G     +ALCDLGASIN++PLS+ K+L IG++K T + LQLAD+SV  PYG+VE++L+KV
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

A0A1S3UKE1 uncharacterized protein LOC1067662763.1e-1960Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +QQ +  KL DPGSF IPC  G     +ALCDLGASIN++PLS+ K+L IGD+K T + LQLAD+S+  PYGIVE++L+KV
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFSC-RALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

A0A6I9QBH1 uncharacterized protein LOC1050333891.8e-1961.18Show/hide
Query:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        CSA +Q  +  KL DPGSFTIPCN G     +ALCDLGASIN++PLS+ +KL +GD+K TSV LQLAD+SV  P GIVE++L+K+
Subjt:  CSARVQQGVLEKLSDPGSFTIPCNFGTFS-CRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

A0A6J1DTZ8 uncharacterized protein LOC1110239796.8e-1955.42Show/hide
Query:  SARVQQGVLEKLSDPGSFTIPCNFGTFSCRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV
        + R+Q+ + +KL D   F+IPCN G++  R LCDLG +IN  PLSLC+KLNIG+IK TS+ +QL D+S   PYG++EN+LIKV
Subjt:  SARVQQGVLEKLSDPGSFTIPCNFGTFSCRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGACGTGCAGTGCTCGTGTCCAACAGGGAGTACTAGAGAAATTGTCTGACCCAGGGAGTTTTACCATTCCTTGTAATTTTGGTACTTTTTCATGTAGAGCACTATGTGA
TCTGGGGGCTAGCATAAATATAATTCCTCTATCATTATGTAAAAAGCTTAACATAGGAGACATAAAATCTACTTCTGTTAGACTTCAGTTGGCTGACCAGTCTGTGGTTA
GTCCATATGGGATTGTTGAGAATATTCTGATTAAAGTAGCACTACAACAAATTGGCGTGACTTCCTTCCTCTCACCGGTGGTGCGATTTTGTTCTCCTCACCGGTGCGAT
TTTGTTCTCCTCACCGGCGCAGTTGGTTCTTCTTTGGTGCGTTTTTCTTCTTCTGTGGCGCGATTTGCTACCTCTCTCACCGTCGACAGCCGATTTCTCTCACCATCACC
ATTCCTCCTTCTTCTTGTTCGTTGCATGTGGGTTTTCTCCATGGCTACTCATGGGTTCCACGTTAGCTACGGGTTAATTCCATTTTTCATTACCTCTAGTGTGCTTAATG
CGGCTTATAATTCCAATACGCAAGAAATAATGAATGATAGTGGAGCTTCTTCAACATCTAAATCAGCAAAAGCTGATGGAATGCATTACAGTAACAAGGAAATGGTTGAT
TGGAGCTATAATGCGATATCAGAGTTAATCGGGTGCTCGGAGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGC
TAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAACGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTA
CCGTTTTCCTTATTCAGAACGCGCGTTTAAGAGGCAGCGTCGCGACGCTGTCTTGA
mRNA sequenceShow/hide mRNA sequence
TCGACGTGCAGTGCTCGTGTCCAACAGGGAGTACTAGAGAAATTGTCTGACCCAGGGAGTTTTACCATTCCTTGTAATTTTGGTACTTTTTCATGTAGAGCACTATGTGA
TCTGGGGGCTAGCATAAATATAATTCCTCTATCATTATGTAAAAAGCTTAACATAGGAGACATAAAATCTACTTCTGTTAGACTTCAGTTGGCTGACCAGTCTGTGGTTA
GTCCATATGGGATTGTTGAGAATATTCTGATTAAAGTAGCACTACAACAAATTGGCGTGACTTCCTTCCTCTCACCGGTGGTGCGATTTTGTTCTCCTCACCGGTGCGAT
TTTGTTCTCCTCACCGGCGCAGTTGGTTCTTCTTTGGTGCGTTTTTCTTCTTCTGTGGCGCGATTTGCTACCTCTCTCACCGTCGACAGCCGATTTCTCTCACCATCACC
ATTCCTCCTTCTTCTTGTTCGTTGCATGTGGGTTTTCTCCATGGCTACTCATGGGTTCCACGTTAGCTACGGGTTAATTCCATTTTTCATTACCTCTAGTGTGCTTAATG
CGGCTTATAATTCCAATACGCAAGAAATAATGAATGATAGTGGAGCTTCTTCAACATCTAAATCAGCAAAAGCTGATGGAATGCATTACAGTAACAAGGAAATGGTTGAT
TGGAGCTATAATGCGATATCAGAGTTAATCGGGTGCTCGGAGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGCAGGC
TAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAACGTCGAGACGCTATGATAGGAAGCGTCCCGACGCTA
CCGTTTTCCTTATTCAGAACGCGCGTTTAAGAGGCAGCGTCGCGACGCTGTCTTGA
Protein sequenceShow/hide protein sequence
STCSARVQQGVLEKLSDPGSFTIPCNFGTFSCRALCDLGASINIIPLSLCKKLNIGDIKSTSVRLQLADQSVVSPYGIVENILIKVALQQIGVTSFLSPVVRFCSPHRCD
FVLLTGAVGSSLVRFSSSVARFATSLTVDSRFLSPSPFLLLLVRCMWVFSMATHGFHVSYGLIPFFITSSVLNAAYNSNTQEIMNDSGASSTSKSAKADGMHYSNKEMVD
WSYNAISELIGCSEREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTTSRRYDRKRPDATVFLIQNARLRGSVATLS