; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G033460 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G033460
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationCiama_Chr02:9610095..9611678
RNA-Seq ExpressionCaUC02G033460
SyntenyCaUC02G033460
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG71332.1 hypothetical protein EZV62_006267 [Acer yangbiense]3.7e-3539.32Show/hide
Query:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID
        S   K EG  +    + K+N  +  K+E    K PCF+CDGP+W RDCP++K L+A+++Q    +       MGSL+Q+ A+    +P    ++G ++++
Subjt:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID

Query:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ
        A ING     +LDTGA+HNF+  ++  +LGL+    GG +K+VNS      GIAK V + +G W G +DF+VV +DD+++VLG+EFF + +A   PA   
Subjt:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ

Query:  LSLYDG
        LS+ DG
Subjt:  LSLYDG

TXG73104.1 hypothetical protein EZV62_001683 [Acer yangbiense]3.1e-3740.89Show/hide
Query:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID
        S G K EG  +    +++        N  KP K PCF+CDGP+W RDCP++K L+A+++Q    ++      MGSL+Q+ A+ G  +P    ++G ++++
Subjt:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID

Query:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ
        A ING     +LDTGA+HNF+  +K   LGLK   GGG +K+VNS      GIAK V + +G W G +DF+VV +DD++VVLG+EFF + +A   PA   
Subjt:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ

Query:  LSL
        L++
Subjt:  LSL

XP_006493831.1 uncharacterized protein LOC102625991 [Citrus sinensis]2.6e-3640Show/hide
Query:  DGHKAEGSHRNGRWNRKK-------NGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQV---KPVCRMGSLQQISAMIGGFSPREI
        +  K +G++R   W +KK        GK+  K E    + PCF+C+GP+W RDCP+K++LNAL AQ ++   +   +P   MGSLQ++ A+        +
Subjt:  DGHKAEGSHRNGRWNRKK-------NGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQV---KPVCRMGSLQQISAMIGGFSPREI

Query:  GEEGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKAN
         ++G +Y+ A++NG     LLDTGA+HNF+  ++   LGLK    GG MK+VNS      GIA+ V + +G W G +DF++V +DD+K+VLG+EFF + +
Subjt:  GEEGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKAN

Query:  ATLSPASKQLSLYDG
        A   PA+  LS+ DG
Subjt:  ATLSPASKQLSLYDG

XP_015389228.1 uncharacterized protein LOC107178484 [Citrus sinensis]2.6e-3640Show/hide
Query:  DGHKAEGSHRNGRWNRKK-------NGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQV---KPVCRMGSLQQISAMIGGFSPREI
        +  K +G++R   W +KK        GK+  K E    + PCF+C+GP+W RDCP+K++LNAL AQ ++   +   +P   MGSLQ++ A+        +
Subjt:  DGHKAEGSHRNGRWNRKK-------NGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQV---KPVCRMGSLQQISAMIGGFSPREI

Query:  GEEGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKAN
         ++G +Y+ A++NG     LLDTGA+HNF+  ++   LGLK    GG MK+VNS      GIA+ V + +G W G +DF++V +DD+K+VLG+EFF + +
Subjt:  GEEGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKAN

Query:  ATLSPASKQLSLYDG
        A   PA+  LS+ DG
Subjt:  ATLSPASKQLSLYDG

XP_015869899.1 uncharacterized protein LOC107407175 [Ziziphus jujuba]3.7e-3539.23Show/hide
Query:  SDESDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQL
        S E +  ++    ++ RW   + GK A K      K  CFLCDGP+W RDCP++KALNA+  Q   +E+ +    +GSLQ ++A+    +P+E  + G +
Subjt:  SDESDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQL

Query:  YIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPA
        +++A++NG+  + L+DTGASHNF+   +   LG+K   G G +K+VNS     +G+A+ V   +G W G ++ TVVS+DDYKVVLG++FF +  A   P 
Subjt:  YIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPA

Query:  SKQLSLYDG
        + +L + DG
Subjt:  SKQLSLYDG

TrEMBL top hitse value%identityAlignment
A0A5C7GN37 Retrotrans_gag domain-containing protein3.1e-3540.29Show/hide
Query:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID
        S G K EG  +    + K+N  +  K E    K PCF+CDGP+W RDCP++K LNA+++Q    +  +    MGSL+Q+ A+    +P    ++G ++++
Subjt:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID

Query:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ
        A ING     +LDTGA+HNF+  ++   LGLK   GGG +K+  S      GIAK V + +G W G +DF++V +DD++VVLG+EFF + +A   PA   
Subjt:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ

Query:  LSLYDG
        LS+ DG
Subjt:  LSLYDG

A0A5C7ISI1 Uncharacterized protein1.8e-3539.32Show/hide
Query:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID
        S   K EG  +    + K+N  +  K+E    K PCF+CDGP+W RDCP++K L+A+++Q    +       MGSL+Q+ A+    +P    ++G ++++
Subjt:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID

Query:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ
        A ING     +LDTGA+HNF+  ++  +LGL+    GG +K+VNS      GIAK V + +G W G +DF+VV +DD+++VLG+EFF + +A   PA   
Subjt:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ

Query:  LSLYDG
        LS+ DG
Subjt:  LSLYDG

A0A5C7IW54 Uncharacterized protein1.5e-3740.89Show/hide
Query:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID
        S G K EG  +    +++        N  KP K PCF+CDGP+W RDCP++K L+A+++Q    ++      MGSL+Q+ A+ G  +P    ++G ++++
Subjt:  SDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQLYID

Query:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ
        A ING     +LDTGA+HNF+  +K   LGLK   GGG +K+VNS      GIAK V + +G W G +DF+VV +DD++VVLG+EFF + +A   PA   
Subjt:  ARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPASKQ

Query:  LSL
        L++
Subjt:  LSL

A0A6P3Z009 uncharacterized protein LOC1074071751.8e-3539.23Show/hide
Query:  SDESDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQL
        S E +  ++    ++ RW   + GK A K      K  CFLCDGP+W RDCP++KALNA+  Q   +E+ +    +GSLQ ++A+    +P+E  + G +
Subjt:  SDESDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGEEGQL

Query:  YIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPA
        +++A++NG+  + L+DTGASHNF+   +   LG+K   G G +K+VNS     +G+A+ V   +G W G ++ TVVS+DDYKVVLG++FF +  A   P 
Subjt:  YIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANATLSPA

Query:  SKQLSLYDG
        + +L + DG
Subjt:  SKQLSLYDG

A0A6P6FTI3 uncharacterized protein LOC1074055251.3e-3338.5Show/hide
Query:  ESDESDGHK-AEGSHR--NGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGE
        ES +   HK  EG+H+  + +W   + GK A K      K  CFLCDGP+W RDCP++KALNA+  Q   +E  +    MGS+Q ++A+    +P++  +
Subjt:  ESDESDGHK-AEGSHR--NGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGSLQQISAMIGGFSPREIGE

Query:  EGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANAT
         G ++++A+ING++ + L+DTGA+HNF+   +   LG+K +  GG MK+VNS      G+A+ V   +G+W G ++ +VV +DD+KVVLG+EF  +  A 
Subjt:  EGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANAT

Query:  LSPASKQLSLYDG
          P +  + + DG
Subjt:  LSPASKQLSLYDG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCAGAGGAAGCACGAAGCTCATTCGTGGAGCGTGTGCAAGTCAAAGGACCTATGACCCGAGGAAAATGGCGAGATGATGCACAGTTGGAGTCTGATGAATCTGA
TGGGCATAAGGCTGAGGGGAGCCATAGGAATGGTCGATGGAACAGGAAGAAGAATGGAAAAGCTGCCGACAAGAATGAAGGTAAGCCTTTCAAACGCCCTTGCTTCCTAT
GTGACGGGCCATATTGGACGAGAGATTGCCCGCAAAAGAAGGCGCTTAATGCCTTAGTGGCTCAATCCCGCGCGCAAGAACAAGTAAAGCCTGTGTGTCGAATGGGTTCC
TTGCAGCAGATTAGCGCCATGATCGGTGGCTTCTCTCCACGGGAGATTGGGGAGGAGGGACAACTATACATTGATGCGAGGATCAACGGTATAGTTCATGAGGTGTTGCT
CGACACAGGAGCATCACATAACTTCATTGATCCGAATAAGGTCGTGAGCCTTGGCCTCAAGGTCGTTGGAGGAGGTGGAAAGATGAAGTCGGTGAATTCCACAACCGTGG
ATGCGAAGGGAATAGCCAAGGATGTTTTCTTAAAGGTGGGAAAATGGCGAGGTTTGGTTGATTTCACCGTGGTCTCGTTAGATGACTATAAGGTGGTTTTAGGCTTGGAG
TTCTTCAGGAAAGCCAACGCCACCCTTTCACCTGCTTCCAAACAGTTGTCTTTGTATGATGGACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCAGAGGAAGCACGAAGCTCATTCGTGGAGCGTGTGCAAGTCAAAGGACCTATGACCCGAGGAAAATGGCGAGATGATGCACAGTTGGAGTCTGATGAATCTGA
TGGGCATAAGGCTGAGGGGAGCCATAGGAATGGTCGATGGAACAGGAAGAAGAATGGAAAAGCTGCCGACAAGAATGAAGGTAAGCCTTTCAAACGCCCTTGCTTCCTAT
GTGACGGGCCATATTGGACGAGAGATTGCCCGCAAAAGAAGGCGCTTAATGCCTTAGTGGCTCAATCCCGCGCGCAAGAACAAGTAAAGCCTGTGTGTCGAATGGGTTCC
TTGCAGCAGATTAGCGCCATGATCGGTGGCTTCTCTCCACGGGAGATTGGGGAGGAGGGACAACTATACATTGATGCGAGGATCAACGGTATAGTTCATGAGGTGTTGCT
CGACACAGGAGCATCACATAACTTCATTGATCCGAATAAGGTCGTGAGCCTTGGCCTCAAGGTCGTTGGAGGAGGTGGAAAGATGAAGTCGGTGAATTCCACAACCGTGG
ATGCGAAGGGAATAGCCAAGGATGTTTTCTTAAAGGTGGGAAAATGGCGAGGTTTGGTTGATTTCACCGTGGTCTCGTTAGATGACTATAAGGTGGTTTTAGGCTTGGAG
TTCTTCAGGAAAGCCAACGCCACCCTTTCACCTGCTTCCAAACAGTTGTCTTTGTATGATGGACAATGA
Protein sequenceShow/hide protein sequence
MAAEEARSSFVERVQVKGPMTRGKWRDDAQLESDESDGHKAEGSHRNGRWNRKKNGKAADKNEGKPFKRPCFLCDGPYWTRDCPQKKALNALVAQSRAQEQVKPVCRMGS
LQQISAMIGGFSPREIGEEGQLYIDARINGIVHEVLLDTGASHNFIDPNKVVSLGLKVVGGGGKMKSVNSTTVDAKGIAKDVFLKVGKWRGLVDFTVVSLDDYKVVLGLE
FFRKANATLSPASKQLSLYDGQ