CuGenDBv2

Clc02G09845 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G09845
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: ClcChr02:14731488..14734099
RNA-Seq Expression: Clc02G09845
Synteny: Clc02G09845
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains: IPR001969 - Aspartic peptidase, active site
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0040414.1 uncharacterized protein E6C27_scaffold35G00340 [Cucumis melo var. makuwa]    2.6e-98    43.28
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  +E+  L   V  L+ FVEGELH+L  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDE--
             E RGK RRLR TGSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDE--

Query:  -------SDGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
                DG K++      G  DG  R     P +                +KALNALV + +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  -------SDGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

KAA0040659.1 uncharacterized protein E6C27_scaffold370G00130 [Cucumis melo var. makuwa]    1.4e-99    44.01
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDES-
             E RGK RRLR TGSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +S 
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDES-

Query:  --------DGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
                DG K++      G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  --------DGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNFID  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     +++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

KAA0042140.1 uncharacterized protein E6C27_scaffold67G006290 [Cucumis melo var. makuwa]    4.9e-97    42.51
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C EC +    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+  QLWWR KYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD
             E RGK R LR  GSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++   K   +K G + +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD

Query:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY
        ++ GHK        +   G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G LY
Subjt:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY

Query:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA
         ++RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    +   
Subjt:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA

Query:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
           LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]    1.2e-98    43.83
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  +E+  L   V  L+ FVEGELH+L  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDESD
             E RGK RRLR TGSI +Y+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +SD
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDESD

Query:  GHKL--RRAIGMV----------DGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
          K   RR  G V          DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  GHKL--RRAIGMV----------DGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

TYK18079.1 uncharacterized protein E5676_scaffold306G004150 [Cucumis melo var. makuwa]    4.9e-97    42.51
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C EC +    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+  QLWWR KYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD
             E RGK R LR  GSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++   K   +K G + +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD

Query:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY
        ++ GHK        +   G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G LY
Subjt:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY

Query:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA
         ++RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    +   
Subjt:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA

Query:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
           LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

TrEMBL top hits    e value    % identity    Alignment
A0A5A7TAA5 Uncharacterized protein    1.3e-98    43.28
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  +E+  L   V  L+ FVEGELH+L  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDE--
             E RGK RRLR TGSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDE--

Query:  -------SDGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
                DG K++      G  DG  R     P +                +KALNALV + +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  -------SDGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

A0A5A7TFP3 Retrotrans_gag domain-containing protein    2.4e-97    42.51
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C EC +    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+  QLWWR KYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD
             E RGK R LR  GSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++   K   +K G + +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD

Query:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY
        ++ GHK        +   G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G LY
Subjt:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY

Query:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA
         ++RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    +   
Subjt:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA

Query:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
           LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

A0A5A7THC0 Reverse transcriptase domain-containing protein    6.7e-100    44.01
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDES-
             E RGK RRLR TGSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +S 
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDES-

Query:  --------DGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
                DG K++      G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  --------DGHKLRR---AIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNFID  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     +++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

A0A5A7VEX8 Polyprotein    5.7e-99    43.83
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  +E+  L   V  L+ FVEGELH+L  +       +D +C ECR+    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+A QLWWRRKYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDESD
             E RGK RRLR TGSI +Y+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++K   KK G E    +SD
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDESD

Query:  GHKL--RRAIGMV----------DGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ
          K   RR  G V          DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G 
Subjt:  GHKL--RRAIGMV----------DGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQ

Query:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS
        LY  +RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    L 
Subjt:  LYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLS

Query:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
             LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  PASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

A0A5D3D3V4 Retrotrans_gag domain-containing protein    2.4e-97    42.51
Query:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN
        D RLT+LE+ +E+ QL VGRLSE  EELVQ+++EIT+V KEMI+++GRT  KE+  L   V  L+ FVEGELHDL  +       +D +C EC +    +
Subjt:  DARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGE-------VDDMCKECRAMRSAN

Query:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------
           STST     GT+ +KVPKPD Y+G R+ATVVDNFLFGLE+YF ALGV DD A+I + P FLR+  QLWWR KYA                       
Subjt:  GGASTSTSSVARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYA-----------------------

Query:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD
             E RGK R LR  GSI DY+ EFTTLMLEI  L +K+ALF F+DG KDWAKIEL+R++VQ LDDAIA AE LVD+S ++   K   +K G + +  
Subjt:  -----EPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFRDGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKT---KKATKKEGEELESD

Query:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY
        ++ GHK        +   G  DG  R     P +                +KALNALVA+ +  +QV+  P  ++GS+QQI  M        +  +G LY
Subjt:  ESDGHK-------LRRAIGMVDGTGRRMERLPTR---------------MKKALNALVAQSRTQEQVK--PVCRMGSLQQISAMTGGFSPREIGEEGQLY

Query:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA
         ++RI G     + DTGASHNF+D  +   LGLK     G +K+VN+      G VAK V +K+G W+  +DF+V+ +DD+ +VLGL FF K    +   
Subjt:  VDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKANANLSPA

Query:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR
           LS+ DG    +  + ++     K++SAL   +   K +
Subjt:  SKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRR

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAACCGAGGAAAACGGCGAGATGATGCACGTCTCACCAGCTTGGAAGAAGCGGTTGAAGAGGCGCAGCTTGTCGTTGGTCGTCTAAGTGAACGCGTTGAAGAGCTTGT
CCAAGATAGCTCAGAGATCACAGCAGTTACCAAGGAGATGATCAAGGAGCTGGGACGGACGTTGGGGAAAGAGGTAAGTACCCTCTTTGATGAAGTAGCCAATCTTAGGA
AGTTTGTGGAGGGGGAACTTCACGATCTTCGTGGTGAAGTCGACGACATGTGCAAGGAGTGTCGGGCGATGCGTAGTGCTAATGGTGGCGCATCCACCAGCACCTCATCC
GTTGCGCGAGGCACCAATGGTGTGAAGGTGCCCAAGCCCGATACTTACGATGGCACAAGAAGTGCGACGGTGGTAGATAACTTCTTGTTCGGCCTAGAGCAATACTTCGA
GGCCCTAGGCGTCATAGACGATGGTGCTAAGATTGCCAATGTGCCCAACTTCCTACGTGAGGCAGTGCAACTTTGGTGGCGTAGGAAGTATGCAGAACCTCGAGGAAAAT
TTAGGAGATTAAGGCAGACTGGTAGTATTCCCGACTACATAAGCGAGTTTACCACCCTCATGCTGGAAATTGAAGGCCTATCCGACAAAGACGCACTCTTTTATTTTCGA
GATGGTTTCAAGGATTGGGCGAAGATAGAGCTCAATAGGCAAGATGTGCAGATTCTTGATGATGCCATTGCTACCGCAGAGATGCTTGTGGATTTTTCAACCAAGACCAA
AAAGGCCACGAAGAAGGAAGGAGAAGAGTTGGAGTCTGATGAATCTGATGGGCATAAGCTGAGGAGAGCCATAGGAATGGTCGATGGAACGGGAAGAAGAATGGAAAGGC
TGCCGACAAGAATGAAGAAGGCGCTTAATGCCTTAGTGGCTCAATCCCGCACGCAAGAACAAGTAAAGCCTGTGTGTCGAATGGGTTCCTTGCAGCAGATTAGTGCCATG
ACCGGTGGCTTCTCTCCACGGGAGATTGGGGAGGAGGGACAACTATACGTTGATGTGAGGATCAACGGTATAGTTCATGAGGTGTTGCTCGACACAGGAGCATCACATAA
CTTCATTGATCCGAATAAGGTCATGAGCCTTGGCCTCAAGGTCGTTGGAGGAGGTGGAAAGATGAAGTTGGTGAATTCTACAACCGTAGATGCGAAGGGAGTAGTAGCCA
AGGATGTTTCCTTAAAGGTGGGAAAATGGCGAGGTTTGGTTGATTTCACCGTGGTCTCGTTGGATGACTATAAGGTGGTTTTAGGCTTGGAGTTCTTCAGGAAGGCCAAC
GCCAACCTTTCGCCTGCTTCCAAACAGTTGTCTTTGTATGATGGACAACGAATCCACGTGGTCCCTTTGATGGTGAAGGAGTTGCCCGAGACAAAGGTAATGTCTGCGCT
TCATATGGTAAAGAGATGCATGAAGAGGAGAAGGCGCTCGAGGAAGCCCAACGAAATCGATGCTCTTGACTTGCGTAGCACCGCCAAAGATGAGTCGAGGAAGAGGAGCA
ACATCGTCGTTAAGCCAAGCCAACGCAAGAGCCATGGCCCTCAAGCGTGCACCCTTGTCGCAGCTCGACACCCCATCAAGAGCCTCGATGCATGCCTAGATGCGACAACG
TATGGTAGTGTGGTTGCGCAACCAGCCATAGACGTGCAGCCCATGCGCGCATCTGCTAGGAGAGGCAGCGCTGCGCGCCAGCCTGCACAAAGTTTTCAGCAAGCGAGGGT
AACGCATGCCCAAGATGCGTTCATGTGTGCGCCCATCATCGGCTTGTCAGGCGCACACAACGCGGCCCATGCGTCCAACGGTGTCCCTAGACTCGTGGAAGATCTTAGAC
ACATCGGGAGAGTCCCAGAAGATTCTAGACGCATCGAGATAGGCGCGTCTGGAAGGATGGGCAGGCTGTTAGGCAGTGCCAGCGCCCATGCCCTGGTGCCTGACACATCT
GGAAGCATCTGGAAGACACAAAAAGGGGCTCGACTGCTCGCCCATGGCCTCGACAACACGAGAAAGGTCCAGAAGAACCTCAAAGGTGCTGGACTATGTGGGAAGGCACG
CGATGCTGCTGAGAGACCGGAGAGGTGCCTCGACAGGTCTGGATGGCACGAAAAGTTGCTAGAGTGTGCGAGAGGTGTCCAAACAGTTCCCGAACTTTCCATGATAGTCG
GTCACCTATGTAAATATGCTAGAAGGCCCTCGATAGGTCTAGATTGTTCTAGCATGGGCCTTAGTCAATTAGTTCTAGACTTGTACATCGGATTGCTACCTGATGTACAT
ACGGGAAGAGTCTAG
mRNA sequence
ATGAACCGAGGAAAACGGCGAGATGATGCACGTCTCACCAGCTTGGAAGAAGCGGTTGAAGAGGCGCAGCTTGTCGTTGGTCGTCTAAGTGAACGCGTTGAAGAGCTTGT
CCAAGATAGCTCAGAGATCACAGCAGTTACCAAGGAGATGATCAAGGAGCTGGGACGGACGTTGGGGAAAGAGGTAAGTACCCTCTTTGATGAAGTAGCCAATCTTAGGA
AGTTTGTGGAGGGGGAACTTCACGATCTTCGTGGTGAAGTCGACGACATGTGCAAGGAGTGTCGGGCGATGCGTAGTGCTAATGGTGGCGCATCCACCAGCACCTCATCC
GTTGCGCGAGGCACCAATGGTGTGAAGGTGCCCAAGCCCGATACTTACGATGGCACAAGAAGTGCGACGGTGGTAGATAACTTCTTGTTCGGCCTAGAGCAATACTTCGA
GGCCCTAGGCGTCATAGACGATGGTGCTAAGATTGCCAATGTGCCCAACTTCCTACGTGAGGCAGTGCAACTTTGGTGGCGTAGGAAGTATGCAGAACCTCGAGGAAAAT
TTAGGAGATTAAGGCAGACTGGTAGTATTCCCGACTACATAAGCGAGTTTACCACCCTCATGCTGGAAATTGAAGGCCTATCCGACAAAGACGCACTCTTTTATTTTCGA
GATGGTTTCAAGGATTGGGCGAAGATAGAGCTCAATAGGCAAGATGTGCAGATTCTTGATGATGCCATTGCTACCGCAGAGATGCTTGTGGATTTTTCAACCAAGACCAA
AAAGGCCACGAAGAAGGAAGGAGAAGAGTTGGAGTCTGATGAATCTGATGGGCATAAGCTGAGGAGAGCCATAGGAATGGTCGATGGAACGGGAAGAAGAATGGAAAGGC
TGCCGACAAGAATGAAGAAGGCGCTTAATGCCTTAGTGGCTCAATCCCGCACGCAAGAACAAGTAAAGCCTGTGTGTCGAATGGGTTCCTTGCAGCAGATTAGTGCCATG
ACCGGTGGCTTCTCTCCACGGGAGATTGGGGAGGAGGGACAACTATACGTTGATGTGAGGATCAACGGTATAGTTCATGAGGTGTTGCTCGACACAGGAGCATCACATAA
CTTCATTGATCCGAATAAGGTCATGAGCCTTGGCCTCAAGGTCGTTGGAGGAGGTGGAAAGATGAAGTTGGTGAATTCTACAACCGTAGATGCGAAGGGAGTAGTAGCCA
AGGATGTTTCCTTAAAGGTGGGAAAATGGCGAGGTTTGGTTGATTTCACCGTGGTCTCGTTGGATGACTATAAGGTGGTTTTAGGCTTGGAGTTCTTCAGGAAGGCCAAC
GCCAACCTTTCGCCTGCTTCCAAACAGTTGTCTTTGTATGATGGACAACGAATCCACGTGGTCCCTTTGATGGTGAAGGAGTTGCCCGAGACAAAGGTAATGTCTGCGCT
TCATATGGTAAAGAGATGCATGAAGAGGAGAAGGCGCTCGAGGAAGCCCAACGAAATCGATGCTCTTGACTTGCGTAGCACCGCCAAAGATGAGTCGAGGAAGAGGAGCA
ACATCGTCGTTAAGCCAAGCCAACGCAAGAGCCATGGCCCTCAAGCGTGCACCCTTGTCGCAGCTCGACACCCCATCAAGAGCCTCGATGCATGCCTAGATGCGACAACG
TATGGTAGTGTGGTTGCGCAACCAGCCATAGACGTGCAGCCCATGCGCGCATCTGCTAGGAGAGGCAGCGCTGCGCGCCAGCCTGCACAAAGTTTTCAGCAAGCGAGGGT
AACGCATGCCCAAGATGCGTTCATGTGTGCGCCCATCATCGGCTTGTCAGGCGCACACAACGCGGCCCATGCGTCCAACGGTGTCCCTAGACTCGTGGAAGATCTTAGAC
ACATCGGGAGAGTCCCAGAAGATTCTAGACGCATCGAGATAGGCGCGTCTGGAAGGATGGGCAGGCTGTTAGGCAGTGCCAGCGCCCATGCCCTGGTGCCTGACACATCT
GGAAGCATCTGGAAGACACAAAAAGGGGCTCGACTGCTCGCCCATGGCCTCGACAACACGAGAAAGGTCCAGAAGAACCTCAAAGGTGCTGGACTATGTGGGAAGGCACG
CGATGCTGCTGAGAGACCGGAGAGGTGCCTCGACAGGTCTGGATGGCACGAAAAGTTGCTAGAGTGTGCGAGAGGTGTCCAAACAGTTCCCGAACTTTCCATGATAGTCG
GTCACCTATGTAAATATGCTAGAAGGCCCTCGATAGGTCTAGATTGTTCTAGCATGGGCCTTAGTCAATTAGTTCTAGACTTGTACATCGGATTGCTACCTGATGTACAT
ACGGGAAGAGTCTAG
Protein sequence
MNRGKRRDDARLTSLEEAVEEAQLVVGRLSERVEELVQDSSEITAVTKEMIKELGRTLGKEVSTLFDEVANLRKFVEGELHDLRGEVDDMCKECRAMRSANGGASTSTSS
VARGTNGVKVPKPDTYDGTRSATVVDNFLFGLEQYFEALGVIDDGAKIANVPNFLREAVQLWWRRKYAEPRGKFRRLRQTGSIPDYISEFTTLMLEIEGLSDKDALFYFR
DGFKDWAKIELNRQDVQILDDAIATAEMLVDFSTKTKKATKKEGEELESDESDGHKLRRAIGMVDGTGRRMERLPTRMKKALNALVAQSRTQEQVKPVCRMGSLQQISAM
TGGFSPREIGEEGQLYVDVRINGIVHEVLLDTGASHNFIDPNKVMSLGLKVVGGGGKMKLVNSTTVDAKGVVAKDVSLKVGKWRGLVDFTVVSLDDYKVVLGLEFFRKAN
ANLSPASKQLSLYDGQRIHVVPLMVKELPETKVMSALHMVKRCMKRRRRSRKPNEIDALDLRSTAKDESRKRSNIVVKPSQRKSHGPQACTLVAARHPIKSLDACLDATT
YGSVVAQPAIDVQPMRASARRGSAARQPAQSFQQARVTHAQDAFMCAPIIGLSGAHNAAHASNGVPRLVEDLRHIGRVPEDSRRIEIGASGRMGRLLGSASAHALVPDTS
GSIWKTQKGARLLAHGLDNTRKVQKNLKGAGLCGKARDAAERPERCLDRSGWHEKLLECARGVQTVPELSMIVGHLCKYARRPSIGLDCSSMGLSQLVLDLYIGLLPDVH
TGRV