CuGenDBv2

Clc11G10150 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G10150
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: ClcChr11:14738590..14740455
RNA-Seq Expression: Clc11G10150
Synteny: Clc11G10150
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains: IPR001969 - Aspartic peptidase, active site
                  IPR021109 - Aspartic peptidase domain superfamily
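
As a minimal sketch (not part of the database record), the genome location string above can be split into its chromosome, start and end fields. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive interval."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr11:14738590..14740455")
print(chrom, start, end, span_length(start, end))
# prints: ClcChr11 14738590 14740455 1866
```

Under the inclusive-coordinate assumption the gene model spans 1866 bases on ClcChr11.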


Homology
GenBank top hits | e value | % identity
KAA0040414.1 uncharacterized protein E6C27_scaffold35G00340 [Cucumis melo var. makuwa] | 4.4e-87 | 39.19
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  EE++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELH+L  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIAAAE L D+++++K   KK        E  G K + +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALV + +  +   ++P  ++GS+Q IG M  D +   +  KG LY   ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

KAA0040659.1 uncharacterized protein E6C27_scaffold370G00130 [Cucumis melo var. makuwa] | 4.4e-87 | 39.19
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E+R
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIA AE L D+++++K   KK        E  G K++ +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY   ++ G     + DTGASHNFID   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    +++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

KAA0042140.1 uncharacterized protein E6C27_scaffold67G006290 [Cucumis melo var. makuwa] | 1.2e-84 | 37.88
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  EC +    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+ AQLWWR KYA  ++ GN I +W QFK + RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN
        A++EL RRNVQTLDDAIAAAE L D+++++K K    E+   + D++   GHK        +W   K +  A +    +  +PCF+C GPHWTR+C  + 
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN

Query:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT
        ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY + ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN+     
Subjt:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT

Query:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
         G+AK + +K                   +VLG+ FF K    +      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

KAA0065760.1 polyprotein [Cucumis melo var. makuwa] | 8.8e-88 | 39.36
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  EE++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         LT  V  L+ FVEGELH+L  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIAAAE L D+++++K   KK        E  G K++ +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY   ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

TYK18079.1 uncharacterized protein E5676_scaffold306G004150 [Cucumis melo var. makuwa] | 1.2e-84 | 37.88
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  EC +    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+ AQLWWR KYA  ++ GN I +W QFK + RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN
        A++EL RRNVQTLDDAIAAAE L D+++++K K    E+   + D++   GHK        +W   K +  A +    +  +PCF+C GPHWTR+C  + 
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN

Query:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT
        ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY + ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN+     
Subjt:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT

Query:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
         G+AK + +K                   +VLG+ FF K    +      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

TrEMBL top hits | e value | % identity
A0A5A7TAA5 Uncharacterized protein | 2.1e-87 | 39.19
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  EE++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELH+L  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIAAAE L D+++++K   KK        E  G K + +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALV + +  +   ++P  ++GS+Q IG M  D +   +  KG LY   ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

A0A5A7TFP3 Retrotrans_gag domain-containing protein | 5.8e-85 | 37.88
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  EC +    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+ AQLWWR KYA  ++ GN I +W QFK + RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN
        A++EL RRNVQTLDDAIAAAE L D+++++K K    E+   + D++   GHK        +W   K +  A +    +  +PCF+C GPHWTR+C  + 
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN

Query:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT
        ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY + ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN+     
Subjt:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT

Query:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
         G+AK + +K                   +VLG+ FF K    +      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

A0A5A7THC0 Reverse transcriptase domain-containing protein | 2.1e-87 | 39.19
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E+R
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIA AE L D+++++K   KK        E  G K++ +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY   ++ G     + DTGASHNFID   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    +++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

A0A5A7VEX8 Polyprotein | 4.3e-88 | 39.36
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  EE++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         LT  V  L+ FVEGELH+L  +       +D +  ECR+    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+AAQLWWRRKYA  ++ GN I +W QFKA+ RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR
        A++EL RRNVQTLDDAIAAAE L D+++++K   KK        E  G K++ +   GR +G K +    KNG       G+S+   +PCF+C GPHWTR
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNG-------GKSAR--RPCFLCDGPHWTR

Query:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN
        +C  + ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY   ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN
Subjt:  ECSKKNALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVN

Query:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
        +      G+AK + +K                   +VLG+ FF K    L      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  SAAVKTKGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

A0A5D3D3V4 Retrotrans_gag domain-containing protein | 5.8e-85 | 37.88
Query:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR
        M+ EE  +S  E+V +EG VTRGR++                   D  L+NLE+G+EDVQLAVGRLSE +EEL+QE++EIT VAK MI ++GRT  +E++
Subjt:  MATEEARSSSAERVQVEGLVTRGRQQD------------------DAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVR

Query:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV
         L   V  L+ FVEGELHDL  +       +D +  EC +    +   STST       + +K+PKPD Y+G R+A +VDNFLFGLE+YF AL V DD  
Subjt:  SLTQEVANLRKFVEGELHDLRKE-------VDGIQKECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGV

Query:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW
        +I +AP FLR+ AQLWWR KYA  ++ GN I +W QFK + RK                           ++ F             EA  Q+    +DW
Subjt:  KIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRK---------------------------IEAF-------------EADRQYS---QDW

Query:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN
        A++EL RRNVQTLDDAIAAAE L D+++++K K    E+   + D++   GHK        +W   K +  A +    +  +PCF+C GPHWTR+C  + 
Subjt:  ARVELQRRNVQTLDDAIAAAEMLTDFTSKTK-KAAKKEEEVSESDESD--GHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKN

Query:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT
        ALNALVA+ ++ +   ++P  ++GS+Q IG M  + +   +  KG LY + ++ G     + DTGASHNF+D   A  LGLK ++E   +K VN+     
Subjt:  ALNALVARSRDEESQVESPA-KMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKT

Query:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK
         G+AK + +K                   +VLG+ FF K    +      LS+ DG  +  IP++  +    K++SAL   +G  K
Subjt:  KGIAKEIPLK-------------------VVLGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found
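
The % identity column in the hit tables above is conventionally the number of identical aligned positions divided by the alignment length, gaps included. A rough sketch of that arithmetic follows, assuming the usual BLAST-style midline in which a letter marks an identity and `+` marks a conservative substitution. The helper names `midline_stats` and `percent_identity` are hypothetical, and the row prefixes (`Query:`, `Subjt:`) are assumed to have been stripped before the rows are passed in.

```python
def midline_stats(midline: str):
    """Count identities (letters) and positives (letters plus '+') in a midline row."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(identities: int, alignment_length: int) -> float:
    """Identities as a percentage of the full alignment length (gaps included)."""
    return 100.0 * identities / alignment_length


# Toy example: first five columns of an alignment row.
query_row = "MATEE"
midline_row = "M+ EE"
ident, pos = midline_stats(midline_row)
print(ident, pos, percent_identity(ident, len(query_row)))
# prints: 3 4 60.0
```

The alignment length is taken from the query row rather than the midline, because trailing spaces on the midline are often lost when a page is saved as text.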

Sequences
CDS sequence
ATGGCAACCGAGGAAGCTAGAAGCTCATCGGCAGAGCGAGTACAAGTTGAAGGACTAGTGACCCGAGGTAGGCAGCAAGATGATGCTCATCTATCCAACTTAGAAGAAGG
ACTGGAAGATGTGCAACTTGCCGTTGGTCGATTGAGTGAAAAATATGAAGAGCTCATCCAAGAAAGCTCGGAGATCACAGTAGTTGCCAAAAGAATGATCATGGAGCTGG
GACGGACGACGGGAGAAGAGGTAAGATCCCTCACTCAAGAAGTAGCCAACCTAAGGAAGTTCGTGGAAGGGGAACTCCACGATCTTCGCAAAGAGGTCGACGGCATTCAA
AAGGAGTGTCGGGCAAGTCGTGGTGCAAATGGTGAGGCGTCCACCAGCACCTCTGTCGCCGCAAGAAGCATGAATGGCTTAAAAATACCAAAGCCTGATACTTATGACGG
CACAAGAAGTGCAATCATAGTGGATAACTTCTTGTTCGGCCTAGAGCAATATTTTGATGCTCTGAACGTCGTCGACGATGGCGTCAAGATAGCCAATGCGCCCAACTTCC
TGCGAGAGGCAGCCCAGTTATGGTGGCGTAGGAAGTATGCTGAGCGCGAGCGGGACGGAAATTGCATCCAAACGTGGAGGCAATTTAAAGCGGACACGAGGAAAATTGAG
GCGTTTGAGGCAGACCGGCAGTATTCCCAAGATTGGGCACGCGTTGAGCTCCAGCGCCGCAATGTCCAGACGCTGGACGATGCGATAGCAGCTGCGGAGATGCTCACTGA
CTTCACATCCAAAACCAAGAAAGCCGCGAAGAAGGAAGAGGAAGTGTCGGAATCGGATGAATCTGATGGCCATAAAACTGAGGACAGCCATGGGAATGGTCGATGGAACG
GGAAGAAAAAGGAAAAAGCTGCCGAGAAAAATGGAGGTAAATCTGCCAGACGCCCTTGCTTCCTATGTGATGGACCACATTGGACGCGGGAATGCTCGAAGAAAAATGCT
CTCAACGCCTTGGTGGCCCGGTCCCGCGACGAAGAAAGTCAAGTTGAGTCCCCCGCAAAGATGGGTTCCTTGCAGCTTATTGGCGGCATGACAAGTGACTTCTCACCTTG
GAAGATTGGTGGGAAGGGACAACTATACGTTGATGCGAAGGTCAACGGTGTAGTAAAAGAAGTGCTGCTCGACACAGGAGCATCGCACAACTTTATCGACCCAAATATGG
CCATAAGTCTTGGTCTTAAGGTCGAAAAAGAAGGGGAAAAGTTGAAAGCGGTGAACTCGGCAGCCGTGAAAACCAAAGGGATTGCCAAAGAAATTCCCTTAAAGGTGGTC
CTAGGAATGGAGTTCTTCCGCAAAGCCAACGCCACCTTGTCACCCGCCTCAAAACAGTTGTCGTTGTTTGATGGACAACGCATAAGGGTGATCCCTTTGAAGGAGAAAGA
GCTGCATGAGACAAAGGTAATGTCCGCACTCGTCAAAATAAAGGGATCCATGAAAAGAAATAGGCGCTCGAGAAAATCACACGCAACGAGGGAGTTGCAAAATGAAGTGG
GGGAGAATGTCACGGGTAGAAATAATGGCATTTCAAATCGTGACACAAACGCCCTTAGCTCGCACCGCGTCGTCAAAGAAAGGTCGAGAGTGAAGGGCCAGCGCGTTGAC
AAAGCTGCTGCACGAGGAATCGATCCTTAA
mRNA sequence
ATGGCAACCGAGGAAGCTAGAAGCTCATCGGCAGAGCGAGTACAAGTTGAAGGACTAGTGACCCGAGGTAGGCAGCAAGATGATGCTCATCTATCCAACTTAGAAGAAGG
ACTGGAAGATGTGCAACTTGCCGTTGGTCGATTGAGTGAAAAATATGAAGAGCTCATCCAAGAAAGCTCGGAGATCACAGTAGTTGCCAAAAGAATGATCATGGAGCTGG
GACGGACGACGGGAGAAGAGGTAAGATCCCTCACTCAAGAAGTAGCCAACCTAAGGAAGTTCGTGGAAGGGGAACTCCACGATCTTCGCAAAGAGGTCGACGGCATTCAA
AAGGAGTGTCGGGCAAGTCGTGGTGCAAATGGTGAGGCGTCCACCAGCACCTCTGTCGCCGCAAGAAGCATGAATGGCTTAAAAATACCAAAGCCTGATACTTATGACGG
CACAAGAAGTGCAATCATAGTGGATAACTTCTTGTTCGGCCTAGAGCAATATTTTGATGCTCTGAACGTCGTCGACGATGGCGTCAAGATAGCCAATGCGCCCAACTTCC
TGCGAGAGGCAGCCCAGTTATGGTGGCGTAGGAAGTATGCTGAGCGCGAGCGGGACGGAAATTGCATCCAAACGTGGAGGCAATTTAAAGCGGACACGAGGAAAATTGAG
GCGTTTGAGGCAGACCGGCAGTATTCCCAAGATTGGGCACGCGTTGAGCTCCAGCGCCGCAATGTCCAGACGCTGGACGATGCGATAGCAGCTGCGGAGATGCTCACTGA
CTTCACATCCAAAACCAAGAAAGCCGCGAAGAAGGAAGAGGAAGTGTCGGAATCGGATGAATCTGATGGCCATAAAACTGAGGACAGCCATGGGAATGGTCGATGGAACG
GGAAGAAAAAGGAAAAAGCTGCCGAGAAAAATGGAGGTAAATCTGCCAGACGCCCTTGCTTCCTATGTGATGGACCACATTGGACGCGGGAATGCTCGAAGAAAAATGCT
CTCAACGCCTTGGTGGCCCGGTCCCGCGACGAAGAAAGTCAAGTTGAGTCCCCCGCAAAGATGGGTTCCTTGCAGCTTATTGGCGGCATGACAAGTGACTTCTCACCTTG
GAAGATTGGTGGGAAGGGACAACTATACGTTGATGCGAAGGTCAACGGTGTAGTAAAAGAAGTGCTGCTCGACACAGGAGCATCGCACAACTTTATCGACCCAAATATGG
CCATAAGTCTTGGTCTTAAGGTCGAAAAAGAAGGGGAAAAGTTGAAAGCGGTGAACTCGGCAGCCGTGAAAACCAAAGGGATTGCCAAAGAAATTCCCTTAAAGGTGGTC
CTAGGAATGGAGTTCTTCCGCAAAGCCAACGCCACCTTGTCACCCGCCTCAAAACAGTTGTCGTTGTTTGATGGACAACGCATAAGGGTGATCCCTTTGAAGGAGAAAGA
GCTGCATGAGACAAAGGTAATGTCCGCACTCGTCAAAATAAAGGGATCCATGAAAAGAAATAGGCGCTCGAGAAAATCACACGCAACGAGGGAGTTGCAAAATGAAGTGG
GGGAGAATGTCACGGGTAGAAATAATGGCATTTCAAATCGTGACACAAACGCCCTTAGCTCGCACCGCGTCGTCAAAGAAAGGTCGAGAGTGAAGGGCCAGCGCGTTGAC
AAAGCTGCTGCACGAGGAATCGATCCTTAA
Protein sequence
MATEEARSSSAERVQVEGLVTRGRQQDDAHLSNLEEGLEDVQLAVGRLSEKYEELIQESSEITVVAKRMIMELGRTTGEEVRSLTQEVANLRKFVEGELHDLRKEVDGIQ
KECRASRGANGEASTSTSVAARSMNGLKIPKPDTYDGTRSAIIVDNFLFGLEQYFDALNVVDDGVKIANAPNFLREAAQLWWRRKYAERERDGNCIQTWRQFKADTRKIE
AFEADRQYSQDWARVELQRRNVQTLDDAIAAAEMLTDFTSKTKKAAKKEEEVSESDESDGHKTEDSHGNGRWNGKKKEKAAEKNGGKSARRPCFLCDGPHWTRECSKKNA
LNALVARSRDEESQVESPAKMGSLQLIGGMTSDFSPWKIGGKGQLYVDAKVNGVVKEVLLDTGASHNFIDPNMAISLGLKVEKEGEKLKAVNSAAVKTKGIAKEIPLKVV
LGMEFFRKANATLSPASKQLSLFDGQRIRVIPLKEKELHETKVMSALVKIKGSMKRNRRSRKSHATRELQNEVGENVTGRNNGISNRDTNALSSHRVVKERSRVKGQRVD
KAAARGIDP
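
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard nuclear code and an in-frame sequence containing only A, C, G and T, and it is shown on the first and last 30 bases of the CDS rather than the full sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCAACCGAGGAAGCTAGAAGCTCATCG"))  # first 30 bases of the CDS
# prints: MATEEARSSS
print(translate("AAAGCTGCTGCACGAGGAATCGATCCTTAA"))  # last 30 bases of the CDS
# prints: KAAARGIDP
```

Both fragments agree with the start and end of the protein sequence in the record. Passing the complete CDS, with its line breaks, to `translate` would extend the same check to the whole sequence.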