; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G190125 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G190125
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGag-protease polyprotein
Genome locationCla97Chr10:7158674..7161829
RNA-Seq ExpressionCla97C10G190125
SyntenyCla97C10G190125
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAC64917.1 gag-pol polyprotein [Glycine max]1.0e-3728.34Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I EF+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K  +C  CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV
           ++V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV

Query:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
                    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

AAO73523.1 gag-pol polyprotein [Glycine max]1.0e-3728.39Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQD---SKDKKFKCRECEGYGHYQAECPNFLS---------DDDEEVTSSSDSDEEIHALV
        ++ LL KQF + L R DKR  P   N+  ++  G  +    D   S  K  +C  CEGYGH  AECP  L            D E    SDSD +++AL+
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQD---SKDKKFKCRECEGYGHYQAECPNFLS---------DDDEEVTSSSDSDEEIHALV

Query:  GCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFV
        G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG  +L +V +
Subjt:  GCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFV

Query:  SAQH-------------------VQITPHDYKQKCVCDEH--AQHKFQ------KKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVK
          ++                    +  P   +      +H    H  Q      KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H  
Subjt:  SAQH-------------------VQITPHDYKQKCVCDEH--AQHKFQ------KKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVK

Query:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
                   V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

AAO73525.1 gag-pol polyprotein [Glycine max]2.3e-3728.78Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI-------------------------------------AAFNYKFESLKMSEEETITEF
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                          KFE+LKM EEE I +F
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI-------------------------------------AAFNYKFESLKMSEEETITEF

Query:  N-------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLA
        +                         +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L 
Subjt:  N-------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLA

Query:  QSISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHA
         ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K  +C  CEGYGH +AECP  L            DD E    SDSD +++A
Subjt:  QSISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHA

Query:  LVGCL---SPSSVLEANSSD----YCNVLIFE---TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFVSAQH
        L G       SS +E    +    Y  + I          +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V    ++
Subjt:  LVGCL---SPSSVLEANSSD----YCNVLIFE---TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFVSAQH

Query:  V-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVKNLRS
        V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H      
Subjt:  V-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVKNLRS

Query:  NKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
         K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

AAO73529.1 gag-pol polyprotein [Glycine max]5.1e-3727.66Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++  L KQF + L R D+R  P   N+S ++  G  +    D K    K  +CR CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I +LK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEH--------AQHKFQKKWICHFCGRASHIHLYCFWLYGSRAHQRSHKLPFKAHGVAQHVK
           + V                   +  P          +H         +   +KKW CH+CG+  HI  +C+ L+G   H        +     + + 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEH--------AQHKFQKKWICHFCGRASHIHLYCFWLYGSRAHQRSHKLPFKAHGVAQHVK

Query:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
         +  +K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

NP_001235160.1 gag-protease polyprotein [Glycine max]1.2e-3828.52Show/hide
Query:  PQVTDAKGKV--GAKLEKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   G K E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKV--GAKLEKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  ++                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K F+C  CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV
           ++V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV

Query:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
              K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

TrEMBL top hitse value%identityAlignment
O65147 Gag-pol polyprotein5.0e-3828.34Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I EF+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K  +C  CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV
           ++V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV

Query:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
                    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

Q84VH6 Gag-pol polyprotein2.5e-3727.66Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++  L KQF + L R D+R  P   N+S ++  G  +    D K    K  +CR CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I +LK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEH--------AQHKFQKKWICHFCGRASHIHLYCFWLYGSRAHQRSHKLPFKAHGVAQHVK
           + V                   +  P          +H         +   +KKW CH+CG+  HI  +C+ L+G   H        +     + + 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEH--------AQHKFQKKWICHFCGRASHIHLYCFWLYGSRAHQRSHKLPFKAHGVAQHVK

Query:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
         +  +K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

Q84VI0 Gag-pol polyprotein1.1e-3728.78Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI-------------------------------------AAFNYKFESLKMSEEETITEF
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                          KFE+LKM EEE I +F
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI-------------------------------------AAFNYKFESLKMSEEETITEF

Query:  N-------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLA
        +                         +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L 
Subjt:  N-------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLA

Query:  QSISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHA
         ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K  +C  CEGYGH +AECP  L            DD E    SDSD +++A
Subjt:  QSISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHA

Query:  LVGCL---SPSSVLEANSSD----YCNVLIFE---TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFVSAQH
        L G       SS +E    +    Y  + I          +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V    ++
Subjt:  LVGCL---SPSSVLEANSSD----YCNVLIFE---TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFVSAQH

Query:  V-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVKNLRS
        V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H      
Subjt:  V-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVKNLRS

Query:  NKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
         K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

Q84VI2 Gag-pol polyprotein5.0e-3828.39Show/hide
Query:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   +L  E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKVGAKL--EKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  M+                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQD---SKDKKFKCRECEGYGHYQAECPNFLS---------DDDEEVTSSSDSDEEIHALV
        ++ LL KQF + L R DKR  P   N+  ++  G  +    D   S  K  +C  CEGYGH  AECP  L            D E    SDSD +++AL+
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQD---SKDKKFKCRECEGYGHYQAECPNFLS---------DDDEEVTSSSDSDEEIHALV

Query:  GCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFV
        G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG  +L +V +
Subjt:  GCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFV

Query:  SAQH-------------------VQITPHDYKQKCVCDEH--AQHKFQ------KKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVK
          ++                    +  P   +      +H    H  Q      KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H  
Subjt:  SAQH-------------------VQITPHDYKQKCVCDEH--AQHKFQ------KKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHVK

Query:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
                   V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  NLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

V9H042 Gag-protease polyprotein5.9e-3928.52Show/hide
Query:  PQVTDAKGKV--GAKLEKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN
        P++ D +GK   G K E+DW +EEDE  LGNS+A NA+FNG+                                         KFE+LKM EEE I +F+
Subjt:  PQVTDAKGKV--GAKLEKDWIEEEDEAFLGNSRAPNAIFNGI------------------------------------AAFNYKFESLKMSEEETITEFN

Query:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ
                                 +LRSLPKRFDMKVTAIEEA DI  ++                            +S +   E   + ++D+ L  
Subjt:  -------------------------VLRSLPKRFDMKVTAIEEAHDIATMK----------------------------LSVEPDAESYKNKESDKNLAQ

Query:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL
        ++ LL KQF + L R D+R  P   N+  ++  G  +    D K    K F+C  CEGYGH +AECP  L            DD E    SDSD +++AL
Subjt:  SISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSK---DKKFKCRECEGYGHYQAECPNFLS----------DDDEEVTSSSDSDEEIHAL

Query:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF
         G    +     +SSD  + + F+                     +  +    LE +       I ELK E+    ++LE+MTKS     KG   L +V 
Subjt:  VGCLSPSSVLEANSSDYCNVLIFE----------------TSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVF

Query:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV
           ++V                   +  P          +H             +KKW CH+CG+  HI  +C+ L+G   H  +S     K   V +H 
Subjt:  VSAQHV-------------------QITPHDYKQKCVCDEHAQHKF--------QKKWICHFCGRASHIHLYCFWLYGSRAH-QRSHKLPFKAHGVAQHV

Query:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE
              K+    V  T+LR+S ++DWY D+GCS+HMTG  +    IE
Subjt:  KNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTGAGGTGTATTGGGTTTGGAAACTCATTGGAGAGGAAAATTCGGCCGTCCACCATTGATGTGCATGTTAAGTCGCTTATTATCAAGGTGTGTTTTTTGGCAGA
AGTGTCTCTCGACTATCTTTGTGTATGTTGTCACAATTGTATCAATAACATCATATCAAATAATACTGGTTATTGTTCTGCAGATGCCGCTACTGCTGTTTATAATGTGC
ATCGCCTTCCAGACATAGGTGTGGTTACACCGAAGTGGGTTACCAATTTTAGAAAGGTTGGCCTCTTTTTCATTGAGAATGGAGGCGATCAGAGAAGGTGGATCAACAAC
TCTCCTCCTCAGGTTACAGACGCTAAAGGGAAGGTTGGTGCTAAACTAGAGAAAGACTGGATAGAGGAAGAAGATGAAGCTTTCTTAGGAAATTCTCGTGCTCCTAATGC
AATCTTTAACGGTATTGCAGCTTTTAACTATAAATTTGAGTCTCTAAAGATGTCAGAAGAAGAGACCATAACTGAGTTTAATGTTCTTCGTTCTCTTCCAAAAAGGTTTG
ATATGAAAGTCACCGCGATTGAGGAAGCTCATGATATAGCAACGATGAAATTATCTGTCGAGCCTGATGCTGAGAGTTATAAGAACAAGGAATCAGATAAGAATCTTGCT
CAATCCATTTCCTTGCTTACAAAACAATTTGAAGAGGCTCTTAGACGATGGGACAAGCGCAGTATGCCCAGGAATAGTAATGTTTCCTCCAATGTTGACAATGGTAAAGG
TTTCGGTTCCAGTCAAGATTCAAAAGACAAAAAATTTAAATGTAGAGAGTGCGAGGGATATGGACACTATCAAGCTGAATGTCCCAACTTTCTATCAGACGATGATGAGG
AAGTTACTTCAAGTAGTGATTCTGATGAAGAAATACATGCCTTAGTGGGTTGTCTATCTCCAAGCAGTGTTCTAGAGGCTAACTCATCTGATTATTGTAATGTTTTGATT
TTTGAGACGTCACCTAATAGAGAAACTGATCAGCATTGTCTTACCTTAGAGGAAGACAATCTTCGTCTCCTAAGTACTATCTTTGAGCTGAAGAAGGAACTGAGAATAAC
AAAGGCCGAGCTCGAGTCGATGACCAAGTCTTTTCCCAATGAGCTCAAAGGAGAGTCTTCCCTATTAAAGGTTTTCGTTTCTGCTCAACATGTTCAAATAACTCCTCATG
ACTACAAGCAGAAATGTGTTTGTGATGAACATGCTCAACACAAATTTCAGAAGAAGTGGATTTGTCATTTCTGTGGAAGAGCTAGTCATATTCATCTTTACTGCTTTTGG
TTGTATGGAAGTCGTGCTCATCAGAGATCTCATAAGCTGCCTTTTAAAGCTCATGGTGTTGCTCAACATGTTAAAAACTTGAGAAGCAACAAAGTGGAAAAATGTCAAGT
TGCTCTTACAACCTTACGCTCGTCCATTCAGAAGGATTGGTACTTTGATAATGGGTGCTCACAGCACATGACAGGGAAAAGTGATCAGAAATGGCAGATTGAACTATCCT
GGTCTGCCCATTCTCAAGGAAGTTATTATGGTGGAAGGATTAACAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTGAGGTGTATTGGGTTTGGAAACTCATTGGAGAGGAAAATTCGGCCGTCCACCATTGATGTGCATGTTAAGTCGCTTATTATCAAGGTGTGTTTTTTGGCAGA
AGTGTCTCTCGACTATCTTTGTGTATGTTGTCACAATTGTATCAATAACATCATATCAAATAATACTGGTTATTGTTCTGCAGATGCCGCTACTGCTGTTTATAATGTGC
ATCGCCTTCCAGACATAGGTGTGGTTACACCGAAGTGGGTTACCAATTTTAGAAAGGTTGGCCTCTTTTTCATTGAGAATGGAGGCGATCAGAGAAGGTGGATCAACAAC
TCTCCTCCTCAGGTTACAGACGCTAAAGGGAAGGTTGGTGCTAAACTAGAGAAAGACTGGATAGAGGAAGAAGATGAAGCTTTCTTAGGAAATTCTCGTGCTCCTAATGC
AATCTTTAACGGTATTGCAGCTTTTAACTATAAATTTGAGTCTCTAAAGATGTCAGAAGAAGAGACCATAACTGAGTTTAATGTTCTTCGTTCTCTTCCAAAAAGGTTTG
ATATGAAAGTCACCGCGATTGAGGAAGCTCATGATATAGCAACGATGAAATTATCTGTCGAGCCTGATGCTGAGAGTTATAAGAACAAGGAATCAGATAAGAATCTTGCT
CAATCCATTTCCTTGCTTACAAAACAATTTGAAGAGGCTCTTAGACGATGGGACAAGCGCAGTATGCCCAGGAATAGTAATGTTTCCTCCAATGTTGACAATGGTAAAGG
TTTCGGTTCCAGTCAAGATTCAAAAGACAAAAAATTTAAATGTAGAGAGTGCGAGGGATATGGACACTATCAAGCTGAATGTCCCAACTTTCTATCAGACGATGATGAGG
AAGTTACTTCAAGTAGTGATTCTGATGAAGAAATACATGCCTTAGTGGGTTGTCTATCTCCAAGCAGTGTTCTAGAGGCTAACTCATCTGATTATTGTAATGTTTTGATT
TTTGAGACGTCACCTAATAGAGAAACTGATCAGCATTGTCTTACCTTAGAGGAAGACAATCTTCGTCTCCTAAGTACTATCTTTGAGCTGAAGAAGGAACTGAGAATAAC
AAAGGCCGAGCTCGAGTCGATGACCAAGTCTTTTCCCAATGAGCTCAAAGGAGAGTCTTCCCTATTAAAGGTTTTCGTTTCTGCTCAACATGTTCAAATAACTCCTCATG
ACTACAAGCAGAAATGTGTTTGTGATGAACATGCTCAACACAAATTTCAGAAGAAGTGGATTTGTCATTTCTGTGGAAGAGCTAGTCATATTCATCTTTACTGCTTTTGG
TTGTATGGAAGTCGTGCTCATCAGAGATCTCATAAGCTGCCTTTTAAAGCTCATGGTGTTGCTCAACATGTTAAAAACTTGAGAAGCAACAAAGTGGAAAAATGTCAAGT
TGCTCTTACAACCTTACGCTCGTCCATTCAGAAGGATTGGTACTTTGATAATGGGTGCTCACAGCACATGACAGGGAAAAGTGATCAGAAATGGCAGATTGAACTATCCT
GGTCTGCCCATTCTCAAGGAAGTTATTATGGTGGAAGGATTAACAACTAA
Protein sequenceShow/hide protein sequence
MVVRCIGFGNSLERKIRPSTIDVHVKSLIIKVCFLAEVSLDYLCVCCHNCINNIISNNTGYCSADAATAVYNVHRLPDIGVVTPKWVTNFRKVGLFFIENGGDQRRWINN
SPPQVTDAKGKVGAKLEKDWIEEEDEAFLGNSRAPNAIFNGIAAFNYKFESLKMSEEETITEFNVLRSLPKRFDMKVTAIEEAHDIATMKLSVEPDAESYKNKESDKNLA
QSISLLTKQFEEALRRWDKRSMPRNSNVSSNVDNGKGFGSSQDSKDKKFKCRECEGYGHYQAECPNFLSDDDEEVTSSSDSDEEIHALVGCLSPSSVLEANSSDYCNVLI
FETSPNRETDQHCLTLEEDNLRLLSTIFELKKELRITKAELESMTKSFPNELKGESSLLKVFVSAQHVQITPHDYKQKCVCDEHAQHKFQKKWICHFCGRASHIHLYCFW
LYGSRAHQRSHKLPFKAHGVAQHVKNLRSNKVEKCQVALTTLRSSIQKDWYFDNGCSQHMTGKSDQKWQIELSWSAHSQGSYYGGRINN