CuGenDBv2

Clc11G10265 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G10265
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag-pol polyprotein
Genome location: ClcChr11:15459371..15461567
RNA-Seq Expression: Clc11G10265
Synteny: Clc11G10265
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function); GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
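The genome location field packs the chromosome name and both coordinates into one string. A minimal sketch of splitting it apart follows. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("ClcChr11:15459371..15461567")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2197 bp on ClcChr11.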


Homology
GenBank top hits (e value, % identity, alignment)
AAO73521.1 gag-pol polyprotein [Glycine max] (e value: 9.9e-70; % identity: 39.65)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL + HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     +  +D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE MTKS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

AAO73523.1 gag-pol polyprotein [Glycine max] (e value: 3.2e-68; % identity: 39.43)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+A +IL   HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL+     E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE MTKS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

AAO73527.1 gag-pol polyprotein [Glycine max] (e value: 1.3e-69; % identity: 39.43)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL + HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE M KS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

AAO73529.1 gag-pol polyprotein [Glycine max] (e value: 3.7e-69; % identity: 40.49)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDG+NY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL   HEGTSK                                       AL E++++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++  L K+F KVL R D+R      ++S +++  +     S++K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCLSLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIE
        I+     CR CEG+ H +AECP  LK+Q K  SV   DD + E  SDSD ++ AL  R  S E S  T  S+I    +A+ + E    ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCLSLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIE

Query:  KLI----EDNHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDI
        K+I     +       I +LK E+    ++LE MTKS++   KG D+
Subjt:  KLI----EDNHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDI

KAA0032410.1 gag-proteinase polyprotein [Cucumis melo var. makuwa] (e value: 3.1e-71; % identity: 40.1)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTC
        ME IREG ST+ P +LDG NYSYWK+RM  F+K++D + W+ ++ G+ PP +   DG    K E D+T+ E++  +GN +A+NAIFN VD N+F LIN+C
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTC

Query:  VSAKEAWDILAVAHEGTSKF------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIAL
         +AKEAW +L V +EGTSK+             L EKI + K+V+KVL+SLP++F+M V AIEEAH I T+++DELFG + T +M+  DK +KK K IA 
Subjt:  VSAKEAWDILAVAHEGTSKF------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIAL

Query:  QSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ
        +S  + +  IV   + + N+ +SI+LL K+F KV+R++        N  +PN           +  N  +N      GK+   E + F+CREC G  HYQ
Subjt:  QSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ

Query:  AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDIV---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFEL
         ECP FL+RQ KS+  TL D+D + S D D  + A   C++        E S+     ++    + + W ED++A  +QK+ I+ L+E+N  L+S I  L
Subjt:  AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDIV---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFEL

Query:  KKELKSSKAELEVMTKSVQ
        K +LK  + + +   K V+
Subjt:  KKELKSSKAELEVMTKSVQ
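
In BLAST-style alignment blocks such as those above, the middle row repeats the residue letter where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch or gap. The sketch below counts those marks for a single row. The helper name `midline_stats` is hypothetical, and the caller is assumed to have already removed the 8-character left margin and to pass the width of the matching query segment. Counting one row will not by itself reproduce the % identity reported for a whole hit.

```python
def midline_stats(midline: str, width: int):
    """Return (identities, positives, width) for one alignment midline.

    `midline` is the middle row with the left margin removed; `width`
    is the length of the matching query segment, needed because
    trailing blanks are often trimmed from the midline.
    """
    row = midline[:width].ljust(width)
    identities = sum(ch.isalpha() for ch in row)   # letters mark identical residues
    positives = identities + row.count("+")        # '+' marks conservative substitutions
    return identities, positives, width


ident, pos, n = midline_stats("M   +EGG", 8)
print(f"{ident}/{n} identical, {pos}/{n} positive")
```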

TrEMBL top hits (e value, % identity, alignment)
A0A5D3DCW5 Gag-proteinase polyprotein (e value: 1.5e-71; % identity: 40.1)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTC
        ME IREG ST+ P +LDG NYSYWK+RM  F+K++D + W+ ++ G+ PP +   DG    K E D+T+ E++  +GN +A+NAIFN VD N+F LIN+C
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLINTC

Query:  VSAKEAWDILAVAHEGTSKF------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIAL
         +AKEAW +L V +EGTSK+             L EKI + K+V+KVL+SLP++F+M V AIEEAH I T+++DELFG + T +M+  DK +KK K IA 
Subjt:  VSAKEAWDILAVAHEGTSKF------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIAL

Query:  QSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ
        +S  + +  IV   + + N+ +SI+LL K+F KV+R++        N  +PN           +  N  +N      GK+   E + F+CREC G  HYQ
Subjt:  QSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPN----------VQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQ

Query:  AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDIV---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFEL
         ECP FL+RQ KS+  TL D+D + S D D  + A   C++        E S+     ++    + + W ED++A  +QK+ I+ L+E+N  L+S I  L
Subjt:  AECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLS-------LESSQVTSPSDIV---IAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFEL

Query:  KKELKSSKAELEVMTKSVQ
        K +LK  + + +   K V+
Subjt:  KKELKSSKAELEVMTKSVQ

Q84VH6 Gag-pol polyprotein (e value: 1.8e-69; % identity: 40.49)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDG+NY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL   HEGTSK                                       AL E++++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++  L K+F KVL R D+R      ++S +++  +     S++K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCLSLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIE
        I+     CR CEG+ H +AECP  LK+Q K  SV   DD + E  SDSD ++ AL  R  S E S  T  S+I    +A+ + E    ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDD-DHESSSDSDDEIRALM-RCLSLESSQVTSPSDIV---IAMHWIE---DAQAIIVQKKRIE

Query:  KLI----EDNHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDI
        K+I     +       I +LK E+    ++LE MTKS++   KG D+
Subjt:  KLI----EDNHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDI

Q84VH8 Gag-pol polyprotein (e value: 6.2e-70; % identity: 39.43)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL + HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE M KS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

Q84VI2 Gag-pol polyprotein (e value: 1.5e-68; % identity: 39.43)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+A +IL   HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     + ++D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL+     E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE MTKS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

Q84VI4 Gag-pol polyprotein (e value: 4.8e-70; % identity: 39.65)
Query:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN
        M   +EGG    P +LDGSNY YWKARM  FLKS+D++TWK VI GW  P++ DT+G  +  LK E+D+T++EDE+ LGN +ALNA+FN VDKNIF LIN
Subjt:  MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVS--LKLEKDFTEDEDEV-LGNCQALNAIFNRVDKNIFCLIN

Query:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK
        TC  AK+AW+IL + HEGTSK                                       AL E+I++EKLVRK+LRSLPKRFDMKV AIEEA  I  M+
Subjt:  TCVSAKEAWDILAVAHEGTSKF--------------------------------------ALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMK

Query:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR
        VDEL G + TF++   D+++KKSK++A  S  E +     +  +D+ L  ++ LL K+F KVL R DKR      ++  +++  +     S+ K      
Subjt:  VDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGKVLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSR

Query:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE
        I+     C  CEG+ H  AECP  LK+  K  SV   D + E  SDSD ++ AL      E+++ +S +D  I    +          ++ I+ Q+ +++
Subjt:  IERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDIVIAMHWIE--------DAQAIIVQKKRIE

Query:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG
        K+I D           I ELK E+    ++LE MTKS++   KG D    V L  K  G
Subjt:  KLIED----NHCLLSTIFELKKELKSSKAELEVMTKSVQ---KGCDIALAVNLAFKEQG

SwissProt top hits: No hits found

Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGAGTCTATCAGGGAAGGTGGATCAACAACTTGTCCTTCTGTACTTGATGGTTCAAACTATTCGTATTGGAAGGCTAGGATGACAACCTTCTTAAAATCTATCGACAA
CAAGACCTGGAAAGTCGTCATTTTCGGATGGACTCCTCCTCAAGTCACTGATACAGATGGAAATGTGAGTCTTAAGCTTGAGAAAGACTTTACGGAAGATGAAGATGAGG
TGTTAGGGAATTGTCAGGCTCTTAATGCCATCTTTAACAGAGTAGATAAGAATATCTTTTGCTTAATCAACACTTGTGTCTCTGCCAAAGAAGCATGGGATATTCTTGCT
GTTGCACATGAAGGAACCTCCAAATTTGCTCTCGTTGAGAAGATATCAGAAGAAAAGTTGGTGCGTAAGGTTCTTCGATCCCTTCCTAAAAGATTTGATATGAAAGTTAT
AGCTATTGAAGAGGCTCATCACATTGCCACCATGAAAGTTGATGAGCTTTTTGGCTTTATGTGTACTTTCAAAATGTCGTTTGATGACAAGTCTGATAAGAAATCTAAGA
GTATTGCATTACAGTCGACTATTGAAAATGATGCTCCTATTGTCAAAATTAAGGAATCTGATCAGAACCTCGCTCAATCGATATCTCTTCTGGCCAAGAAGTTTGGAAAG
GTCCTCAGGCGATGGGACAAACGTGGAGGATCTCGGGGTAATCATGTGTCTCCCAATGTCCAAGACAACAATAGTCCAAATAATCACTCCAACCAAAAGATTGGGAAGCA
ATCAAGGATTGAGAGACAAAAATTCAGATGTAGAGAATGTGAGGGATTTGACCATTACCAAGCTGAATGTCCAAACTTTCTGAAAAGACAGAACAAGAGTTACTCTGTGA
CATTATTCGATGATGACCATGAATCAAGTAGTGACTCTGATGATGAAATTCGTGCTTTAATGAGATGTTTATCTCTTGAGAGTTCCCAAGTGACATCCCCTTCGGATATC
GTGATTGCTATGCATTGGATTGAAGACGCCCAAGCCATTATTGTTCAGAAAAAGAGAATTGAAAAGTTGATAGAAGACAACCACTGTCTTTTGAGCACTATTTTTGAATT
GAAGAAGGAATTGAAATCCTCTAAAGCTGAGCTTGAAGTAATGACCAAGTCAGTTCAAAAAGGGTGTGACATTGCGTTAGCTGTCAACCTTGCGTTCAAGGAGCAAGGTT
GCGTCCAACTGAGCTGCCATAAAAGAAAAACAGCTACCTTGCAGTTTAGAGTCTCAAGCCTTGCGTTCAATGAGCTTCGGAAAGCAGAATTTGAAGGAAAAAAAGGGAAT
TCCTTAAGCGTTTTTGTGAGACGTGAGAAGGCATGTGAGTTTTCCCAGGACCCAGTACTGAGGGACATAGAGAAGGTTGTGATGTTGGGTGTGTTGAGTCTTGCTCCACC
CCAACTAAGGTTGTTTTCAGCGTTGAGTGCATTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGACGTTTAGTGCGTTGGGTTTTGCTCCGCCTTAA
mRNA sequence
ATGGAGTCTATCAGGGAAGGTGGATCAACAACTTGTCCTTCTGTACTTGATGGTTCAAACTATTCGTATTGGAAGGCTAGGATGACAACCTTCTTAAAATCTATCGACAA
CAAGACCTGGAAAGTCGTCATTTTCGGATGGACTCCTCCTCAAGTCACTGATACAGATGGAAATGTGAGTCTTAAGCTTGAGAAAGACTTTACGGAAGATGAAGATGAGG
TGTTAGGGAATTGTCAGGCTCTTAATGCCATCTTTAACAGAGTAGATAAGAATATCTTTTGCTTAATCAACACTTGTGTCTCTGCCAAAGAAGCATGGGATATTCTTGCT
GTTGCACATGAAGGAACCTCCAAATTTGCTCTCGTTGAGAAGATATCAGAAGAAAAGTTGGTGCGTAAGGTTCTTCGATCCCTTCCTAAAAGATTTGATATGAAAGTTAT
AGCTATTGAAGAGGCTCATCACATTGCCACCATGAAAGTTGATGAGCTTTTTGGCTTTATGTGTACTTTCAAAATGTCGTTTGATGACAAGTCTGATAAGAAATCTAAGA
GTATTGCATTACAGTCGACTATTGAAAATGATGCTCCTATTGTCAAAATTAAGGAATCTGATCAGAACCTCGCTCAATCGATATCTCTTCTGGCCAAGAAGTTTGGAAAG
GTCCTCAGGCGATGGGACAAACGTGGAGGATCTCGGGGTAATCATGTGTCTCCCAATGTCCAAGACAACAATAGTCCAAATAATCACTCCAACCAAAAGATTGGGAAGCA
ATCAAGGATTGAGAGACAAAAATTCAGATGTAGAGAATGTGAGGGATTTGACCATTACCAAGCTGAATGTCCAAACTTTCTGAAAAGACAGAACAAGAGTTACTCTGTGA
CATTATTCGATGATGACCATGAATCAAGTAGTGACTCTGATGATGAAATTCGTGCTTTAATGAGATGTTTATCTCTTGAGAGTTCCCAAGTGACATCCCCTTCGGATATC
GTGATTGCTATGCATTGGATTGAAGACGCCCAAGCCATTATTGTTCAGAAAAAGAGAATTGAAAAGTTGATAGAAGACAACCACTGTCTTTTGAGCACTATTTTTGAATT
GAAGAAGGAATTGAAATCCTCTAAAGCTGAGCTTGAAGTAATGACCAAGTCAGTTCAAAAAGGGTGTGACATTGCGTTAGCTGTCAACCTTGCGTTCAAGGAGCAAGGTT
GCGTCCAACTGAGCTGCCATAAAAGAAAAACAGCTACCTTGCAGTTTAGAGTCTCAAGCCTTGCGTTCAATGAGCTTCGGAAAGCAGAATTTGAAGGAAAAAAAGGGAAT
TCCTTAAGCGTTTTTGTGAGACGTGAGAAGGCATGTGAGTTTTCCCAGGACCCAGTACTGAGGGACATAGAGAAGGTTGTGATGTTGGGTGTGTTGAGTCTTGCTCCACC
CCAACTAAGGTTGTTTTCAGCGTTGAGTGCATTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGACGTTTAGTGCGTTGGGTTTTGCTCCGCCTTAA
Protein sequence
MESIREGGSTTCPSVLDGSNYSYWKARMTTFLKSIDNKTWKVVIFGWTPPQVTDTDGNVSLKLEKDFTEDEDEVLGNCQALNAIFNRVDKNIFCLINTCVSAKEAWDILA
VAHEGTSKFALVEKISEEKLVRKVLRSLPKRFDMKVIAIEEAHHIATMKVDELFGFMCTFKMSFDDKSDKKSKSIALQSTIENDAPIVKIKESDQNLAQSISLLAKKFGK
VLRRWDKRGGSRGNHVSPNVQDNNSPNNHSNQKIGKQSRIERQKFRCRECEGFDHYQAECPNFLKRQNKSYSVTLFDDDHESSSDSDDEIRALMRCLSLESSQVTSPSDI
VIAMHWIEDAQAIIVQKKRIEKLIEDNHCLLSTIFELKKELKSSKAELEVMTKSVQKGCDIALAVNLAFKEQGCVQLSCHKRKTATLQFRVSSLAFNELRKAEFEGKKGN
SLSVFVRREKACEFSQDPVLRDIEKVVMLGVLSLAPPQLRLFSALSALSFAPPQLRLFSTFSALGFAPP