CuGenDBv2

Pay0001894 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001894
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr06:23076367..23078262
RNA-Seq Expression: Pay0001894
Synteny: Pay0001894
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits
KAA0037703.1 uncharacterized protein E6C27_scaffold1593G00270 [Cucumis melo var. makuwa] (e value: 9.8e-281; % identity: 83.19)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILEL TTGR KSV GISSQVEVDLNQVLEDMPAYP+GFTPQRSSSP MTY TQNPNPITQQ +H+SDPMST IT+
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------
        SGKKISEEQ SRK+LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYC+KMSAYAHDDKLLI+CFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------

Query:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
                                 YNIDMTPDRLDLQRMEKKN             LA QVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
Subjt:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH
        IGE IEFGVKNGRI DP+SEIRRMMTPKKKEEEIHELSST++VVHVSSPTVGQ NYSY+YQNGE LPQLLKSHQVAIVPQEPLQPPYPKWYDPN+KCEYH
Subjt:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH

Query:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA
          +VGHSTENCFPLK KVQSLVKA WLKFKKTEEE DVNQNP PNHE P INI D F +RYKNKVCDVTTSMNTLFQIL RAGYLSPRFNNDEG KF CA
Subjt:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA

Query:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ
        NE+Q LFHP+IDDHFIEDCCE KNEVQ LMD KILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTIS NLL+ Q PPKFELNNWEIKRTLKVSKGSQ
Subjt:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ

Query:  K
        K
Subjt:  K

KAA0055146.1 uncharacterized protein E6C27_scaffold231G00770 [Cucumis melo var. makuwa] (e value: 9.5e-284; % identity: 82.69)
Query:  QVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITESGKKISE
        +VQAVRQDVEGLKDQLAKILELLTTGR KSV GISSQVEVDLNQVLEDMPAYP GFTPQRSSSP MTY TQNPNPITQQENH+SDPMST ITESGKKISE
Subjt:  QVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITESGKKISE

Query:  EQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL-------------
        EQGSRK+LEFLEERLRAI+GADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYCRKMSAYAHD+KLLI+CFQDSL             
Subjt:  EQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL-------------

Query:  ------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF
                          YNIDM PDRLDLQRMEKKN             LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF
Subjt:  ------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF

Query:  GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAI
        GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSP VGQTNYSYSYQNG                              ELLPQLLKSHQVAI
Subjt:  GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAI

Query:  VPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQ
        VPQEPLQPPYPKWYDPN+KCEYHA VVGHSTENCFPLK KVQSLVKA WLKFKK EEESDVNQNP PNHE P INI DTF ERYKNKVCDVTTSMNTLFQ
Subjt:  VPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQ

Query:  ILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQ
        ILRRAGYLSPRFNNDEGEKF CANE+Q LFHP+IDDHFIEDCCEFKNEVQ LMD KILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTIS NLL+SQ
Subjt:  ILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQ

Query:  FPPKFELNNWEIKRTLKVSKGSQK
        FPPKFELNNWEIKRTLKV+KGSQK
Subjt:  FPPKFELNNWEIKRTLKVSKGSQK

KAA0060308.1 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein [Cucumis melo var. makuwa] (e value: 4.2e-239; % identity: 84.9)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGR KSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSLYNIDMT
        SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSLYNIDMT

Query:  PDRLDLQRMEKKNLAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV
                                  L   +   L      RMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV
Subjt:  PDRLDLQRMEKKNLAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV

Query:  HVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL
        HVSSPTVGQTNYSYSYQNG                              ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL
Subjt:  HVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL

Query:  KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH
        KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTF ERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH
Subjt:  KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH

Query:  FIEDCCEFKN
        FIEDCCEFKN
Subjt:  FIEDCCEFKN

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa] (e value: 4.9e-232; % identity: 72.92)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHM---TYATQ----NPNPITQQENHMSDP
        MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGR KSVVG SSQVEVDLNQVLEDMP YP GFTPQRSSSP M   TY T     NPN  TQQ  H ++P
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHM---TYATQ----NPNPITQQENHMSDP

Query:  MSTLITESGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDS
        +STLI E GKKISEEQGSR++LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN T+CPKSHLVMYCRKMSAYAHDDKLLI+CFQDS
Subjt:  MSTLITESGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDS

Query:  L-------------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSAST
        L                               YNIDM PDRLDLQRMEKKN             LAAQVQPPLTDKEL AMFINTLRAPYYDRMVGSAST
Subjt:  L-------------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRV-VHVSSPTVGQTNYSYSYQNG-----------------------------
        NFSDVITIGERIEFGVKNGRISDPASE RR+MTPKKKE E+HELSSTQRV   VSSP VGQTN+S SYQNG                             
Subjt:  NFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRV-VHVSSPTVGQTNYSYSYQNG-----------------------------

Query:  -ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYK
         ELLPQL+KSHQVAIVPQEPLQPPYPKWYDPN KCEYHA  VGHSTENCFPLK KVQSLVKA WL+FKKT EE DVNQNP PNHE P IN  DTF +R+K
Subjt:  -ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYK

Query:  NKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMII----DKETSND
        NKV DV TSM TLFQIL  AGYLSPRFNND+ EK  C N EQ LFHP+ +DH IEDCCEFKNEVQ LMD+KILL+GQMSMQEIEV+MI     +++TSN+
Subjt:  NKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMII----DKETSND

Query:  TS
        T+
Subjt:  TS

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa] (e value: 5.1e-03; % identity: 42.53)
Query:  CEFKNE---VQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDT--SITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQK
        CE K E   + +L +T         + E+  D    K  + D+  SI V+S+NT   + L+ + PP FELNNWEIK+TLKV+KGSQK
Subjt:  CEFKNE---VQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDT--SITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQK

XP_016903339.1 PREDICTED: uncharacterized protein LOC103502838 [Cucumis melo] (e value: 9.8e-281; % identity: 83.19)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILEL TTGR KSV GISSQVEVDLNQVLEDMPAYP+GFTPQRSSSP MTY TQNPNPITQQ +H+SDPMST IT+
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------
        SGKKISEEQ SRK+LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYC+KMSAYAHDDKLLI+CFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------

Query:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
                                 YNIDMTPDRLDLQRMEKKN             LA QVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
Subjt:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH
        IGE IEFGVKNGRI DP+SEIRRMMTPKKKEEEIHELSST++VVHVSSPTVGQ NYSY+YQNGE LPQLLKSHQVAIVPQEPLQPPYPKWYDPN+KCEYH
Subjt:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH

Query:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA
          +VGHSTENCFPLK KVQSLVKA WLKFKKTEEE DVNQNP PNHE P INI D F +RYKNKVCDVTTSMNTLFQIL RAGYLSPRFNNDEG KF CA
Subjt:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA

Query:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ
        NE+Q LFHP+IDDHFIEDCCE KNEVQ LMD KILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTIS NLL+ Q PPKFELNNWEIKRTLKVSKGSQ
Subjt:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ

Query:  K
        K
Subjt:  K

TrEMBL top hits
A0A1S4E534 uncharacterized protein LOC103502838 (e value: 4.8e-281; % identity: 83.19)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILEL TTGR KSV GISSQVEVDLNQVLEDMPAYP+GFTPQRSSSP MTY TQNPNPITQQ +H+SDPMST IT+
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------
        SGKKISEEQ SRK+LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYC+KMSAYAHDDKLLI+CFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------

Query:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
                                 YNIDMTPDRLDLQRMEKKN             LA QVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
Subjt:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH
        IGE IEFGVKNGRI DP+SEIRRMMTPKKKEEEIHELSST++VVHVSSPTVGQ NYSY+YQNGE LPQLLKSHQVAIVPQEPLQPPYPKWYDPN+KCEYH
Subjt:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH

Query:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA
          +VGHSTENCFPLK KVQSLVKA WLKFKKTEEE DVNQNP PNHE P INI D F +RYKNKVCDVTTSMNTLFQIL RAGYLSPRFNNDEG KF CA
Subjt:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA

Query:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ
        NE+Q LFHP+IDDHFIEDCCE KNEVQ LMD KILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTIS NLL+ Q PPKFELNNWEIKRTLKVSKGSQ
Subjt:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ

Query:  K
        K
Subjt:  K

A0A5A7T401 Retrotrans_gag domain-containing protein (e value: 4.8e-281; % identity: 83.19)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILEL TTGR KSV GISSQVEVDLNQVLEDMPAYP+GFTPQRSSSP MTY TQNPNPITQQ +H+SDPMST IT+
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------
        SGKKISEEQ SRK+LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYC+KMSAYAHDDKLLI+CFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL------

Query:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
                                 YNIDMTPDRLDLQRMEKKN             LA QVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT
Subjt:  -------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH
        IGE IEFGVKNGRI DP+SEIRRMMTPKKKEEEIHELSST++VVHVSSPTVGQ NYSY+YQNGE LPQLLKSHQVAIVPQEPLQPPYPKWYDPN+KCEYH
Subjt:  IGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYH

Query:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA
          +VGHSTENCFPLK KVQSLVKA WLKFKKTEEE DVNQNP PNHE P INI D F +RYKNKVCDVTTSMNTLFQIL RAGYLSPRFNNDEG KF CA
Subjt:  ARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCA

Query:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ
        NE+Q LFHP+IDDHFIEDCCE KNEVQ LMD KILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTIS NLL+ Q PPKFELNNWEIKRTLKVSKGSQ
Subjt:  NEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQ

Query:  K
        K
Subjt:  K

A0A5A7ULI2 Retrotrans_gag domain-containing protein (e value: 4.6e-284; % identity: 82.69)
Query:  QVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITESGKKISE
        +VQAVRQDVEGLKDQLAKILELLTTGR KSV GISSQVEVDLNQVLEDMPAYP GFTPQRSSSP MTY TQNPNPITQQENH+SDPMST ITESGKKISE
Subjt:  QVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITESGKKISE

Query:  EQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL-------------
        EQGSRK+LEFLEERLRAI+GADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN TTCPKSHLVMYCRKMSAYAHD+KLLI+CFQDSL             
Subjt:  EQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL-------------

Query:  ------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF
                          YNIDM PDRLDLQRMEKKN             LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF
Subjt:  ------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEF

Query:  GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAI
        GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSP VGQTNYSYSYQNG                              ELLPQLLKSHQVAI
Subjt:  GVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAI

Query:  VPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQ
        VPQEPLQPPYPKWYDPN+KCEYHA VVGHSTENCFPLK KVQSLVKA WLKFKK EEESDVNQNP PNHE P INI DTF ERYKNKVCDVTTSMNTLFQ
Subjt:  VPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQ

Query:  ILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQ
        ILRRAGYLSPRFNNDEGEKF CANE+Q LFHP+IDDHFIEDCCEFKNEVQ LMD KILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTIS NLL+SQ
Subjt:  ILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQ

Query:  FPPKFELNNWEIKRTLKVSKGSQK
        FPPKFELNNWEIKRTLKV+KGSQK
Subjt:  FPPKFELNNWEIKRTLKVSKGSQK

A0A5A7VAU5 Uncharacterized protein (e value: 2.4e-232; % identity: 72.92)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHM---TYATQ----NPNPITQQENHMSDP
        MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGR KSVVG SSQVEVDLNQVLEDMP YP GFTPQRSSSP M   TY T     NPN  TQQ  H ++P
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHM---TYATQ----NPNPITQQENHMSDP

Query:  MSTLITESGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDS
        +STLI E GKKISEEQGSR++LEFLEERLR IEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYN T+CPKSHLVMYCRKMSAYAHDDKLLI+CFQDS
Subjt:  MSTLITESGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDS

Query:  L-------------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSAST
        L                               YNIDM PDRLDLQRMEKKN             LAAQVQPPLTDKEL AMFINTLRAPYYDRMVGSAST
Subjt:  L-------------------------------YNIDMTPDRLDLQRMEKKN-------------LAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRV-VHVSSPTVGQTNYSYSYQNG-----------------------------
        NFSDVITIGERIEFGVKNGRISDPASE RR+MTPKKKE E+HELSSTQRV   VSSP VGQTN+S SYQNG                             
Subjt:  NFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRV-VHVSSPTVGQTNYSYSYQNG-----------------------------

Query:  -ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYK
         ELLPQL+KSHQVAIVPQEPLQPPYPKWYDPN KCEYHA  VGHSTENCFPLK KVQSLVKA WL+FKKT EE DVNQNP PNHE P IN  DTF +R+K
Subjt:  -ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYK

Query:  NKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMII----DKETSND
        NKV DV TSM TLFQIL  AGYLSPRFNND+ EK  C N EQ LFHP+ +DH IEDCCEFKNEVQ LMD+KILL+GQMSMQEIEV+MI     +++TSN+
Subjt:  NKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMII----DKETSND

Query:  TS
        T+
Subjt:  TS

A0A5A7VAU5 Uncharacterized protein (e value: 2.5e-03; % identity: 42.53)
Query:  CEFKNE---VQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDT--SITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQK
        CE K E   + +L +T         + E+  D    K  + D+  SI V+S+NT   + L+ + PP FELNNWEIK+TLKV+KGSQK
Subjt:  CEFKNE---VQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDT--SITVISKNTISLNLLISQFPPKFELNNWEIKRTLKVSKGSQK

A0A5D3DEF4 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein (e value: 2.0e-239; % identity: 84.9)
Query:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
        MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGR KSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE
Subjt:  MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITE

Query:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSLYNIDMT
        SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSL      
Subjt:  SGKKISEEQGSRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSLYNIDMT

Query:  PDRLDLQRMEKKNLAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV
                                  L   +   L      RMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV
Subjt:  PDRLDLQRMEKKNLAAQVQPPLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVV

Query:  HVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL
        HVSSPTVGQTNYSYSYQNG                              ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL
Subjt:  HVSSPTVGQTNYSYSYQNG------------------------------ELLPQLLKSHQVAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPL

Query:  KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH
        KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTF ERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH
Subjt:  KTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGYLSPRFNNDEGEKFRCANEEQYLFHPKIDDH

Query:  FIEDCCEFKN
        FIEDCCEFKN
Subjt:  FIEDCCEFKN

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGATGAGCAAACCAATGATCAGGTTCAAGCAGTTCGTCAAGACGTTGAAGGATTGAAAGATCAATTGGCAAAGATCTTAGAACTGCTCACTACTGGAAGAGAAAAGAG
TGTTGTAGGGATTTCATCACAAGTGGAAGTAGATCTTAACCAGGTGCTAGAAGACATGCCTGCATACCCTCTAGGTTTTACTCCGCAAAGGTCATCTAGTCCTCACATGA
CTTATGCTACACAAAATCCTAATCCGATCACCCAACAGGAAAACCATATGAGTGATCCAATGTCTACTCTAATTACAGAAAGTGGTAAGAAAATTTCAGAAGAGCAGGGT
AGTAGAAAAAAACTGGAATTTCTAGAAGAAAGATTACGCGCCATTGAAGGTGCAGACATGTATGGGAGTATCGACGCAACACAACTATGTTTGATATCAGATGTGGTGAT
TCCTCCCAAATTCAAGACTCCAGATTTTGAGAAGTACAATGAAACCACGTGCCCAAAAAGTCATCTAGTTATGTACTGTCGGAAAATGTCAGCTTACGCTCATGATGATA
AATTGTTGATTTATTGCTTCCAAGACAGCTTATACAACATTGATATGACACCCGATCGTCTGGATCTTCAGAGAATGGAAAAAAAGAATTTAGCTGCACAAGTGCAACCT
CCCCTGACCGACAAAGAATTGATGGCTATGTTCATAAACACTCTTCGGGCCCCATATTATGATAGAATGGTTGGAAGTGCTTCAACTAATTTCTCAGACGTCATAACCAT
TGGGGAGAGGATTGAATTTGGAGTGAAGAATGGAAGGATCTCTGATCCCGCTTCAGAGATAAGAAGAATGATGACCCCAAAGAAAAAAGAAGAAGAAATACATGAGTTGA
GCTCGACTCAAAGAGTAGTGCATGTATCCTCACCAACTGTGGGGCAGACAAATTACTCCTATAGTTATCAGAATGGAGAGCTTTTGCCTCAACTTTTAAAGAGTCATCAA
GTGGCTATCGTACCACAAGAGCCTTTGCAACCACCATATCCTAAGTGGTATGACCCTAACATAAAATGTGAATATCATGCTAGGGTAGTTGGACATTCTACAGAGAACTG
TTTTCCTTTGAAAACTAAAGTGCAAAGTCTAGTCAAGGCCGAATGGTTGAAATTCAAGAAGACAGAAGAAGAGTCTGACGTTAACCAGAACCCTTTCCCAAATCATGAGC
GTCCCACCATAAATATTGCTGACACATTCGCGGAAAGATACAAGAATAAGGTGTGTGATGTGACTACTTCAATGAATACACTTTTCCAAATCCTTCGTAGAGCTGGATAT
TTGTCACCAAGGTTTAACAATGATGAGGGGGAGAAGTTTAGATGCGCCAACGAGGAGCAGTATTTATTCCACCCTAAGATAGATGACCATTTCATTGAAGATTGTTGTGA
GTTTAAGAATGAGGTACAAAATTTGATGGATACAAAGATTCTCTTGGTAGGACAAATGAGCATGCAAGAAATCGAGGTCGATATGATTATTGATAAGGAAACTTCAAATG
ATACCTCAATTACGGTTATCTCCAAGAATACTATTTCGCTTAATCTATTGATTTCTCAGTTTCCACCAAAATTTGAGTTGAACAATTGGGAGATCAAGAGGACGCTAAAA
GTCTCTAAAGGATCACAAAAGTAA
mRNA sequence
ATGGATGAGCAAACCAATGATCAGGTTCAAGCAGTTCGTCAAGACGTTGAAGGATTGAAAGATCAATTGGCAAAGATCTTAGAACTGCTCACTACTGGAAGAGAAAAGAG
TGTTGTAGGGATTTCATCACAAGTGGAAGTAGATCTTAACCAGGTGCTAGAAGACATGCCTGCATACCCTCTAGGTTTTACTCCGCAAAGGTCATCTAGTCCTCACATGA
CTTATGCTACACAAAATCCTAATCCGATCACCCAACAGGAAAACCATATGAGTGATCCAATGTCTACTCTAATTACAGAAAGTGGTAAGAAAATTTCAGAAGAGCAGGGT
AGTAGAAAAAAACTGGAATTTCTAGAAGAAAGATTACGCGCCATTGAAGGTGCAGACATGTATGGGAGTATCGACGCAACACAACTATGTTTGATATCAGATGTGGTGAT
TCCTCCCAAATTCAAGACTCCAGATTTTGAGAAGTACAATGAAACCACGTGCCCAAAAAGTCATCTAGTTATGTACTGTCGGAAAATGTCAGCTTACGCTCATGATGATA
AATTGTTGATTTATTGCTTCCAAGACAGCTTATACAACATTGATATGACACCCGATCGTCTGGATCTTCAGAGAATGGAAAAAAAGAATTTAGCTGCACAAGTGCAACCT
CCCCTGACCGACAAAGAATTGATGGCTATGTTCATAAACACTCTTCGGGCCCCATATTATGATAGAATGGTTGGAAGTGCTTCAACTAATTTCTCAGACGTCATAACCAT
TGGGGAGAGGATTGAATTTGGAGTGAAGAATGGAAGGATCTCTGATCCCGCTTCAGAGATAAGAAGAATGATGACCCCAAAGAAAAAAGAAGAAGAAATACATGAGTTGA
GCTCGACTCAAAGAGTAGTGCATGTATCCTCACCAACTGTGGGGCAGACAAATTACTCCTATAGTTATCAGAATGGAGAGCTTTTGCCTCAACTTTTAAAGAGTCATCAA
GTGGCTATCGTACCACAAGAGCCTTTGCAACCACCATATCCTAAGTGGTATGACCCTAACATAAAATGTGAATATCATGCTAGGGTAGTTGGACATTCTACAGAGAACTG
TTTTCCTTTGAAAACTAAAGTGCAAAGTCTAGTCAAGGCCGAATGGTTGAAATTCAAGAAGACAGAAGAAGAGTCTGACGTTAACCAGAACCCTTTCCCAAATCATGAGC
GTCCCACCATAAATATTGCTGACACATTCGCGGAAAGATACAAGAATAAGGTGTGTGATGTGACTACTTCAATGAATACACTTTTCCAAATCCTTCGTAGAGCTGGATAT
TTGTCACCAAGGTTTAACAATGATGAGGGGGAGAAGTTTAGATGCGCCAACGAGGAGCAGTATTTATTCCACCCTAAGATAGATGACCATTTCATTGAAGATTGTTGTGA
GTTTAAGAATGAGGTACAAAATTTGATGGATACAAAGATTCTCTTGGTAGGACAAATGAGCATGCAAGAAATCGAGGTCGATATGATTATTGATAAGGAAACTTCAAATG
ATACCTCAATTACGGTTATCTCCAAGAATACTATTTCGCTTAATCTATTGATTTCTCAGTTTCCACCAAAATTTGAGTTGAACAATTGGGAGATCAAGAGGACGCTAAAA
GTCTCTAAAGGATCACAAAAGTAA
Protein sequence
MDEQTNDQVQAVRQDVEGLKDQLAKILELLTTGREKSVVGISSQVEVDLNQVLEDMPAYPLGFTPQRSSSPHMTYATQNPNPITQQENHMSDPMSTLITESGKKISEEQG
SRKKLEFLEERLRAIEGADMYGSIDATQLCLISDVVIPPKFKTPDFEKYNETTCPKSHLVMYCRKMSAYAHDDKLLIYCFQDSLYNIDMTPDRLDLQRMEKKNLAAQVQP
PLTDKELMAMFINTLRAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRISDPASEIRRMMTPKKKEEEIHELSSTQRVVHVSSPTVGQTNYSYSYQNGELLPQLLKSHQ
VAIVPQEPLQPPYPKWYDPNIKCEYHARVVGHSTENCFPLKTKVQSLVKAEWLKFKKTEEESDVNQNPFPNHERPTINIADTFAERYKNKVCDVTTSMNTLFQILRRAGY
LSPRFNNDEGEKFRCANEEQYLFHPKIDDHFIEDCCEFKNEVQNLMDTKILLVGQMSMQEIEVDMIIDKETSNDTSITVISKNTISLNLLISQFPPKFELNNWEIKRTLK
VSKGSQK