; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010051 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010051
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr06:13356854..13358749
RNA-Seq ExpressionPay0010051
SyntenyPay0010051
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037703.1 uncharacterized protein E6C27_scaffold1593G00270 [Cucumis melo var. makuwa]1.5e-30088.19Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE
        MDEQTNDQVQAVRQDVEGLK QLAKILEL TTGR KSVAGISSQVEVDLNQVLEDMPAYP+GFT QRSSSPRMTYPTQNPNPITQQ DHVSDPMSTPIT+
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE

Query:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR
        SGKKISEEQ S+KRL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGP SR
Subjt:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR

Query:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT
        WYMQLDGSQVHRWKDLAD FLKQY              +MEKKNVETFKEYAQRWRELA QVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVIT
Subjt:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH
        IGE IEFGVKNGRIFDP+SEIRRMMT KKKEEEIHELSST+KVVHVSSPTVGQ NYSY+YQNG       KS      P +PLQPPYPKWYDP+VKCEYH
Subjt:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH

Query:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA
         G+VGHSTENCFPLKAKVQSLVKAGWLKFKKTEEE DVNQNPLPNHEGPAINIVDIF KRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEG KFGCA
Subjt:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA

Query:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ
        NEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKV KGSQ
Subjt:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ

Query:  K
        K
Subjt:  K

KAA0055146.1 uncharacterized protein E6C27_scaffold231G00770 [Cucumis melo var. makuwa]4.8e-29183.01Show/hide
Query:  QVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITESGKKISE
        +VQAVRQDVEGLK QLAKILELLTTGRGKSV GISSQVEVDLNQVLEDMPAYP GFT QRSSSP MTYPTQNPNPITQQ +HVSDPMSTPITESGKKISE
Subjt:  QVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITESGKKISE

Query:  EQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASRWYMQLDG
        EQGS+KRL+FLEERLR I+GADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYC+KMSAYAHD+KLLIHCFQDSLVGPASRWYMQLDG
Subjt:  EQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASRWYMQLDG

Query:  SQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVITIGERIEF
        SQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVITIGERIEF
Subjt:  SQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVITIGERIEF

Query:  GVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGSKSPF---------------------------------------
        GVKNGRI DPASEIRRMMT KKKEEEIHELSSTQ+VVHVSSP VGQTNYSYSYQNG KSPF                                       
Subjt:  GVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGSKSPF---------------------------------------

Query:  ---KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQ
           +PLQPPYPKWYDP+VKCEYH GVVGHSTENCFPLKAKVQSLVKAGWLKFKK EEESDVNQNPLPNHEGPAINIVD F +RYKNKVCDVTTSMNTLFQ
Subjt:  ---KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQ

Query:  ILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQ
        IL RAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTISPNLLV Q
Subjt:  ILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQ

Query:  CPPKFELNNWEIKRTLKVFKGSQK
         PPKFELNNWEIKRTLKV KGSQK
Subjt:  CPPKFELNNWEIKRTLKVFKGSQK

KAA0063019.1 uncharacterized protein E6C27_scaffold468G001210 [Cucumis melo var. makuwa]3.2e-24786.92Show/hide
Query:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS
        MSTPITESGKKISEEQGS+KRL+FLEERLR+IEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYC+KMSAYAHDDKLLIHCFQDS
Subjt:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS

Query:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST
        LVGPASRWYMQLDGSQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTL+ PYYDRMVGSAST
Subjt:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNG-SKSPFKPLQPPYPKWYDPSVKCEYHGGVV
        NFSDVIT+GE+IEFGVKNGRI DPASEIRRMMT KKKEEEIHELSSTQ+VVHVSSPTVGQTNYSYSYQNG +  P +PLQPPYPKWYDP+VKCEYH GVV
Subjt:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNG-SKSPFKPLQPPYPKWYDPSVKCEYHGGVV

Query:  GHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQ
         HSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNP P HEGPAINIVD F KRYKNKVCDVTTSMNTLFQIL RAGYLSPRFNND+GEK GC NEKQ
Subjt:  GHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQ

Query:  CLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQK
        CLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQ+S+QEIEVDMIIDKETSNDTS T+IS+NTISPNLLV Q PPKFELNNWEIKRTLKV KGSQK
Subjt:  CLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQK

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]7.7e-24171.41Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRM---TYPTQ----NPNPITQQADHVSDP
        MDEQTNDQVQAVRQDVEGLK QLAKILELLTTGRGKSV G SSQVEVDLNQVLEDMP YP GFT QRSSSPRM   TYPT     NPN  TQQA H ++P
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRM---TYPTQ----NPNPITQQADHVSDP

Query:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS
        +ST I E GKKISEEQGS++RL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGT+CPKSHLVMYC+KMSAYAHDDKLLIHCFQDS
Subjt:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS

Query:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST
        LVGPASRWYMQLDGSQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKEL AMFINTL+APYYDRMVGSAST
Subjt:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKV-VHVSSPTVGQTNYSYSYQNGSKSPF------------------------
        NFSDVITIGERIEFGVKNGRI DPASE RR+MT KKKE E+HELSSTQ+V   VSSP VGQTN+S SYQNG +SPF                        
Subjt:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKV-VHVSSPTVGQTNYSYSYQNGSKSPF------------------------

Query:  ------------------KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYK
                          +PLQPPYPKWYDP+ KCEYH G VGHSTENCFPLKAKVQSLVKAGWL+FKKT EE DVNQNPLPNHEGP IN VD FM+R+K
Subjt:  ------------------KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYK

Query:  NKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMII----DKETSND
        NKV DV TSM TLFQIL  AGYLSPRFNND+ EK GC N +QCLFHPE +DH IEDCCE KNEVQKLMD+KILL+GQ+S+QEIEV+MI     +++TSN+
Subjt:  NKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMII----DKETSND

Query:  TSI---TIISENTISPNLLVYQCPPK
        T+     ++      P+++ Y   PK
Subjt:  TSI---TIISENTISPNLLVYQCPPK

XP_016903339.1 PREDICTED: uncharacterized protein LOC103502838 [Cucumis melo]1.5e-30088.19Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE
        MDEQTNDQVQAVRQDVEGLK QLAKILEL TTGR KSVAGISSQVEVDLNQVLEDMPAYP+GFT QRSSSPRMTYPTQNPNPITQQ DHVSDPMSTPIT+
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE

Query:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR
        SGKKISEEQ S+KRL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGP SR
Subjt:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR

Query:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT
        WYMQLDGSQVHRWKDLAD FLKQY              +MEKKNVETFKEYAQRWRELA QVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVIT
Subjt:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH
        IGE IEFGVKNGRIFDP+SEIRRMMT KKKEEEIHELSST+KVVHVSSPTVGQ NYSY+YQNG       KS      P +PLQPPYPKWYDP+VKCEYH
Subjt:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH

Query:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA
         G+VGHSTENCFPLKAKVQSLVKAGWLKFKKTEEE DVNQNPLPNHEGPAINIVDIF KRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEG KFGCA
Subjt:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA

Query:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ
        NEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKV KGSQ
Subjt:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A1S4E534 uncharacterized protein LOC1035028387.1e-30188.19Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE
        MDEQTNDQVQAVRQDVEGLK QLAKILEL TTGR KSVAGISSQVEVDLNQVLEDMPAYP+GFT QRSSSPRMTYPTQNPNPITQQ DHVSDPMSTPIT+
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE

Query:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR
        SGKKISEEQ S+KRL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGP SR
Subjt:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR

Query:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT
        WYMQLDGSQVHRWKDLAD FLKQY              +MEKKNVETFKEYAQRWRELA QVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVIT
Subjt:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH
        IGE IEFGVKNGRIFDP+SEIRRMMT KKKEEEIHELSST+KVVHVSSPTVGQ NYSY+YQNG       KS      P +PLQPPYPKWYDP+VKCEYH
Subjt:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH

Query:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA
         G+VGHSTENCFPLKAKVQSLVKAGWLKFKKTEEE DVNQNPLPNHEGPAINIVDIF KRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEG KFGCA
Subjt:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA

Query:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ
        NEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKV KGSQ
Subjt:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ

Query:  K
        K
Subjt:  K

A0A5A7T401 Retrotrans_gag domain-containing protein7.1e-30188.19Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE
        MDEQTNDQVQAVRQDVEGLK QLAKILEL TTGR KSVAGISSQVEVDLNQVLEDMPAYP+GFT QRSSSPRMTYPTQNPNPITQQ DHVSDPMSTPIT+
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITE

Query:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR
        SGKKISEEQ S+KRL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGP SR
Subjt:  SGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASR

Query:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT
        WYMQLDGSQVHRWKDLAD FLKQY              +MEKKNVETFKEYAQRWRELA QVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVIT
Subjt:  WYMQLDGSQVHRWKDLADFFLKQY--------------KMEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVIT

Query:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH
        IGE IEFGVKNGRIFDP+SEIRRMMT KKKEEEIHELSST+KVVHVSSPTVGQ NYSY+YQNG       KS      P +PLQPPYPKWYDP+VKCEYH
Subjt:  IGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGS------KS------PFKPLQPPYPKWYDPSVKCEYH

Query:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA
         G+VGHSTENCFPLKAKVQSLVKAGWLKFKKTEEE DVNQNPLPNHEGPAINIVDIF KRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEG KFGCA
Subjt:  GGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCA

Query:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ
        NEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKV KGSQ
Subjt:  NEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQ

Query:  K
        K
Subjt:  K

A0A5A7ULI2 Retrotrans_gag domain-containing protein2.3e-29183.01Show/hide
Query:  QVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITESGKKISE
        +VQAVRQDVEGLK QLAKILELLTTGRGKSV GISSQVEVDLNQVLEDMPAYP GFT QRSSSP MTYPTQNPNPITQQ +HVSDPMSTPITESGKKISE
Subjt:  QVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITESGKKISE

Query:  EQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASRWYMQLDG
        EQGS+KRL+FLEERLR I+GADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYC+KMSAYAHD+KLLIHCFQDSLVGPASRWYMQLDG
Subjt:  EQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASRWYMQLDG

Query:  SQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVITIGERIEF
        SQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTL+APYYDRMVGSASTNFSDVITIGERIEF
Subjt:  SQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVITIGERIEF

Query:  GVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGSKSPF---------------------------------------
        GVKNGRI DPASEIRRMMT KKKEEEIHELSSTQ+VVHVSSP VGQTNYSYSYQNG KSPF                                       
Subjt:  GVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNGSKSPF---------------------------------------

Query:  ---KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQ
           +PLQPPYPKWYDP+VKCEYH GVVGHSTENCFPLKAKVQSLVKAGWLKFKK EEESDVNQNPLPNHEGPAINIVD F +RYKNKVCDVTTSMNTLFQ
Subjt:  ---KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQ

Query:  ILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQ
        IL RAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQ+S+QEIEVDMIIDKETSNDTSIT+IS+NTISPNLLV Q
Subjt:  ILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQ

Query:  CPPKFELNNWEIKRTLKVFKGSQK
         PPKFELNNWEIKRTLKV KGSQK
Subjt:  CPPKFELNNWEIKRTLKVFKGSQK

A0A5A7VAU5 Uncharacterized protein3.7e-24171.41Show/hide
Query:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRM---TYPTQ----NPNPITQQADHVSDP
        MDEQTNDQVQAVRQDVEGLK QLAKILELLTTGRGKSV G SSQVEVDLNQVLEDMP YP GFT QRSSSPRM   TYPT     NPN  TQQA H ++P
Subjt:  MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRM---TYPTQ----NPNPITQQADHVSDP

Query:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS
        +ST I E GKKISEEQGS++RL+FLEERLRVIEGADMY            +VVIPPKFKTPDFEKYNGT+CPKSHLVMYC+KMSAYAHDDKLLIHCFQDS
Subjt:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS

Query:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST
        LVGPASRWYMQLDGSQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKEL AMFINTL+APYYDRMVGSAST
Subjt:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKV-VHVSSPTVGQTNYSYSYQNGSKSPF------------------------
        NFSDVITIGERIEFGVKNGRI DPASE RR+MT KKKE E+HELSSTQ+V   VSSP VGQTN+S SYQNG +SPF                        
Subjt:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKV-VHVSSPTVGQTNYSYSYQNGSKSPF------------------------

Query:  ------------------KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYK
                          +PLQPPYPKWYDP+ KCEYH G VGHSTENCFPLKAKVQSLVKAGWL+FKKT EE DVNQNPLPNHEGP IN VD FM+R+K
Subjt:  ------------------KPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYK

Query:  NKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMII----DKETSND
        NKV DV TSM TLFQIL  AGYLSPRFNND+ EK GC N +QCLFHPE +DH IEDCCE KNEVQKLMD+KILL+GQ+S+QEIEV+MI     +++TSN+
Subjt:  NKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMII----DKETSND

Query:  TSI---TIISENTISPNLLVYQCPPK
        T+     ++      P+++ Y   PK
Subjt:  TSI---TIISENTISPNLLVYQCPPK

A0A5A7VBN6 Retrotrans_gag domain-containing protein1.6e-24786.92Show/hide
Query:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS
        MSTPITESGKKISEEQGS+KRL+FLEERLR+IEGADMY            +VVIPPKFKTPDFEKYNGTTCPKSHLVMYC+KMSAYAHDDKLLIHCFQDS
Subjt:  MSTPITESGKKISEEQGSKKRLKFLEERLRVIEGADMY-----------ENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDS

Query:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST
        LVGPASRWYMQLDGSQVHRWKDLAD FLKQYK              MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTL+ PYYDRMVGSAST
Subjt:  LVGPASRWYMQLDGSQVHRWKDLADFFLKQYK--------------MEKKNVETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSAST

Query:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNG-SKSPFKPLQPPYPKWYDPSVKCEYHGGVV
        NFSDVIT+GE+IEFGVKNGRI DPASEIRRMMT KKKEEEIHELSSTQ+VVHVSSPTVGQTNYSYSYQNG +  P +PLQPPYPKWYDP+VKCEYH GVV
Subjt:  NFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTNYSYSYQNG-SKSPFKPLQPPYPKWYDPSVKCEYHGGVV

Query:  GHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQ
         HSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNP P HEGPAINIVD F KRYKNKVCDVTTSMNTLFQIL RAGYLSPRFNND+GEK GC NEKQ
Subjt:  GHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQILSRAGYLSPRFNNDEGEKFGCANEKQ

Query:  CLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQK
        CLFHPEIDDHFIEDCCE KNEVQKLMDAKILLVGQ+S+QEIEVDMIIDKETSNDTS T+IS+NTISPNLLV Q PPKFELNNWEIKRTLKV KGSQK
Subjt:  CLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNWEIKRTLKVFKGSQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAGCAAACCAATGATCAGGTTCAAGCAGTTCGTCAAGACGTTGAAGGATTGAAATATCAATTGGCTAAGATCTTAGAACTGCTCACTACTGGAAGAGGAAAGAG
TGTTGCGGGGATTTCATCACAAGTGGAAGTAGATCTGAACCAGGTGCTAGAAGACATGCCTGCATACCCTCTAGGTTTTACTCTGCAAAGGTCGTCTAGTCCTCGCATGA
CTTATCCTACACAAAATCCTAATCCGATCACCCAACAGGCAGACCATGTGAGCGATCCAATGTCTACTCCAATTACAGAAAGTGGTAAGAAAATTTCAGAAGAGCAGGGT
AGTAAAAAAAGACTGAAATTTCTAGAAGAAAGATTACGCGTCATTGAAGGTGCAGACATGTATGAGAATGTGGTGATCCCTCCCAAATTCAAGACTCCAGATTTTGAGAA
GTATAATGGAACCACGTGCCCAAAAAGTCATCTAGTTATGTATTGTCAGAAAATGTCAGCTTACGCTCATGATGATAAATTGTTGATTCATTGCTTCCAAGACAGTTTAG
TCGGCCCGGCTTCTCGCTGGTACATGCAATTGGATGGTTCACAAGTGCATAGGTGGAAGGATCTTGCTGATTTTTTTTTAAAACAATATAAAATGGAAAAAAAGAATGTT
GAAACTTTTAAGGAATATGCACAGCGATGGAGAGAGTTAGCTGCACAAGTGCAACCTCCCCTGACCGACAAAGAATTGATGGCTATGTTCATAAACACTCTTCAGGCCCC
ATATTATGATAGAATGGTTGGAAGTGCTTCAACTAATTTTTCAGACGTCATAACCATTGGGGAGAGGATTGAATTTGGAGTGAAGAATGGAAGGATCTTTGATCCCGCTT
CAGAGATAAGAAGAATGATGACCCTAAAGAAAAAAGAAGAAGAAATACATGAGTTGAGCTCGACTCAAAAAGTAGTGCATGTATCTTCACCAACTGTGGGGCAGACAAAT
TACTCCTATAGTTATCAGAATGGAAGTAAAAGCCCATTCAAGCCTTTGCAACCACCATATCCTAAGTGGTATGACCCCAGCGTAAAATGTGAATATCATGGTGGGGTAGT
TGGACATTCTACAGAGAACTGTTTTCCTTTGAAAGCTAAAGTGCAAAGTCTAGTCAAGGCTGGATGGTTGAAATTCAAGAAGACAGAAGAAGAGTCTGACGTTAACCAAA
ACCCTTTACCAAATCATGAGGGTCCCGCCATAAATATTGTTGACATATTCATGAAAAGATACAAGAATAAGGTGTGTGATGTGACTACTTCAATGAATACACTTTTCCAA
ATCCTTAGTAGAGCTGGATATTTGTCACCAAGGTTTAACAATGATGAGGGGGAGAAGTTTGGATGCGCCAACGAGAAGCAGTGTTTATTCCACCCTGAGATAGATGACCA
TTTCATTGAAGATTGTTGTGAGCTCAAGAATGAGGTACAAAAGTTGATGGATGCAAAGATTCTCTTGGTAGGACAAGTGAGCATACAAGAAATCGAGGTCGATATGATTA
TTGACAAGGAAACTTCAAATGATACCTCAATTACGATTATCTCCGAGAATACTATTTCTCCTAATCTATTGGTTTATCAGTGTCCACCAAAATTTGAGTTGAACAATTGG
GAGATCAAGAGGACGTTAAAAGTTTTTAAAGGATCACAAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAGCAAACCAATGATCAGGTTCAAGCAGTTCGTCAAGACGTTGAAGGATTGAAATATCAATTGGCTAAGATCTTAGAACTGCTCACTACTGGAAGAGGAAAGAG
TGTTGCGGGGATTTCATCACAAGTGGAAGTAGATCTGAACCAGGTGCTAGAAGACATGCCTGCATACCCTCTAGGTTTTACTCTGCAAAGGTCGTCTAGTCCTCGCATGA
CTTATCCTACACAAAATCCTAATCCGATCACCCAACAGGCAGACCATGTGAGCGATCCAATGTCTACTCCAATTACAGAAAGTGGTAAGAAAATTTCAGAAGAGCAGGGT
AGTAAAAAAAGACTGAAATTTCTAGAAGAAAGATTACGCGTCATTGAAGGTGCAGACATGTATGAGAATGTGGTGATCCCTCCCAAATTCAAGACTCCAGATTTTGAGAA
GTATAATGGAACCACGTGCCCAAAAAGTCATCTAGTTATGTATTGTCAGAAAATGTCAGCTTACGCTCATGATGATAAATTGTTGATTCATTGCTTCCAAGACAGTTTAG
TCGGCCCGGCTTCTCGCTGGTACATGCAATTGGATGGTTCACAAGTGCATAGGTGGAAGGATCTTGCTGATTTTTTTTTAAAACAATATAAAATGGAAAAAAAGAATGTT
GAAACTTTTAAGGAATATGCACAGCGATGGAGAGAGTTAGCTGCACAAGTGCAACCTCCCCTGACCGACAAAGAATTGATGGCTATGTTCATAAACACTCTTCAGGCCCC
ATATTATGATAGAATGGTTGGAAGTGCTTCAACTAATTTTTCAGACGTCATAACCATTGGGGAGAGGATTGAATTTGGAGTGAAGAATGGAAGGATCTTTGATCCCGCTT
CAGAGATAAGAAGAATGATGACCCTAAAGAAAAAAGAAGAAGAAATACATGAGTTGAGCTCGACTCAAAAAGTAGTGCATGTATCTTCACCAACTGTGGGGCAGACAAAT
TACTCCTATAGTTATCAGAATGGAAGTAAAAGCCCATTCAAGCCTTTGCAACCACCATATCCTAAGTGGTATGACCCCAGCGTAAAATGTGAATATCATGGTGGGGTAGT
TGGACATTCTACAGAGAACTGTTTTCCTTTGAAAGCTAAAGTGCAAAGTCTAGTCAAGGCTGGATGGTTGAAATTCAAGAAGACAGAAGAAGAGTCTGACGTTAACCAAA
ACCCTTTACCAAATCATGAGGGTCCCGCCATAAATATTGTTGACATATTCATGAAAAGATACAAGAATAAGGTGTGTGATGTGACTACTTCAATGAATACACTTTTCCAA
ATCCTTAGTAGAGCTGGATATTTGTCACCAAGGTTTAACAATGATGAGGGGGAGAAGTTTGGATGCGCCAACGAGAAGCAGTGTTTATTCCACCCTGAGATAGATGACCA
TTTCATTGAAGATTGTTGTGAGCTCAAGAATGAGGTACAAAAGTTGATGGATGCAAAGATTCTCTTGGTAGGACAAGTGAGCATACAAGAAATCGAGGTCGATATGATTA
TTGACAAGGAAACTTCAAATGATACCTCAATTACGATTATCTCCGAGAATACTATTTCTCCTAATCTATTGGTTTATCAGTGTCCACCAAAATTTGAGTTGAACAATTGG
GAGATCAAGAGGACGTTAAAAGTTTTTAAAGGATCACAAAAGTAA
Protein sequenceShow/hide protein sequence
MDEQTNDQVQAVRQDVEGLKYQLAKILELLTTGRGKSVAGISSQVEVDLNQVLEDMPAYPLGFTLQRSSSPRMTYPTQNPNPITQQADHVSDPMSTPITESGKKISEEQG
SKKRLKFLEERLRVIEGADMYENVVIPPKFKTPDFEKYNGTTCPKSHLVMYCQKMSAYAHDDKLLIHCFQDSLVGPASRWYMQLDGSQVHRWKDLADFFLKQYKMEKKNV
ETFKEYAQRWRELAAQVQPPLTDKELMAMFINTLQAPYYDRMVGSASTNFSDVITIGERIEFGVKNGRIFDPASEIRRMMTLKKKEEEIHELSSTQKVVHVSSPTVGQTN
YSYSYQNGSKSPFKPLQPPYPKWYDPSVKCEYHGGVVGHSTENCFPLKAKVQSLVKAGWLKFKKTEEESDVNQNPLPNHEGPAINIVDIFMKRYKNKVCDVTTSMNTLFQ
ILSRAGYLSPRFNNDEGEKFGCANEKQCLFHPEIDDHFIEDCCELKNEVQKLMDAKILLVGQVSIQEIEVDMIIDKETSNDTSITIISENTISPNLLVYQCPPKFELNNW
EIKRTLKVFKGSQK