CuGenDBv2

CaUC02G043640 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G043640
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: WLM domain-containing protein
Genome location: Ciama_Chr02:31334695..31337045
RNA-Seq Expression: CaUC02G043640
Synteny: CaUC02G043640
Gene Ontology terms: NA
InterPro domains:
IPR013536 - WLM domain
IPR018997 - PUB domain
IPR036339 - PUB-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570558.1 hypothetical protein SDJN03_29473, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.0e-211; % identity: 68.49)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+ +N+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPPASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+ P+GYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYES+KKTLLHELAHM Y EHDANFYALDKQ   EAA LDWTRSK HTL GIKYSQYHEE+ DVED FGV QKLGGS SHQLVNA +ASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         +TNT+DFSSRVS VS   DPNSSNYQ KLEPDP+D AYQKKLEPDPDDS N+QNML+ D DN+SNY++K EPDPDD+IG K +ES+  PRF RS VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+ +VQ VP+TNSRLLE T+LYGEPDPDDMGSS N KIT  +HFS GMQ LDCN  QRMVVEP+PDDLGEK NTLGCGNATGHDE DCLEAGLV +QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK
         LS +C+K          D DESLV + DLSKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAI KLLAE+KPSESSAV QTLFKIVKNVIEHPDE KYRK
Subjt:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK

Query:  LRK
        LRK
Subjt:  LRK

XP_008456391.1 PREDICTED: uncharacterized protein LOC103496343 [Cucumis melo] (e value: 8.4e-211; % identity: 69.9)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+SKN+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNP ASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+APIGYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYESIKKTLLHELAHM + EHDANFYALDKQ   EAAALDWTRSK HTLTG+KYSQYHEE+ DVED FGV QKLGGSMSHQLVNA AASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         MTNTSD+SS V  VSA  +PNSSN+QNKLEPDPDDSAY  KL+PD D +SNDQNML +DS+NSSN+KSKLEP  DDSIGSKNLESECEPRFI+S VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+TEV  V ATNSRLLE T+LYGEPD DDMGSSSNSK+  TDHFSQGMQ LDCN  QRMVVE +PD LGEK NTLG G ATGH+E DCLEAGLVTNQ+
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
         LS NC+K DT             DE LV +VD SKM +D+ DPDDQEI+RIQDSVSVVCNRLREAI KLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
Subjt:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM

Query:  KYRKLRKA
        KYRKLRKA
Subjt:  KYRKLRKA

XP_022944497.1 uncharacterized protein LOC111448938 [Cucurbita moschata] (e value: 3.2e-210; % identity: 68.33)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+ +N+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPPASEALKRM MLA DPGIVAIMNK        H WRVGIMTE+ P+GYVG+SPKCILG NK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYES+KKTLLHELAHM YFEHDANFYALDKQ   EAA LDWTRSK HTL GIKYSQYHEE+ DVED FGV QKLGGS SHQLVNA +ASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         +TNT+DFSSRVS VS   DPNSS YQ KLEPDP+D AYQKKLEPDPDDSSN+QNML+ D DN+SNY++K EPDPDD+IG K +ES+  PRF RS VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+ +VQ VP+TNSRLLE T+LYGEPDPDDMGSS N KIT  +HFS GMQ LDCN  QRMVVEP+PDDLGEK NTLGCGNATGHDE DCLEAGLV +QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK
         LS +C+K          D DESLV + DLSKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAI KLLAE+KPSESSAV QTLFKIVKNVIEHPDE KYRK
Subjt:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK

Query:  LRK
        LRK
Subjt:  LRK

XP_022985928.1 uncharacterized protein LOC111483826 [Cucurbita maxima] (e value: 4.3e-215; % identity: 69.65)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+ +N+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         L PPASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+AP+GYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYES+KKTLLHELAHM Y EHDANFY+LDKQ   EAA LDWTRSK HTL GIKYSQYHEE+ DVED FGV QKLGG  SHQLVNA +ASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
          TNT+DFSSRVS VS   DPNSSNYQ KLEPDPDDSAYQKKLEPDPDD+SN+QNMLE D DNSSNY+SKLEPDPDD+IG K +ES+  PRF R  VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        +LS+ EVQ VPATNSRL + T+L+GEPDPDDMGSSSNSKIT T+HFSQGMQ LDCN FQRMVVEP+PDDLGEK NTLGCGNATGHDE DCLEAGLV +QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK
         LS +C+K          D DESL  +VDLSKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAIAKLLAE+KPSESSAV QTLFKIVKNVIEHPDE KYRK
Subjt:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK

Query:  LRK
        LRK
Subjt:  LRK

XP_038901675.1 uncharacterized protein LOC120088443 [Benincasa hispida] (e value: 2.8e-214; % identity: 69.06)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+SKN+VD VLNNA K ERI                   G+LKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPPASEALKRMHMLA DPGI+AIMNK        H WRVGIMTE+AP+GYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYESIKKTLLHELAHM Y EHDANF+ALDKQ   EAAALDWTRSK HTLTGIKYSQYHEE+ DVED F V QKLGGS+SHQLVNA AASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         +TNTS+FSSRVSQVSA             E DPDDSAY KKLEPDPDDSSNDQ+MLE+DSDNSSNYK+KLEPD DDSIGSK+LESE EPRFI+S VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+TEVQ VP TNSRLLE T+LYGEPDPDD+GSSSNSKIT TDHFSQGM  LDCNIFQRMVVEP PD+L EKANTLGCGNA GHDE DCLEAGLVT+QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQKDDTDE--------------------SLVEVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVI
         LS NC+K DT +                    S+ +VD SKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAIA+LLAEVKPSESSAV+QTLFKIVKNVI
Subjt:  LLSTNCQKDDTDE--------------------SLVEVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVI

Query:  EHPDEMKYRKLRKA
        EHPDEMKYRKLRKA
Subjt:  EHPDEMKYRKLRKA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KF59 WLM domain-containing protein (e value: 7.2e-208; % identity: 69.08)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+SKN+VD +LNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPPASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+APIGYVG++PKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYESIKKTLLHELAHM + EHDANFYALDKQ   EAAALDWTRSK HTLTG+ YSQYHEEN DVEDDFGV QKLGGSMSHQLVNA AASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         MTN SD SS V QVSA  +PNSS +QNKLEPDPDDS Y  KLEPDPD SSNDQNML +DS+NS N+K KLEP PDDSIGS+NLESE EPR I+S VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+TEV  VPATNSRLLE T+ YGEPD DD GSSSNSK+  TDH SQGMQ LDCNIFQRM+VEP+PD LGEK NTL  G A GH+E DCLEAGLV NQ+
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
         LS NC+K DT             DESLV +VD SKM +D+ DPDDQEI+RIQDSVSVVCNRLREAI KLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
Subjt:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM

Query:  KYRKLRKA
        KYRKLRKA
Subjt:  KYRKLRKA

A0A1S3C3T6 uncharacterized protein LOC103496343 (e value: 4.1e-211; % identity: 69.9)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+SKN+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNP ASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+APIGYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYESIKKTLLHELAHM + EHDANFYALDKQ   EAAALDWTRSK HTLTG+KYSQYHEE+ DVED FGV QKLGGSMSHQLVNA AASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         MTNTSD+SS V  VSA  +PNSSN+QNKLEPDPDDSAY  KL+PD D +SNDQNML +DS+NSSN+KSKLEP  DDSIGSKNLESECEPRFI+S VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+TEV  V ATNSRLLE T+LYGEPD DDMGSSSNSK+  TDHFSQGMQ LDCN  QRMVVE +PD LGEK NTLG G ATGH+E DCLEAGLVTNQ+
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
         LS NC+K DT             DE LV +VD SKM +D+ DPDDQEI+RIQDSVSVVCNRLREAI KLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
Subjt:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM

Query:  KYRKLRKA
        KYRKLRKA
Subjt:  KYRKLRKA

A0A5A7UHQ6 Putative Ubiquitin and WLM domain-containing protein (e value: 4.1e-211; % identity: 69.9)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+SKN+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNP ASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+APIGYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYESIKKTLLHELAHM + EHDANFYALDKQ   EAAALDWTRSK HTLTG+KYSQYHEE+ DVED FGV QKLGGSMSHQLVNA AASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         MTNTSD+SS V  VSA  +PNSSN+QNKLEPDPDDSAY  KL+PD D +SNDQNML +DS+NSSN+KSKLEP  DDSIGSKNLESECEPRFI+S VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+TEV  V ATNSRLLE T+LYGEPD DDMGSSSNSK+  TDHFSQGMQ LDCN  QRMVVE +PD LGEK NTLG G ATGH+E DCLEAGLVTNQ+
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
         LS NC+K DT             DE LV +VD SKM +D+ DPDDQEI+RIQDSVSVVCNRLREAI KLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM
Subjt:  LLSTNCQKDDT-------------DESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEM

Query:  KYRKLRKA
        KYRKLRKA
Subjt:  KYRKLRKA

A0A6J1FVU2 uncharacterized protein LOC111448938 (e value: 1.6e-210; % identity: 68.33)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+ +N+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPPASEALKRM MLA DPGIVAIMNK        H WRVGIMTE+ P+GYVG+SPKCILG NK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYES+KKTLLHELAHM YFEHDANFYALDKQ   EAA LDWTRSK HTL GIKYSQYHEE+ DVED FGV QKLGGS SHQLVNA +ASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
         +TNT+DFSSRVS VS   DPNSS YQ KLEPDP+D AYQKKLEPDPDDSSN+QNML+ D DN+SNY++K EPDPDD+IG K +ES+  PRF RS VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        DLS+ +VQ VP+TNSRLLE T+LYGEPDPDDMGSS N KIT  +HFS GMQ LDCN  QRMVVEP+PDDLGEK NTLGCGNATGHDE DCLEAGLV +QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK
         LS +C+K          D DESLV + DLSKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAI KLLAE+KPSESSAV QTLFKIVKNVIEHPDE KYRK
Subjt:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK

Query:  LRK
        LRK
Subjt:  LRK

A0A6J1JF11 uncharacterized protein LOC111483826 (e value: 2.1e-215; % identity: 69.65)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+ +N+VD VLNNAKK ERI                   GVLKL EGPYVFCEF TLQIPGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         L PPASEALKRMHMLA DPGIVAIMNK        H WRVGIMTE+AP+GYVG+SPKCILGFNK                     NHGEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH
        LKGFRKYES+KKTLLHELAHM Y EHDANFY+LDKQ   EAA LDWTRSK HTL GIKYSQYHEE+ DVED FGV QKLGG  SHQLVNA +ASVAAAYH
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYH

Query:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT
          TNT+DFSSRVS VS   DPNSSNYQ KLEPDPDDSAYQKKLEPDPDD+SN+QNMLE D DNSSNY+SKLEPDPDD+IG K +ES+  PRF R  VVQT
Subjt:  CMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQT

Query:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT
        +LS+ EVQ VPATNSRL + T+L+GEPDPDDMGSSSNSKIT T+HFSQGMQ LDCN FQRMVVEP+PDDLGEK NTLGCGNATGHDE DCLEAGLV +QT
Subjt:  DLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQT

Query:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK
         LS +C+K          D DESL  +VDLSKMP+DEPDPDDQEI+RIQDSVSVVCNRLREAIAKLLAE+KPSESSAV QTLFKIVKNVIEHPDE KYRK
Subjt:  LLSTNCQK---------DDTDESLV-EVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRK

Query:  LRK
        LRK
Subjt:  LRK

SwissProt top hits (e value, % identity, alignment)
O94580 DNA-dependent metalloprotease WSS1 homolog 2 (e value: 5.1e-09; % identity: 31.69)
Query:  PPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDDLKG
        P    AL+ +  L  D GI  IM+         H W V +++E+ P  +                      ++ D  TL  + N G  I L+LR D   G
Subjt:  PPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDDLKG

Query:  FRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALD
        FR Y+++K TL+HEL H  + EHD++F+ L +Q   EA A D
Subjt:  FRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALD

Arabidopsis top hits (e value, % identity, alignment)
AT5G35690.1 CONTAINS InterPro DOMAIN/s: WLM (InterPro:IPR013536), PUB domain (InterPro:IPR018997), PUG domain (InterPro:IPR006567) (e value: 2.7e-82; % identity: 37.88)
Query:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP
        MG+S+ +V+GVL  A    RI                     +KL +G Y+F +F TLQ+PGIE                                    
Subjt:  MGISKNDVDGVLNNAKKKERI-------------------GVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINP

Query:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD
         LNPP S ALKRMHMLA DPGI+A+MNK        H WRVGIMTE+AP+GYVG+SP+C+LGFNK                     N GEEISL+LR DD
Subjt:  HLNPPASEALKRMHMLAVDPGIVAIMNKVKESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDD

Query:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKY-SQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAY
        LKGFRKY+SIKKTLLHELAHM Y EHD  FYALD Q   EA +LDWT+S+ HTL G K+ +   EE+   +++  V Q+LGG+ S  L NA  +SVAAAY
Subjt:  LKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQ---EAAALDWTRSKCHTLTGIKY-SQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAY

Query:  HCMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQ
          +++TS     VS++S              EPDPDD                D+N   +     S+  +K EPDPDD+                     
Subjt:  HCMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSNDQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQ

Query:  TDLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQ
         D + TE        S L   T+   EPDPDD  S +++ I   ++      T+                       + CGN         L A   T  
Subjt:  TDLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMVVEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQ

Query:  TLLSTNCQKDDTDESLVEVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRKLRK
        T    N +       +V    + M +DEPDPDDQEI+RIQDSV+++ NRL++AI  L  EV P +++ V+Q L KIV+N+IE P+EMK+++LRK
Subjt:  TLLSTNCQKDDTDESLVEVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTLFKIVKNVIEHPDEMKYRKLRK


Sequences
CDS sequence
ATGGGAATCTCTAAGAATGATGTAGACGGAGTTTTGAACAATGCTAAGAAAAAAGAACGAATTGGCGTGCTGAAACTACTAGAAGGGCCCTATGTATTTTGTGAATTTTG
GACACTTCAAATTCCAGGAATTGAGCAGGATTGCATTTTTCAGGGCTTATTGCGCTTTAATCTGGATACAGATTTGCTCTTAAATATTGTTTTTGAACTATTTCATTTAG
CTGTTAAGAATTACATTAATCCACATCTAAACCCTCCAGCTTCAGAAGCTTTGAAAAGAATGCATATGCTTGCTGTTGACCCTGGTATTGTCGCAATCATGAACAAGGTA
AAGGAATCTCTAAATCTGGATCACTGTTGGCGTGTGGGAATTATGACTGAGATTGCCCCTATTGGCTACGTTGGTTTGAGCCCTAAATGTATTCTTGGCTTTAATAAGTG
CATGGAAAACTTAAGTATATTATTCTCCAAAGTTGATTTTTTGACTTTAGTTTCTGATCAGAACCATGGAGAGGAGATATCCCTGCAACTTCGTATAGATGACCTGAAGG
GCTTCCGAAAATATGAAAGTATTAAGAAAACATTACTCCACGAACTTGCACACATGAATTATTTTGAGCACGACGCCAACTTTTATGCTTTGGACAAGCAGGAGGCTGCT
GCTTTAGATTGGACAAGATCAAAATGCCACACGTTGACTGGAATTAAGTATTCCCAATATCATGAAGAAAACGTTGATGTTGAAGATGATTTCGGTGTCCCACAGAAGCT
TGGTGGAAGTATGTCGCATCAGCTGGTTAATGCCTGTGCTGCTTCAGTTGCCGCTGCTTATCACTGTATGACAAATACTTCAGATTTCAGTTCAAGAGTATCTCAAGTAA
GTGCGGTGTTGGATCCCAATAGTTCTAATTACCAAAACAAACTGGAGCCTGATCCTGATGACAGTGCTTATCAAAAGAAGCTCGAGCCTGACCCTGATGACAGCTCCAAT
GATCAAAACATGCTTGAGATTGATTCTGACAACAGTTCTAATTATAAAAGCAAGCTCGAACCTGATCCTGATGATTCGATAGGGAGCAAAAATTTAGAATCTGAATGTGA
ACCAAGATTCATTAGGAGCCGAGTAGTTCAAACAGATTTGAGCAACACAGAAGTACAGGCTGTGCCTGCTACGAACAGCAGATTGTTGGAAGTCACAAGGTTGTATGGAG
AACCAGATCCCGATGATATGGGAAGCTCTTCTAACAGTAAAATAACTGTCACAGACCATTTCTCTCAAGGAATGCAGACCCTGGATTGCAACATTTTCCAAAGAATGGTT
GTTGAACCTAATCCAGATGACTTGGGAGAGAAAGCGAACACGTTGGGGTGTGGAAATGCCACTGGACATGACGAGGGCGATTGTTTAGAAGCTGGCTTAGTGACAAACCA
AACCCTTTTAAGCACAAATTGTCAAAAGGATGATACTGATGAAAGTTTGGTAGAAGTGGATTTATCTAAAATGCCTATTGACGAGCCTGATCCTGATGACCAAGAAATTC
GAAGAATACAAGACTCTGTTTCTGTTGTTTGCAATCGATTACGTGAGGCTATTGCAAAGCTGCTGGCTGAAGTTAAACCTTCTGAATCTTCAGCAGTTGTTCAAACTCTG
TTCAAGATTGTTAAGAATGTAATCGAACACCCTGATGAAATGAAATACAGAAAGCTTCGCAAGGCAAGACAATAA
mRNA sequence
ATGGGAATCTCTAAGAATGATGTAGACGGAGTTTTGAACAATGCTAAGAAAAAAGAACGAATTGGCGTGCTGAAACTACTAGAAGGGCCCTATGTATTTTGTGAATTTTG
GACACTTCAAATTCCAGGAATTGAGCAGGATTGCATTTTTCAGGGCTTATTGCGCTTTAATCTGGATACAGATTTGCTCTTAAATATTGTTTTTGAACTATTTCATTTAG
CTGTTAAGAATTACATTAATCCACATCTAAACCCTCCAGCTTCAGAAGCTTTGAAAAGAATGCATATGCTTGCTGTTGACCCTGGTATTGTCGCAATCATGAACAAGGTA
AAGGAATCTCTAAATCTGGATCACTGTTGGCGTGTGGGAATTATGACTGAGATTGCCCCTATTGGCTACGTTGGTTTGAGCCCTAAATGTATTCTTGGCTTTAATAAGTG
CATGGAAAACTTAAGTATATTATTCTCCAAAGTTGATTTTTTGACTTTAGTTTCTGATCAGAACCATGGAGAGGAGATATCCCTGCAACTTCGTATAGATGACCTGAAGG
GCTTCCGAAAATATGAAAGTATTAAGAAAACATTACTCCACGAACTTGCACACATGAATTATTTTGAGCACGACGCCAACTTTTATGCTTTGGACAAGCAGGAGGCTGCT
GCTTTAGATTGGACAAGATCAAAATGCCACACGTTGACTGGAATTAAGTATTCCCAATATCATGAAGAAAACGTTGATGTTGAAGATGATTTCGGTGTCCCACAGAAGCT
TGGTGGAAGTATGTCGCATCAGCTGGTTAATGCCTGTGCTGCTTCAGTTGCCGCTGCTTATCACTGTATGACAAATACTTCAGATTTCAGTTCAAGAGTATCTCAAGTAA
GTGCGGTGTTGGATCCCAATAGTTCTAATTACCAAAACAAACTGGAGCCTGATCCTGATGACAGTGCTTATCAAAAGAAGCTCGAGCCTGACCCTGATGACAGCTCCAAT
GATCAAAACATGCTTGAGATTGATTCTGACAACAGTTCTAATTATAAAAGCAAGCTCGAACCTGATCCTGATGATTCGATAGGGAGCAAAAATTTAGAATCTGAATGTGA
ACCAAGATTCATTAGGAGCCGAGTAGTTCAAACAGATTTGAGCAACACAGAAGTACAGGCTGTGCCTGCTACGAACAGCAGATTGTTGGAAGTCACAAGGTTGTATGGAG
AACCAGATCCCGATGATATGGGAAGCTCTTCTAACAGTAAAATAACTGTCACAGACCATTTCTCTCAAGGAATGCAGACCCTGGATTGCAACATTTTCCAAAGAATGGTT
GTTGAACCTAATCCAGATGACTTGGGAGAGAAAGCGAACACGTTGGGGTGTGGAAATGCCACTGGACATGACGAGGGCGATTGTTTAGAAGCTGGCTTAGTGACAAACCA
AACCCTTTTAAGCACAAATTGTCAAAAGGATGATACTGATGAAAGTTTGGTAGAAGTGGATTTATCTAAAATGCCTATTGACGAGCCTGATCCTGATGACCAAGAAATTC
GAAGAATACAAGACTCTGTTTCTGTTGTTTGCAATCGATTACGTGAGGCTATTGCAAAGCTGCTGGCTGAAGTTAAACCTTCTGAATCTTCAGCAGTTGTTCAAACTCTG
TTCAAGATTGTTAAGAATGTAATCGAACACCCTGATGAAATGAAATACAGAAAGCTTCGCAAGGCAAGACAATAA
Protein sequence
MGISKNDVDGVLNNAKKKERIGVLKLLEGPYVFCEFWTLQIPGIEQDCIFQGLLRFNLDTDLLLNIVFELFHLAVKNYINPHLNPPASEALKRMHMLAVDPGIVAIMNKV
KESLNLDHCWRVGIMTEIAPIGYVGLSPKCILGFNKCMENLSILFSKVDFLTLVSDQNHGEEISLQLRIDDLKGFRKYESIKKTLLHELAHMNYFEHDANFYALDKQEAA
ALDWTRSKCHTLTGIKYSQYHEENVDVEDDFGVPQKLGGSMSHQLVNACAASVAAAYHCMTNTSDFSSRVSQVSAVLDPNSSNYQNKLEPDPDDSAYQKKLEPDPDDSSN
DQNMLEIDSDNSSNYKSKLEPDPDDSIGSKNLESECEPRFIRSRVVQTDLSNTEVQAVPATNSRLLEVTRLYGEPDPDDMGSSSNSKITVTDHFSQGMQTLDCNIFQRMV
VEPNPDDLGEKANTLGCGNATGHDEGDCLEAGLVTNQTLLSTNCQKDDTDESLVEVDLSKMPIDEPDPDDQEIRRIQDSVSVVCNRLREAIAKLLAEVKPSESSAVVQTL
FKIVKNVIEHPDEMKYRKLRKARQ
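
The CDS and protein sequences above can be checked against each other by translating codons. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. It uses only the first 30 and last 15 nucleotides of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGGAATCTCTAAGAATGATGTAGACGGA"  # first 30 nt of the CDS
cds_end = "AAGGCAAGACAATAA"                  # last 15 nt of the CDS

print(translate(cds_start))  # MGISKNDVDG
print(translate(cds_end))    # KARQ
```

Both fragments agree with the start (`MGISKNDVDG`) and end (`KARQ`) of the protein sequence given in this record.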