; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC06G117600 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC06G117600
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionMyb-like protein X
Genome locationCicolChr06:10409931..10424867
RNA-Seq ExpressionCcUC06G117600
SyntenyCcUC06G117600
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033345.1 myb-like protein X [Cucumis melo var. makuwa]3.6e-21487.37Show/hide
Query:  LKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYV
        L+ EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGEE VRV+HN+VQEEKHAGQFDSY+
Subjt:  LKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYV

Query:  GNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT
        GNKISQN+F SKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMV Q+GR SETL E K+QSK KKIDE KYNGQG+RHEERF G+SAVPSS TT
Subjt:  GNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT

Query:  ----ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINR
            ATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEK     K KE VEEKNAMVDKTKEIN+
Subjt:  ----ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINR

Query:  DNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGII
        DNLLGRHSTNTNTSQLPDSNIGAAVEEN+IKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCLTSTL PSERQAVSNDLILV KERKINGII
Subjt:  DNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGII

Query:  EAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY
        +AHH PAS KHKS GQ DHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLF  ++SL MKPKLEA +A EMP VWA+AMQI+SADV ALPY
Subjt:  EAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY

TYK21562.1 myb-like protein X [Cucumis melo var. makuwa]3.6e-21487.7Show/hide
Query:  EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYVGNK
        EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGEE VRV+HN+VQEEKHAGQFDSY+GNK
Subjt:  EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYVGNK

Query:  ISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT---
        ISQN+F SKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMV Q+GR SETL E K+QSK KKIDE KYNGQG+RHEERF G+SAVPSS TT   
Subjt:  ISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT---

Query:  -ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINRDNL
         ATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEK     K KE VEEKNAMVDKTKEIN+DNL
Subjt:  -ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINRDNL

Query:  LGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGIIEAH
        LGRHSTNTNTSQLPDSNIGAAVEEN+IKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCLTSTL PSERQAVSNDLILV KERKINGII+AH
Subjt:  LGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGIIEAH

Query:  HTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY
        H PAS KHKS GQ DHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLF  ++SL MKPKLEA +A EMP VWA+AMQI+SADV ALPY
Subjt:  HTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY

XP_004139013.2 glutamic acid-rich protein [Cucumis sativus]4.7e-23087.22Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE

Query:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY
        E VRV+HNSVQEEKHAGQFDSYVGNKISQNAF SKETK+TKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQ+GR SETL E K+QSK KKIDE KY
Subjt:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY

Query:  NGQGIRHEERFSGNSAVPSS----------VTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKK
        NGQG+RHEERFSG+SAVPSS            TATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEE+RSKHKDKDKEKKGRSNDK+ +KEKK
Subjt:  NGQGIRHEERFSGNSAVPSS----------VTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKK

Query:  KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCL
        KEK     K KE VEEKNAMVDKTKEIN+DNLLGRHSTNTNTSQLPDSNI AAVEENLIKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCL
Subjt:  KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCL

Query:  TSTLTPSERQAVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKL
         STL PSERQAVSNDLILVSKERKINGIIEAHH PASSKH+S GQ DHPQP A+HKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFS ++SL MKPKL
Subjt:  TSTLTPSERQAVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKL

Query:  EALEAEEMPQVWAEAMQIDSADVCALPYVIPY
        E  EA+EMP VWA+AMQI+SADV ALPYVIPY
Subjt:  EALEAEEMPQVWAEAMQIDSADVCALPYVIPY

XP_022138508.1 DNA ligase 1 [Momordica charantia]4.0e-20580.34Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRN-----KEKEKAHALEESRHSGKLG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKD+SDEKRREKKDKKKDRDKKKDR+     KEKEKAHA EESR  G+ G
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRN-----KEKEKAHALEESRHSGKLG

Query:  SQNGEEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKI
        SQNGEE VRV+HN+VQEEK+ GQFDSYVGNKISQNAFLSKETKNT +  EVGRRIED GT K+EKFAVA+P+RDDGM T + RSSET  EGKEQ+K +KI
Subjt:  SQNGEEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKI

Query:  DESKYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK----
        DE  YNGQGIRHEERFSGNSAVPSS      T TATVESGV GMFK ++K AERR E NDK R KE E K+ SKHKDKDK+KKGRSN KE  KEKK    
Subjt:  DESKYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK----

Query:  -KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPC
         KEK+++K KEKE VEEKNAMVDKTKEIN+DNLLGR+STNTNTSQLPDSNIGAAVEE LIKKRKDFEPNG+LHAIDNRSSKLL+PT HPLKENGRIL+PC
Subjt:  -KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPC

Query:  LTSTLTPSERQAVSNDLILVSKERKINGIIEA-HHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKP
        +TS L PSE QAV+NDLI  SKERKINGIIEA HH PASSKH S GQDD PQPT +HKKSPHPDSK+LSQVYSVPKMEELSDSDSQ+WLFSS+TS  MKP
Subjt:  LTSTLTPSERQAVSNDLILVSKERKINGIIEA-HHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKP

Query:  KLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY
        KLE  E EE  QVWA+AMQI+SADV ALPYVIPY
Subjt:  KLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY

XP_038875356.1 myb-like protein X [Benincasa hispida]1.1e-24593.49Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKA A EESR+SGK+GSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE

Query:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY
        EAVRV+HNSVQEEKHAGQFDSYVGNKISQNAFLSKE KN+KMVLEVGRRIED GTAKVEKFAVAQPRRDDGMVTQMGRSSETLFE KEQSK KKIDE KY
Subjt:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY

Query:  NGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKE
        NGQGIRHEERFSGNSAVPSS TTA    TATVESGVPGMFKQMEKSAERR EANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKK KE
Subjt:  NGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKE

Query:  KEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQ
        KEMVEEKNAMVDKTKEIN+DNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRS KLLKP  HPLKENGRILEPC TS L PS+RQ
Subjt:  KEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQ

Query:  AVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQ
        AVSNDLILVSKERKINGIIEAHH PASSKHKSGGQDDH QPT IHKKSPHPDSKHLSQVYSVPKMEELSDSD+QDWLFSS+TSLVMKPKLEA EAEEMPQ
Subjt:  AVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQ

Query:  VWAEAMQIDSADVCALPYVIPY
        VWAEAMQI+SADV ALPYVIPY
Subjt:  VWAEAMQIDSADVCALPYVIPY

TrEMBL top hitse value%identityAlignment
A0A0A0LHZ6 Uncharacterized protein2.3e-23087.22Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE

Query:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY
        E VRV+HNSVQEEKHAGQFDSYVGNKISQNAF SKETK+TKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQ+GR SETL E K+QSK KKIDE KY
Subjt:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKY

Query:  NGQGIRHEERFSGNSAVPSS----------VTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKK
        NGQG+RHEERFSG+SAVPSS            TATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEE+RSKHKDKDKEKKGRSNDK+ +KEKK
Subjt:  NGQGIRHEERFSGNSAVPSS----------VTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKK

Query:  KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCL
        KEK     K KE VEEKNAMVDKTKEIN+DNLLGRHSTNTNTSQLPDSNI AAVEENLIKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCL
Subjt:  KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCL

Query:  TSTLTPSERQAVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKL
         STL PSERQAVSNDLILVSKERKINGIIEAHH PASSKH+S GQ DHPQP A+HKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFS ++SL MKPKL
Subjt:  TSTLTPSERQAVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKL

Query:  EALEAEEMPQVWAEAMQIDSADVCALPYVIPY
        E  EA+EMP VWA+AMQI+SADV ALPYVIPY
Subjt:  EALEAEEMPQVWAEAMQIDSADVCALPYVIPY

A0A5A7SW08 Myb-like protein X1.8e-21487.37Show/hide
Query:  LKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYV
        L+ EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGEE VRV+HN+VQEEKHAGQFDSY+
Subjt:  LKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYV

Query:  GNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT
        GNKISQN+F SKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMV Q+GR SETL E K+QSK KKIDE KYNGQG+RHEERF G+SAVPSS TT
Subjt:  GNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT

Query:  ----ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINR
            ATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEK     K KE VEEKNAMVDKTKEIN+
Subjt:  ----ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINR

Query:  DNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGII
        DNLLGRHSTNTNTSQLPDSNIGAAVEEN+IKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCLTSTL PSERQAVSNDLILV KERKINGII
Subjt:  DNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGII

Query:  EAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY
        +AHH PAS KHKS GQ DHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLF  ++SL MKPKLEA +A EMP VWA+AMQI+SADV ALPY
Subjt:  EAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY

A0A5D3DDA1 Myb-like protein X1.8e-21487.7Show/hide
Query:  EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYVGNK
        EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHA EES+HSGK+GSQNGEE VRV+HN+VQEEKHAGQFDSY+GNK
Subjt:  EKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYVGNK

Query:  ISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT---
        ISQN+F SKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMV Q+GR SETL E K+QSK KKIDE KYNGQG+RHEERF G+SAVPSS TT   
Subjt:  ISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTT---

Query:  -ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINRDNL
         ATAT TATVE GVPGMFKQ+EKSAERR E+NDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEK     K KE VEEKNAMVDKTKEIN+DNL
Subjt:  -ATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINRDNL

Query:  LGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGIIEAH
        LGRHSTNTNTSQLPDSNIGAAVEEN+IKKRKDFEPNGVLHAIDNRSSKLL+PT H LKENGRILEPCLTSTL PSERQAVSNDLILV KERKINGII+AH
Subjt:  LGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGIIEAH

Query:  HTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY
        H PAS KHKS GQ DHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLF  ++SL MKPKLEA +A EMP VWA+AMQI+SADV ALPY
Subjt:  HTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPY

A0A6J1CD63 DNA ligase 12.0e-20580.34Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRN-----KEKEKAHALEESRHSGKLG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKD+SDEKRREKKDKKKDRDKKKDR+     KEKEKAHA EESR  G+ G
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRN-----KEKEKAHALEESRHSGKLG

Query:  SQNGEEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKI
        SQNGEE VRV+HN+VQEEK+ GQFDSYVGNKISQNAFLSKETKNT +  EVGRRIED GT K+EKFAVA+P+RDDGM T + RSSET  EGKEQ+K +KI
Subjt:  SQNGEEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKI

Query:  DESKYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK----
        DE  YNGQGIRHEERFSGNSAVPSS      T TATVESGV GMFK ++K AERR E NDK R KE E K+ SKHKDKDK+KKGRSN KE  KEKK    
Subjt:  DESKYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK----

Query:  -KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPC
         KEK+++K KEKE VEEKNAMVDKTKEIN+DNLLGR+STNTNTSQLPDSNIGAAVEE LIKKRKDFEPNG+LHAIDNRSSKLL+PT HPLKENGRIL+PC
Subjt:  -KEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPC

Query:  LTSTLTPSERQAVSNDLILVSKERKINGIIEA-HHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKP
        +TS L PSE QAV+NDLI  SKERKINGIIEA HH PASSKH S GQDD PQPT +HKKSPHPDSK+LSQVYSVPKMEELSDSDSQ+WLFSS+TS  MKP
Subjt:  LTSTLTPSERQAVSNDLILVSKERKINGIIEA-HHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDWLFSSDTSLVMKP

Query:  KLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY
        KLE  E EE  QVWA+AMQI+SADV ALPYVIPY
Subjt:  KLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY

A0A6J1HKL8 DNA ligase 1-like1.7e-19675.86Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKA-HALEESRHSGKLGSQNG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEK+EGKEKRDKDKS+EKRREKKDKKKDRDKKK+R+KEKEKA HALE+SR SGK+ SQNG
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKA-HALEESRHSGKLGSQNG

Query:  EEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEG-KEQSKIKKIDES
        EE VR++HNS+QEEKH+G+ DSYVGNKISQN  L KETKN+K+V EVGRRIEDSGTAKVEK AVAQP+RDDG+ TQ+ RSSE L EG KE  K K ID+ 
Subjt:  EEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEG-KEQSKIKKIDES

Query:  KYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK-----KE
        KYNGQGIRH+ERFSG+S VPS   T   T  ATVES         EK AERR E NDK R KEGEEKR SKHKDKDKEKKGRSNDK +DKEKK     KE
Subjt:  KYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR-SKHKDKDKEKKGRSNDKESDKEKK-----KE

Query:  KKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTS
        KKEKKAK+KE  EEKNA+VDKTKEI +DNL+GRHSTNT+TSQLPD NIGAAVEENL+KKRK FEPNGVLHAIDNRSSKLL+P  HPLKENGRILEPCL S
Subjt:  KKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLKPTPHPLKENGRILEPCLTS

Query:  TLTPSERQAV-SNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDH------------------------PQPTAIHKKSPHPDSKHLSQVYSVPKMEE
         L PSERQ V +ND+ILVSKERKINGIIEAHH PAS KHK+GGQDDH                        PQPTAIHKKSPHPDSK LS VYSVPKMEE
Subjt:  TLTPSERQAV-SNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDH------------------------PQPTAIHKKSPHPDSKHLSQVYSVPKMEE

Query:  LSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY
        L +SD+QDWLF+S+TS  MKP+LEA EA+EMPQVWAEAMQI+  DV ALPYVIPY
Subjt:  LSDSDSQDWLFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48610.1 unknown protein1.8e-0931.25Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE
        MSRCFPFPPPGYEKK R ++AD L K+K KEKKH   KK+KEKREGKEK+ KD+S +K++E+K+KK     +KD+ K KEK   LEE +           
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSDEKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGE

Query:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRI--EDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDES
                  +   +AG  ++ V + +  N+        +K V ++ RRI  ++  T       +  P + +  +T+         +  E S I++    
Subjt:  EAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRI--EDSGTAKVEKFAVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDES

Query:  KYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQME-KSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKK
          + + I +++ F   +A  SS    +             M K ME +   R+ E+ +K+ +KE   K  K +D++  KK  + DK+ +KEKK+EK E  
Subjt:  KYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQME-KSAERRNEANDKTRQKEGEEKRSKHKDKDKEKKGRSNDKESDKEKKKEKKEKK

Query:  AKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPD---SNIGAAVEENLIKKRKDFEPNGVLH
         K +   +EK  ++   K   R+    + S +    +LPD   ++I     E  + KRKD   NG L+
Subjt:  AKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPD---SNIGAAVEENLIKKRKDFEPNGVLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTGGAGCTCCGATCTTTTGTGGGATTCTGGTTTTAGTCTGTACTTTGTCGAGGCTGCGATTCTGAAACTGTTTTCTCTGATTTTCGGGAACTGGCATTTT
TGCTTGTGGAAAGAGCTCTTGAGTTTTGTCTTACTCTGTGCAATGTCGCGCTGCTTTCCATTTCCGCCACCAGGATATGAAAAGAAGTCTAGGGCAGAGGATGCG
GACCTACTAAAAAAGGAAAAGAGCAAAGAGAAAAAGCATAAAAAGGAGAAGAAAGAGAAGGAAAAGAGAGAGGGCAAAGAAAAAAGAGATAAGGATAAAAGTGAT
GAGAAACGCCGGGAAAAGAAAGACAAGAAGAAAGACAGGGACAAAAAGAAGGATAGGAATAAGGAGAAGGAGAAAGCACACGCCTTGGAAGAGAGTAGACATTCC
GGGAAACTTGGGAGTCAAAATGGAGAGGAAGCTGTTAGAGTCGAACATAACAGTGTCCAAGAGGAGAAGCATGCTGGGCAATTTGATTCTTACGTTGGAAATAAG
ATTAGTCAGAATGCATTTCTTTCTAAGGAGACAAAGAACACAAAAATGGTGCTGGAGGTGGGGAGGAGAATTGAGGACAGTGGAACTGCGAAGGTCGAGAAGTTT
GCTGTTGCACAACCAAGAAGGGATGATGGAATGGTCACACAGATGGGTAGGAGTTCTGAGACCTTGTTTGAAGGTAAGGAACAAAGCAAGATCAAGAAAATTGAT
GAAAGCAAATACAACGGTCAAGGAATTAGACATGAGGAAAGATTTAGTGGCAATTCTGCAGTCCCAAGTTCCGTTACGACTGCGACTGCAACTCCAACTGCAACT
GTTGAATCTGGAGTTCCAGGAATGTTTAAACAAATGGAGAAAAGTGCCGAGAGAAGGAACGAAGCAAATGATAAAACCAGACAGAAAGAAGGTGAAGAAAAACGG
AGCAAACACAAGGATAAAGATAAAGAGAAGAAAGGACGGTCAAATGATAAAGAGAGTGATAAAGAGAAGAAAAAAGAAAAGAAAGAAAAGAAAGCGAAGGAGAAG
GAGATGGTGGAAGAGAAGAATGCCATGGTAGATAAAACAAAAGAGATCAATAGGGATAACCTATTAGGTAGACATTCTACAAATACGAATACATCACAACTTCCT
GACAGCAATATAGGTGCTGCAGTTGAGGAAAACTTAATTAAGAAACGGAAGGATTTTGAGCCTAATGGAGTCTTGCATGCAATTGACAACAGATCCAGTAAGTTG
TTGAAGCCTACTCCTCATCCATTGAAGGAAAATGGTAGGATACTGGAACCGTGCTTGACTTCCACCTTGACTCCTTCAGAGAGACAGGCTGTGTCAAATGACCTT
ATTTTGGTTAGTAAAGAACGCAAGATTAATGGCATTATTGAAGCTCACCACACACCTGCCTCTTCGAAACACAAGTCTGGCGGACAAGATGATCATCCTCAACCT
ACTGCAATCCATAAAAAATCACCCCATCCAGATTCCAAGCACTTGAGTCAGGTATATTCGGTACCCAAAATGGAGGAATTGTCAGATTCTGATAGCCAAGACTGG
TTATTTAGCAGCGATACTTCCCTTGTGATGAAGCCCAAGTTGGAAGCTTTGGAGGCTGAGGAAATGCCGCAGGTATGGGCTGAGGCAATGCAGATAGATTCAGCT
GATGTTTGTGCTCTGCCTTATGTTATTCCATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTGGAGCTCCGATCTTTTGTGGGATTCTGGTTTTAGTCTGTACTTTGTCGAGGCTGCGATTCTGAAACTGTTTTCTCTGATTTTCGGGAACTGGCATTTT
TGCTTGTGGAAAGAGCTCTTGAGTTTTGTCTTACTCTGTGCAATGTCGCGCTGCTTTCCATTTCCGCCACCAGGATATGAAAAGAAGTCTAGGGCAGAGGATGCG
GACCTACTAAAAAAGGAAAAGAGCAAAGAGAAAAAGCATAAAAAGGAGAAGAAAGAGAAGGAAAAGAGAGAGGGCAAAGAAAAAAGAGATAAGGATAAAAGTGAT
GAGAAACGCCGGGAAAAGAAAGACAAGAAGAAAGACAGGGACAAAAAGAAGGATAGGAATAAGGAGAAGGAGAAAGCACACGCCTTGGAAGAGAGTAGACATTCC
GGGAAACTTGGGAGTCAAAATGGAGAGGAAGCTGTTAGAGTCGAACATAACAGTGTCCAAGAGGAGAAGCATGCTGGGCAATTTGATTCTTACGTTGGAAATAAG
ATTAGTCAGAATGCATTTCTTTCTAAGGAGACAAAGAACACAAAAATGGTGCTGGAGGTGGGGAGGAGAATTGAGGACAGTGGAACTGCGAAGGTCGAGAAGTTT
GCTGTTGCACAACCAAGAAGGGATGATGGAATGGTCACACAGATGGGTAGGAGTTCTGAGACCTTGTTTGAAGGTAAGGAACAAAGCAAGATCAAGAAAATTGAT
GAAAGCAAATACAACGGTCAAGGAATTAGACATGAGGAAAGATTTAGTGGCAATTCTGCAGTCCCAAGTTCCGTTACGACTGCGACTGCAACTCCAACTGCAACT
GTTGAATCTGGAGTTCCAGGAATGTTTAAACAAATGGAGAAAAGTGCCGAGAGAAGGAACGAAGCAAATGATAAAACCAGACAGAAAGAAGGTGAAGAAAAACGG
AGCAAACACAAGGATAAAGATAAAGAGAAGAAAGGACGGTCAAATGATAAAGAGAGTGATAAAGAGAAGAAAAAAGAAAAGAAAGAAAAGAAAGCGAAGGAGAAG
GAGATGGTGGAAGAGAAGAATGCCATGGTAGATAAAACAAAAGAGATCAATAGGGATAACCTATTAGGTAGACATTCTACAAATACGAATACATCACAACTTCCT
GACAGCAATATAGGTGCTGCAGTTGAGGAAAACTTAATTAAGAAACGGAAGGATTTTGAGCCTAATGGAGTCTTGCATGCAATTGACAACAGATCCAGTAAGTTG
TTGAAGCCTACTCCTCATCCATTGAAGGAAAATGGTAGGATACTGGAACCGTGCTTGACTTCCACCTTGACTCCTTCAGAGAGACAGGCTGTGTCAAATGACCTT
ATTTTGGTTAGTAAAGAACGCAAGATTAATGGCATTATTGAAGCTCACCACACACCTGCCTCTTCGAAACACAAGTCTGGCGGACAAGATGATCATCCTCAACCT
ACTGCAATCCATAAAAAATCACCCCATCCAGATTCCAAGCACTTGAGTCAGGTATATTCGGTACCCAAAATGGAGGAATTGTCAGATTCTGATAGCCAAGACTGG
TTATTTAGCAGCGATACTTCCCTTGTGATGAAGCCCAAGTTGGAAGCTTTGGAGGCTGAGGAAATGCCGCAGGTATGGGCTGAGGCAATGCAGATAGATTCAGCT
GATGTTTGTGCTCTGCCTTATGTTATTCCATACTGATGGTCTCACATGGTTTGGCATTGGTTTACCAACTTGACTCTATCTGACTGAGGGTTGGTAGACCCGCCT
TGAAATCGTGTTTGGAGTGGTGATTCACTGACAGCCATGTCGCACCAACTCCCTCAAACACATGGGAGCAGAGATCCATTTTGCTTCCACTGCCGTTCTTTGCTC
AGTTCTTGCAGCTGGGGCTGGACATGTGATTTTGATAATCTTGCTATCATTGTTGATTGGAACAGCATCTGGATTGCTCTTGATCGTTGGGGGTGGGCTAGGTAT
GGAAGGTGGTTTTTGGCTTAGGACATTAAATAACTAATTCATTGAAATGAGAAAATTCATAAAGGCAGCTCTGGGAGGTTTGCCTGCAGAGAGAGAGAGAGATAG
AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGGTGTGAGCTTGTGGTATTTTATTTAATTTGCTTGGAGCTGAAAAAAGCACTTTCTAATTGAG
AATGTGAAAGGAATTAGATGTGGTCTTTACAACTCCACCACAATTTTGATTTGTATCATAGAACTTTTATTTACAGAAATGTAAGATAGAGGGAAAAGGAAAGTA
GTGCAATTCATTTATGAGCCTGTGGATTAGTTTTCATAATGTTCAATTTTCATTTCTAGATGGTGATAAAGACTGCTGGTTTGCTTTTGAAGATACAGAAAGCTT
ACTAGCCTTATTGGGCTCAAGAAATTTTCTCCTATTTGCTAGCTTTCTGTTCTGCTGTCAGACTTTGGTAGCCATTGAAGTGATTTGTTAGCATTTAAGATCTCT
CTCCATAAATTAGTTTCCTGTTAATTAGACTGCGGATGAGAATAGAACAAGAATGGGGGCTGTTGAAGTTGATTGGTTGCAGGTGTTGGTGCTCTCCACAATCAT
TGAAACCCGATGGAACATTCCATCGGCCATCAACTCAGCCTTCAGCCCCGCCACTTCATTCTCCGGCAGAATCACACTCACGAACCTTCCGCCATCGTTGCGGCC
ACTGACCCCATCATCCGCTGGCCTGGGAGTGATCACGACCGCTCCACCGTCCACGACGCCGTGCAAGGTGTAGTACACAAATTTGGGTCTCTCAAAATCCAATTC
CACGATCCCATCTCCATATAAATCAGCTTCATCCCATTTCACAAATGTCAGATTCGCCCCATAAACAATGAAATCACTCACTCCATCATCCCTGTCTACAGCCAC
TTCAATGAGCTCCCCTTCTTCCTCCGCCGCGTTAGCTACCAGCAGGGCCGCCAGATCCCTCCGGTCAACGTCGGAGACGGCAGAGGCGGCGGATTTGACTGTGCT
GATCTTTTGGGTGTTCGCAATTATTTTCCCCTGTTGTTTAATTGGGTTGAGTTTGCAGAGAGTCACCGTCGTGGGTTCGTAGCCTCGCCGGAGCTTGGCTATGGA
CTGCCAGAGAGTAGCGGAGATGGACTCGAAAGGTGGGGTTTGATGGGGCATTTGGGTTTGTAATTTGGCCAATTGGGTCGCGTTTAATTTGAAGGAAAATGATTC
CATTTTGTACTTGTTGGTGGGAATCCAGTGGTCGCCGACTGGGTCGACTCGCCGGAGGGAAAGTGGGGGTTTGGCGGCGGTGTTGACAGGAGGTTTGGGCGGCGG
AGTGGTTGTGATGCTGCCGAAGGCCGGTAGTGGCGGGGGCGTGGCGGCGGCGGCGCCGAAAAGGATATTAGTGAGAGAGTTCATGAAAGCGGCGGCGGAGAAGGC
GTCGCCGAGGACGTGGGCCCAGCTCAGCCCAATTGATATTCCCTTGCATTTGAATCTCGTCACCTGTATATAAATGGGAGGGGAAAACTGTAATTCAGGTCCGAT
AACCTTCTGCGAAACCAGCAATTTCATCGCTGACCAATCGTCTACGATCATCTCCAGCCATTCCGCCACCGTACTATTGCACTCCGCCTCAATAAACCTAGCGCC
ACAGTCATTACACTTTATGAACGGCCGCCCGGAATCCGCTCGCCGGAGCCGACCGCAGGTGACGTAATACTCGTTAAATAGAAAGAATGTCGCCGCCTTAATCTG
CGACAGCGTCACCGTCTGGCTGGCCTCGGAATCGAAGAAATAGATTCCGTTGATGTAATGGAGCTTCATCGCCAAATCGAGCCCGGTCAGGTGATAGGCACTGTC
GGATCCAATAGAGTTTCCGGGACCGACGGAGGAGATTTTGAAGCTGTGAACGAGGCTCTGTTGATTATCGCCGGAAACCATGGCGGAAGGAGGAGGATTTGATCG
G
Protein sequenceShow/hide protein sequence
MGWSSDLLWDSGFSLYFVEAAILKLFSLIFGNWHFCLWKELLSFVLLCAMSRCFPFPPPGYEKKSRAEDADLLKKEKSKEKKHKKEKKEKEKREGKEKRDKDKSD
EKRREKKDKKKDRDKKKDRNKEKEKAHALEESRHSGKLGSQNGEEAVRVEHNSVQEEKHAGQFDSYVGNKISQNAFLSKETKNTKMVLEVGRRIEDSGTAKVEKF
AVAQPRRDDGMVTQMGRSSETLFEGKEQSKIKKIDESKYNGQGIRHEERFSGNSAVPSSVTTATATPTATVESGVPGMFKQMEKSAERRNEANDKTRQKEGEEKR
SKHKDKDKEKKGRSNDKESDKEKKKEKKEKKAKEKEMVEEKNAMVDKTKEINRDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKL
LKPTPHPLKENGRILEPCLTSTLTPSERQAVSNDLILVSKERKINGIIEAHHTPASSKHKSGGQDDHPQPTAIHKKSPHPDSKHLSQVYSVPKMEELSDSDSQDW
LFSSDTSLVMKPKLEALEAEEMPQVWAEAMQIDSADVCALPYVIPY