; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002858 (gene) of Snake gourd v1 genome

Gene IDTan0002858
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA ligase 1
Genome locationLG06:73844439..73857267
RNA-Seq ExpressionTan0002858
SyntenyTan0002858
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139013.2 glutamic acid-rich protein [Cucumis sativus]1.8e-21582.65Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEKREGKEKR+KDKSDEKRREKKDKKKDRDKKKDR+KEK+KA ASEES+  GKIGSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE

Query:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY
        EVVRVKHNSVQEEKHA QFDSYVGNKI QN F SKETK++K+  EVGRRIEDSGTAKVEKFAVAQP+RDDG  TQV R SETL + K+Q+KNKKIDERKY
Subjt:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY

Query:  NGQGIRHEERFGGNSAV--------AVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEK
        NGQG+RHEERF G+SAV        A    TATATATATA VE GV GMFK LE+S ERRKE+NDK R KEG+E+R SKHKDKDKEKKGRS DK+ +KEK
Subjt:  NGQGIRHEERFGGNSAV--------AVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEK

Query:  KKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRIL
        KKEKKA        KEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNI AAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSH LKENGRIL
Subjt:  KKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRIL

Query:  EPCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM
        EPCL STLP SERQAV+NDLILVSKERKINGI+EAHHPP+SSKH+S G  DHPQP A+HKKSPHPDSK+LS VYSVPKMEELSDSD+QDWL S N+S  M
Subjt:  EPCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM

Query:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        KPK E SEA+EMP VWA+AM+IESADVYALPYVIPY
Subjt:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

XP_022138508.1 DNA ligase 1 [Momordica charantia]7.8e-21181.72Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRD-----KEKDKARASEESRPLGKIG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEKREGKEKR+KD+SDEKRREKKDKKKDRDKKKDRD     KEK+KA ASEESRPLG+ G
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRD-----KEKDKARASEESRPLGKIG

Query:  SQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKI
        SQNGEEVVRVKHN+VQEEK+  QFDSYVGNKI QN FLSKETKN+ L PEVGRRIED GT K+EKFAVA+PKRDDG AT VARSSET  +GKEQNKN+KI
Subjt:  SQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKI

Query:  DERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKK-K
        DER YNGQGIRHEERF GNSAV         ++TATA VESGVAGMFK L++  ERRKETNDKIRPKE + K+GSKHKDKDK+KKGRS  KE  KEKK K
Subjt:  DERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKK-K

Query:  EK-KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILE
        EK K KEK+KEKEKEKVEEKNAMVDKTKEINKDNLLGR+STNTNTSQLPDSNIGAAVEE LIKKRKDFEPNG+LHAIDNRSSKLLRPT+HPLKENGRIL+
Subjt:  EK-KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILE

Query:  PCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHP-PSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM
        PC+TS LP SE QAVANDLI  SKERKINGI+EAHH  P+SSKH S G DD PQPT +HKKSPHPDSKYLS VYSVPKMEELSDSD+Q+WL S+NTSQ M
Subjt:  PCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHP-PSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM

Query:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        KPK E +E EE  QVWA+AM+IESADVYALPYVIPY
Subjt:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

XP_022964370.1 DNA ligase 1-like [Cucurbita moschata]1.4e-20477.52Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKA-RASEESRPLGKIGSQNG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEK+EGKEKR+KDKS+EKRREKKDKKKDRDKKK+RDKEK+KA  A E+SRP GKI SQNG
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKA-RASEESRPLGKIGSQNG

Query:  EEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKIDER
        EEVVR+KHNS+QEEKH+ + DSYVGNKI QNT L KETKNSKL PEVGRRIEDSGTAKVEK AVAQPKRDDG ATQVARSSE L +G KE  KNK ID+ 
Subjt:  EEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKIDER

Query:  KYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKA
        KYNGQGIRH+ERF G+S   VP+FTAT   TA A VES         E+  ERRKETNDKIRPKEG+EKRGSKHKDKDKEKKGRS DK +DKEKKKEKK+
Subjt:  KYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKA

Query:  KEKEKEKEKEK-VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLT
        KEK+++K K+K  EEKNA+VDKTKEI KDNL+GRHSTNT+TSQLPD NIGAAVEENL+KKRK FEPNGVLHAIDNRSSKLLRP SHPLKENGRILEPCL 
Subjt:  KEKEKEKEKEK-VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLT

Query:  STLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH------------------------PQPTAIHKKSPHPDSKYLSVVYSVPKME
        S LP SERQ V AND+ILVSKERKINGI+EAHHPP+S KHK+GG DDH                        PQPTAIHKKSPHPDSK+LSVVYSVPKME
Subjt:  STLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH------------------------PQPTAIHKKSPHPDSKYLSVVYSVPKME

Query:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        EL +SDNQDWL ++NTSQ MKP+ EASEA+EMPQVWAEAM+IE  DVYALPYVIPY
Subjt:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

XP_023513944.1 DNA ligase 1-like [Cucurbita pepo subsp. pepo]2.4e-20477.82Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEK--KEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKAR-------ASEESRPL
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEK  KEKEK+EGKEKR+KDKS+EKRREKKDKKKDRDKKK+RDKEK+K +       A E+SRP 
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEK--KEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKAR-------ASEESRPL

Query:  GKIGSQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQN
        GKI SQNGEEVVR+KHNS+QEEKH+ + DSYVGNKI QNT L KETKNSKL PEVGRRIEDSGTAKVEK AVAQPKRDDG ATQVARSSE LF+G KE  
Subjt:  GKIGSQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQN

Query:  KNKKIDERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDK
        KNK ID+ KYNGQGIRH+ERF G+S   VP+FTAT T TA A +ES         E+  ERRKETNDKIRPKEG+EKRGSKHKDKDKEKKGRS DK +DK
Subjt:  KNKKIDERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDK

Query:  EKKKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGR
        EKKKEKKAKEK K K+KE+ EEKNA+VDKTKEI KDNL+GRHSTNT+TSQLPDSNIGAAVEENLIKKRK FEPNGVLHAIDNRSSKLLRP SHPLKENGR
Subjt:  EKKKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGR

Query:  ILEPCLTSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDD--------------------HPQPTAIHKKSPHPDSKYLSVVYSVP
        ILEPC+ S LP SERQ V AND+ILVSKERKINGI+EAHHPP+S KHK+GG DD                     PQPTAIHKKSPHPDSK+LSVVYSVP
Subjt:  ILEPCLTSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDD--------------------HPQPTAIHKKSPHPDSKYLSVVYSVP

Query:  KMEELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        KMEELSDSDNQDWL ++NTSQ MKP+ EASEA+EMPQVWAEAM+IE +DVYALPYVIPY
Subjt:  KMEELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

XP_038875356.1 myb-like protein X [Benincasa hispida]1.4e-22385.98Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEKREGKEKR+KDKSDEKRREKKDKKKDRDKKKDR+KEK+KARA+EESR  GKIGSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE

Query:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY
        E VRVKHNSVQEEKHA QFDSYVGNKI QN FLSKE KNSK+  EVGRRIED GTAKVEKFAVAQP+RDDG  TQ+ RSSETLF+ KEQ+KNKKIDERKY
Subjt:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY

Query:  NGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKAKE
        NGQGIRHEERF GNSAV       ++  TATA VESGV GMFK +E+S ERRKE NDK R KEG+EKR SKHKDKDKEKKGRS DKESDKEKKKEKK K 
Subjt:  NGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKAKE

Query:  KEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLTSTL
          K KEKE VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRS KLL+P SHPLKENGRILEPC TS L
Subjt:  KEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLTSTL

Query:  PSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVMKPKSEASE
        P S+RQAV+NDLILVSKERKINGI+EAHHPP+SSKHKSGG DDH QPT IHKKSPHPDSK+LS VYSVPKMEELSDSDNQDWL S+NTS VMKPK EASE
Subjt:  PSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVMKPKSEASE

Query:  AEEMPQVWAEAMEIESADVYALPYVIPY
        AEEMPQVWAEAM+IESADVYALPYVIPY
Subjt:  AEEMPQVWAEAMEIESADVYALPYVIPY

TrEMBL top hitse value%identityAlignment
A0A0A0LHZ6 Uncharacterized protein8.7e-21682.65Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEKREGKEKR+KDKSDEKRREKKDKKKDRDKKKDR+KEK+KA ASEES+  GKIGSQNGE
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGE

Query:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY
        EVVRVKHNSVQEEKHA QFDSYVGNKI QN F SKETK++K+  EVGRRIEDSGTAKVEKFAVAQP+RDDG  TQV R SETL + K+Q+KNKKIDERKY
Subjt:  EVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKY

Query:  NGQGIRHEERFGGNSAV--------AVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEK
        NGQG+RHEERF G+SAV        A    TATATATATA VE GV GMFK LE+S ERRKE+NDK R KEG+E+R SKHKDKDKEKKGRS DK+ +KEK
Subjt:  NGQGIRHEERFGGNSAV--------AVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEK

Query:  KKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRIL
        KKEKKA        KEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNI AAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSH LKENGRIL
Subjt:  KKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRIL

Query:  EPCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM
        EPCL STLP SERQAV+NDLILVSKERKINGI+EAHHPP+SSKH+S G  DHPQP A+HKKSPHPDSK+LS VYSVPKMEELSDSD+QDWL S N+S  M
Subjt:  EPCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM

Query:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        KPK E SEA+EMP VWA+AM+IESADVYALPYVIPY
Subjt:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

A0A6J1CD63 DNA ligase 13.8e-21181.72Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRD-----KEKDKARASEESRPLGKIG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEKREGKEKR+KD+SDEKRREKKDKKKDRDKKKDRD     KEK+KA ASEESRPLG+ G
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRD-----KEKDKARASEESRPLGKIG

Query:  SQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKI
        SQNGEEVVRVKHN+VQEEK+  QFDSYVGNKI QN FLSKETKN+ L PEVGRRIED GT K+EKFAVA+PKRDDG AT VARSSET  +GKEQNKN+KI
Subjt:  SQNGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKI

Query:  DERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKK-K
        DER YNGQGIRHEERF GNSAV         ++TATA VESGVAGMFK L++  ERRKETNDKIRPKE + K+GSKHKDKDK+KKGRS  KE  KEKK K
Subjt:  DERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKK-K

Query:  EK-KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILE
        EK K KEK+KEKEKEKVEEKNAMVDKTKEINKDNLLGR+STNTNTSQLPDSNIGAAVEE LIKKRKDFEPNG+LHAIDNRSSKLLRPT+HPLKENGRIL+
Subjt:  EK-KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILE

Query:  PCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHP-PSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM
        PC+TS LP SE QAVANDLI  SKERKINGI+EAHH  P+SSKH S G DD PQPT +HKKSPHPDSKYLS VYSVPKMEELSDSD+Q+WL S+NTSQ M
Subjt:  PCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHP-PSSSKHKSGGPDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVM

Query:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        KPK E +E EE  QVWA+AM+IESADVYALPYVIPY
Subjt:  KPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

A0A6J1HKL8 DNA ligase 1-like6.9e-20577.52Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKA-RASEESRPLGKIGSQNG
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEK+EGKEKR+KDKS+EKRREKKDKKKDRDKKK+RDKEK+KA  A E+SRP GKI SQNG
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKA-RASEESRPLGKIGSQNG

Query:  EEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKIDER
        EEVVR+KHNS+QEEKH+ + DSYVGNKI QNT L KETKNSKL PEVGRRIEDSGTAKVEK AVAQPKRDDG ATQVARSSE L +G KE  KNK ID+ 
Subjt:  EEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKIDER

Query:  KYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKA
        KYNGQGIRH+ERF G+S   VP+FTAT   TA A VES         E+  ERRKETNDKIRPKEG+EKRGSKHKDKDKEKKGRS DK +DKEKKKEKK+
Subjt:  KYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKA

Query:  KEKEKEKEKEK-VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLT
        KEK+++K K+K  EEKNA+VDKTKEI KDNL+GRHSTNT+TSQLPD NIGAAVEENL+KKRK FEPNGVLHAIDNRSSKLLRP SHPLKENGRILEPCL 
Subjt:  KEKEKEKEKEK-VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLT

Query:  STLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH------------------------PQPTAIHKKSPHPDSKYLSVVYSVPKME
        S LP SERQ V AND+ILVSKERKINGI+EAHHPP+S KHK+GG DDH                        PQPTAIHKKSPHPDSK+LSVVYSVPKME
Subjt:  STLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH------------------------PQPTAIHKKSPHPDSKYLSVVYSVPKME

Query:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        EL +SDNQDWL ++NTSQ MKP+ EASEA+EMPQVWAEAM+IE  DVYALPYVIPY
Subjt:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

A0A6J1KEE4 myb-like protein X isoform X23.2e-20278.43Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARAS---EESRPLGKIGSQ
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEK+EGKEKR+KDKS+EKRREKKDK KDRDKKK+RDKEK+K +A    E+SR  GKI SQ
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARAS---EESRPLGKIGSQ

Query:  NGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKID
        NGEEVVR+KHNS+QEEKH+ + DSYVGNKI QNT L KETKNSKL PEVGRRIEDSGTAKVEK AVAQPKRDDG ATQVARSSE LF+G KE  KNK ID
Subjt:  NGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKID

Query:  ERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEK
        + KYNGQGIRH+ERF G+S   VP+FTAT T TA A VES         E+  E+RKETNDKIRPKEG+ KRGSKHKDKDKEKKGRS DK +DKEKKKEK
Subjt:  ERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEK

Query:  KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCL
        KAKEK K K+KE+ EEKNA+VDKTKEI KDNL+GRHST+TNTSQLPDSNIGAAVEENLIKKRK FEPNGVLHAIDNRSSKLLRP SHPLKENGRILEPCL
Subjt:  KAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCL

Query:  TSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH--------------PQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQD
         S LP SERQ V AND+IL SKERKINGI+EAHHPP+S KHK+   DDH              PQPTAIHKKSPHPDSK+LSVVYSVPKMEELSDSDNQD
Subjt:  TSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH--------------PQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQD

Query:  WLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        WL ++N SQ MKP+ EASEA+EM QVWAEAM+IE +DVYALPYVIPY
Subjt:  WLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

A0A6J1KGW4 DNA ligase 1-like isoform X13.2e-20277.16Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARAS---EESRPLGKIGSQ
        MSRCFPFPPPGYEKKSRAEDADLLKKEKSKE+KHKKEKKEKEK+EGKEKR+KDKS+EKRREKKDK KDRDKKK+RDKEK+K +A    E+SR  GKI SQ
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARAS---EESRPLGKIGSQ

Query:  NGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKID
        NGEEVVR+KHNS+QEEKH+ + DSYVGNKI QNT L KETKNSKL PEVGRRIEDSGTAKVEK AVAQPKRDDG ATQVARSSE LF+G KE  KNK ID
Subjt:  NGEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDG-KEQNKNKKID

Query:  ERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEK
        + KYNGQGIRH+ERF G+S   VP+FTAT T TA A VES         E+  E+RKETNDKIRPKEG+ KRGSKHKDKDKEKKGRS DK +DKEKKKEK
Subjt:  ERKYNGQGIRHEERFGGNSAVAVPNFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEK

Query:  KAKEKEKEKEKEK---------VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKE
        KAKEK+++K KEK          EEKNA+VDKTKEI KDNL+GRHST+TNTSQLPDSNIGAAVEENLIKKRK FEPNGVLHAIDNRSSKLLRP SHPLKE
Subjt:  KAKEKEKEKEKEK---------VEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKE

Query:  NGRILEPCLTSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH--------------PQPTAIHKKSPHPDSKYLSVVYSVPKME
        NGRILEPCL S LP SERQ V AND+IL SKERKINGI+EAHHPP+S KHK+   DDH              PQPTAIHKKSPHPDSK+LSVVYSVPKME
Subjt:  NGRILEPCLTSTLPSSERQAV-ANDLILVSKERKINGIVEAHHPPSSSKHKSGGPDDH--------------PQPTAIHKKSPHPDSKYLSVVYSVPKME

Query:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY
        ELSDSDNQDWL ++N SQ MKP+ EASEA+EM QVWAEAM+IE +DVYALPYVIPY
Subjt:  ELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48610.1 unknown protein2.9e-0630.89Show/hide
Query:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRP--LGKIGSQN
        MSRCFPFPPPGYEKK R ++AD L K+K KE+KH   KK+KEKREGKEK+ KD+S +K++E+K+KK     +KD++K K+K +  EE +   L   G + 
Subjt:  MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRP--LGKIGSQN

Query:  GEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRI--EDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKID
              V++NS  E                           SK   ++ RRI  ++  T       +  P + +   T+ A  +  +    E+  ++  D
Subjt:  GEEVVRVKHNSVQEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRI--EDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKID

Query:  ERKYNGQGIRHEERFGGNSAVAVPNFTAT-ATATATAAVESG-----VAGMFKPLE-RSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESD
         ++ N Q                 NFTA  ++  A + V  G        M KP+E R   R+ E+ +K   KE   K   K +D++  KK  +KDK+ +
Subjt:  ERKYNGQGIRHEERFGGNSAVAVPNFTAT-ATATATAAVESG-----VAGMFKPLE-RSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESD

Query:  KEKKKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPD---SNIGAAVEENLIKKRKDFEPNGVLH
        KEKK+EK     +  +EK K+     + ++ K+          S +    +LPD   ++I     E  + KRKD   NG L+
Subjt:  KEKKKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLGRHSTNTNTSQLPD---SNIGAAVEENLIKKRKDFEPNGVLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCGCTGCTTTCCATTTCCGCCACCAGGATATGAAAAGAAGTCTAGGGCAGAGGATGCGGATCTACTAAAAAAGGAGAAGAGCAAAGAGAGAAAACATAAAAAGGA
GAAGAAAGAGAAGGAGAAGAGAGAGGGTAAAGAAAAAAGAGAGAAGGATAAAAGTGATGAGAAACGCAGGGAGAAGAAAGACAAAAAGAAAGACAGGGACAAGAAGAAGG
ATAGGGATAAGGAGAAGGATAAAGCACGTGCCTCAGAGGAGAGTAGACCTTTGGGGAAAATTGGGAGTCAAAATGGAGAGGAAGTTGTTAGAGTCAAACATAACAGTGTC
CAAGAGGAGAAGCATGCTGCGCAATTTGATTCTTACGTTGGAAATAAGATTGGTCAGAATACTTTTCTTTCTAAGGAGACAAAGAACTCAAAATTGGCACCAGAGGTGGG
GAGGAGAATTGAGGACAGTGGAACTGCAAAGGTCGAGAAGTTTGCTGTTGCACAACCAAAAAGGGATGATGGAGCGGCCACACAGGTGGCTAGGAGTTCTGAGACCTTGT
TTGATGGTAAGGAACAGAACAAGAACAAGAAAATTGATGAAAGAAAATACAATGGTCAAGGAATTAGACACGAGGAAAGATTTGGTGGAAATTCTGCAGTTGCGGTCCCA
AATTTCACTGCAACTGCAACTGCAACTGCCACTGCAGCTGTTGAATCTGGAGTTGCAGGAATGTTTAAACCATTGGAGAGAAGTGTTGAGAGAAGGAAAGAAACAAATGA
TAAAATCAGACCGAAAGAAGGTGATGAAAAACGAGGGAGCAAACACAAGGATAAAGATAAAGAGAAAAAAGGACGATCGAAGGATAAAGAGAGCGATAAAGAGAAGAAAA
AAGAAAAGAAAGCCAAGGAGAAGGAGAAGGAAAAGGAGAAGGAGAAGGTGGAAGAGAAGAATGCCATGGTAGATAAAACAAAAGAGATCAATAAGGATAACCTCTTAGGT
AGACATTCTACAAATACAAATACATCACAACTTCCTGACAGCAATATAGGTGCTGCAGTTGAGGAAAATCTAATTAAGAAACGGAAGGATTTTGAGCCTAATGGAGTCTT
GCATGCAATTGACAACAGATCCAGTAAGTTGTTGAGGCCAACTTCTCATCCATTGAAGGAAAATGGTAGGATACTTGAACCGTGCTTGACTTCCACCTTGCCTTCTTCAG
AGAGACAGGCTGTGGCCAATGACCTTATTTTGGTTAGTAAAGAACGCAAGATTAATGGCATTGTTGAAGCTCACCACCCGCCTTCCTCTTCAAAACACAAGTCTGGCGGA
CCAGACGATCATCCTCAACCTACTGCAATACATAAAAAATCACCCCATCCTGATTCCAAGTACTTGAGCGTGGTATATTCAGTACCCAAAATGGAAGAATTGTCAGATTC
TGATAACCAAGATTGGTTATTGAGCAACAATACTTCCCAAGTGATGAAGCCAAAGTCAGAAGCTTCAGAGGCCGAGGAAATGCCACAGGTATGGGCTGAGGCGATGGAGA
TAGAATCGGCTGATGTTTATGCTCTGCCTTATGTTATTCCATACTGA
mRNA sequenceShow/hide mRNA sequence
GTTTGAGGCATTCTCAAATCGTTTCTCTTATTTTTCTTGAGAATAGTTGGGGGAACTCAGAGACAGAGATACACCCAAAGAGAAACAAAGAAGGAAACACAGTAGAACCA
GAGAAGAGAAGAGAAGAGACGAAACCCCCAATTTTTACACACTTAATAGACTCAAATCTCTACACACAGAAGAAGAGAAACAGAATCAGAAACAGAAACAGAAAGAAGAG
AGAATCTAGAGCTTGAAACTTCTTTTTTTTTCTTCGAAATTTTGATTAGTTTTTTTTTTCTTCCCTTTTTCCCATTTCCTATGCCTTCTGTTTCTGGTTAATCTCTGTTT
GAGCGAGATAGGAGAGAGATCTTGTCCTTCTACATTGCCCACTCTGTAGCGGACAAGGTAATCTCATTACCCAATTTCCGAATCACTTACTTTCATCGACCCACTTGATT
TTCCCCTTGGAATTTCTTCGTTTTCAGCCTGAATCGAGCTGCTTTTACCCTTTTCATCGATACCCTATAATTGCATAGGGTTTTGACGGATCGTAACTGGGTGGCTCTCC
AATTTATCGGTTTTGCACTAATTAGGGTTTGATACCGACTATGGATTGAAAGTATGATTCTCTAGAATGGGCTGGAGCTCCGATCTTTCGTGGGATTCTGGTTTTAGTCT
CTACTTTGCCGAGGCTGCGATTCTGAAACTGTAACGTACCACTAGAATGAACTGAACTTTGCTGGTCTGATCATGGAAGAGGATTGCACATTATAAACTTGATTTTGGCA
GTGATCACTCTCAACCCAATCAATTTCTTGGATTACAAGGGATTTTCGTTACTGTTTCAGGTTTTTCATCTCTAATTAGGAACTTCTGTTGGTTTAAAGAGGAGCTAATA
ATTCGCCCAAGAGGATAATCAAGAGACCATTGCGTTGAATCATTGGCATTTTTCACGAGCTGATCGGGCGAGCTAATTTGTGCTAAATTTCTTGCTTAGGAGTTCTTTTT
ATCTGCGTTTCAGTTGATTTTGTCCTTTCATATTTAGGTTTTCTCTGATATTCGGGATCTGGGTTAGAATTCAGTGGCAGAGTTGGCTCCTATTATTTTTGAGAATTTTG
GAAATATCTCTTGGCCTGTGCTTATTATTCCTGTACACGCTTACAAGTGGGAAGAGTTCTCTGAGGTTTGGGAATTGCATCTTTGGGATTGACTAAAAAGCACATGGAGG
AATATTGAAGCTGGAAATGAATGCGAGGACATGGTGCATGGACAGCAGCTTAGTTTCTTGTTCGTTCCCTTTTCGGAACAGTGCTTCTGGAAAGTAGTTATAGCATTTTC
GCCTGTGGAAGGAGCTCTTGTTTTTTGTCTAGGTACTCTGTGCAATGTCGCGCTGCTTTCCATTTCCGCCACCAGGATATGAAAAGAAGTCTAGGGCAGAGGATGCGGAT
CTACTAAAAAAGGAGAAGAGCAAAGAGAGAAAACATAAAAAGGAGAAGAAAGAGAAGGAGAAGAGAGAGGGTAAAGAAAAAAGAGAGAAGGATAAAAGTGATGAGAAACG
CAGGGAGAAGAAAGACAAAAAGAAAGACAGGGACAAGAAGAAGGATAGGGATAAGGAGAAGGATAAAGCACGTGCCTCAGAGGAGAGTAGACCTTTGGGGAAAATTGGGA
GTCAAAATGGAGAGGAAGTTGTTAGAGTCAAACATAACAGTGTCCAAGAGGAGAAGCATGCTGCGCAATTTGATTCTTACGTTGGAAATAAGATTGGTCAGAATACTTTT
CTTTCTAAGGAGACAAAGAACTCAAAATTGGCACCAGAGGTGGGGAGGAGAATTGAGGACAGTGGAACTGCAAAGGTCGAGAAGTTTGCTGTTGCACAACCAAAAAGGGA
TGATGGAGCGGCCACACAGGTGGCTAGGAGTTCTGAGACCTTGTTTGATGGTAAGGAACAGAACAAGAACAAGAAAATTGATGAAAGAAAATACAATGGTCAAGGAATTA
GACACGAGGAAAGATTTGGTGGAAATTCTGCAGTTGCGGTCCCAAATTTCACTGCAACTGCAACTGCAACTGCCACTGCAGCTGTTGAATCTGGAGTTGCAGGAATGTTT
AAACCATTGGAGAGAAGTGTTGAGAGAAGGAAAGAAACAAATGATAAAATCAGACCGAAAGAAGGTGATGAAAAACGAGGGAGCAAACACAAGGATAAAGATAAAGAGAA
AAAAGGACGATCGAAGGATAAAGAGAGCGATAAAGAGAAGAAAAAAGAAAAGAAAGCCAAGGAGAAGGAGAAGGAAAAGGAGAAGGAGAAGGTGGAAGAGAAGAATGCCA
TGGTAGATAAAACAAAAGAGATCAATAAGGATAACCTCTTAGGTAGACATTCTACAAATACAAATACATCACAACTTCCTGACAGCAATATAGGTGCTGCAGTTGAGGAA
AATCTAATTAAGAAACGGAAGGATTTTGAGCCTAATGGAGTCTTGCATGCAATTGACAACAGATCCAGTAAGTTGTTGAGGCCAACTTCTCATCCATTGAAGGAAAATGG
TAGGATACTTGAACCGTGCTTGACTTCCACCTTGCCTTCTTCAGAGAGACAGGCTGTGGCCAATGACCTTATTTTGGTTAGTAAAGAACGCAAGATTAATGGCATTGTTG
AAGCTCACCACCCGCCTTCCTCTTCAAAACACAAGTCTGGCGGACCAGACGATCATCCTCAACCTACTGCAATACATAAAAAATCACCCCATCCTGATTCCAAGTACTTG
AGCGTGGTATATTCAGTACCCAAAATGGAAGAATTGTCAGATTCTGATAACCAAGATTGGTTATTGAGCAACAATACTTCCCAAGTGATGAAGCCAAAGTCAGAAGCTTC
AGAGGCCGAGGAAATGCCACAGGTATGGGCTGAGGCGATGGAGATAGAATCGGCTGATGTTTATGCTCTGCCTTATGTTATTCCATACTGATGGTCTCACATGATTGGGC
ATGATCTACCAACTTGACTCTATTTGACTCGGGGTAGAGACCCGCCTTGGAATCATGTCGGAGTGGTGATTCATCAACAGCCAGCTCTTGATATGGAGTTTTTCCATTCT
TCTTCCACTGCACCTGCATTCTTGCTCGGTTCTGGCAGCTGGGGCTGGACACGTGATTTTGATCATCTTGCTATCAAATCGTTGATTGGAACAGCATCTATGGAATGCTC
TTGATCATTGAGGGGTGGGGCTTTAGGTATGGAAGGTGGTTTTTGGCTTTAGGACATTAAATAACTAATTCATTGAAATGAGAAAATTCATAAAGGCAACTCTGGGAGGT
TTTGCCTGCAGAGAGAGAGAGAGAGTGAGTGAGAGAGAGAGAGAGAGAGAGAGAGAAAGAAAGGAGGTGTGAGCTTGTGGTATTTTATTTAATTTGCTTGGAGCTGAAAA
AAGCACTTTCTAATTGAGAATGTGAAAGGAATTAGATGTGGTCTTTACAACTCCACCACAATTTTGATTTGTATCATAGAACTTTTATTTACAGAAATGTAAGATAGAGG
GAAAAAGGAAAGTAGTGCAATTCATTCATGAGCCTGTGGATTAGTTTCCATATCTTCATTTTTCATTCCTAGATGGATGGTGATAAAGAACTGCTGGTTGCTTTTGAAGA
TAACAGAAAGCTTATTAGCCTTATAAATAGAGATGATGGCTCAAGAAAACTTCTCTTCTTCCCTTTGCTAGCTTTCTGTTCTGCATTCAGACTTTGGTGGCCATTGAAGT
GATTTGTTTGAAACTAAGATCTCTCTCCATTTATTAGTTTTTGGTTCAAATTTGCAGACAGATGGAAATAGAATAGAAGATGAATGGAGGCTGAAATTGCAGGTGTTGGT
GTCTCACAATTCATTGAAAAATGAGGGTAAGTTGAAGAATCCATTATAAACCCTGCATTTTGATCCTTTCTGCCTTAAAATCTTTATTCAATTTTTTTTCCCTTAGGGTG
AGAGGATTCGAATATCTATTCTCTTAGTCGAAGGCATATGCCTTAACCAGTTGAACTTGGTTGTGGTTGACAATATTTATGGTTTGTTTTCTGAACTGGTTTTCAAATCT
GTTTTATAGAGGTAGCAAGTGGTTTGTTGAAGTTGGGTTTATTGGATACTTGAAAACTTGAATTTCAATTGTAGAGTAGCTTTTATGATCTCTAAAGGAAGCAATATCTT
TTTTTTGGGTGGTTTAGGTGTAATTTTTTAAAATTTTATTTTGCAAATTAATCTTTTAGAACTTCAAATGTTTCTTCTTCATATTAGTGTAACAAAGATTTAAAACATAG
CTTTGGTTTTTAAAGTTCCCTCATGTTTTTATTGGACTCAATTTCAAGGGGTCATTGTTTCTACAATGTAATATTCCAATCATTAAAAAAATTTTAACAAAATTTTCTTT
AAATTTATTATTAGTTGACATGTTCTTTAGGATCACAGTATGTTGGGCAGTGTTGGAGGTCCACTTCCCATTATAGAGATTGAAGATTGAAGATTTCTTTTTATTGATAG
GG
Protein sequenceShow/hide protein sequence
MSRCFPFPPPGYEKKSRAEDADLLKKEKSKERKHKKEKKEKEKREGKEKREKDKSDEKRREKKDKKKDRDKKKDRDKEKDKARASEESRPLGKIGSQNGEEVVRVKHNSV
QEEKHAAQFDSYVGNKIGQNTFLSKETKNSKLAPEVGRRIEDSGTAKVEKFAVAQPKRDDGAATQVARSSETLFDGKEQNKNKKIDERKYNGQGIRHEERFGGNSAVAVP
NFTATATATATAAVESGVAGMFKPLERSVERRKETNDKIRPKEGDEKRGSKHKDKDKEKKGRSKDKESDKEKKKEKKAKEKEKEKEKEKVEEKNAMVDKTKEINKDNLLG
RHSTNTNTSQLPDSNIGAAVEENLIKKRKDFEPNGVLHAIDNRSSKLLRPTSHPLKENGRILEPCLTSTLPSSERQAVANDLILVSKERKINGIVEAHHPPSSSKHKSGG
PDDHPQPTAIHKKSPHPDSKYLSVVYSVPKMEELSDSDNQDWLLSNNTSQVMKPKSEASEAEEMPQVWAEAMEIESADVYALPYVIPY