; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001186 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001186
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr4:26212956..26216104
RNA-Seq ExpressionLag0001186
SyntenyLag0001186
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]2.7e-3028.98Show/hide
Query:  MTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPK--GKEHCHTLTLRSGKQL-------------LPCEPAPVYEDS
        M KE +   +   ++ +A+V SQA SLRNLE QVGQLA++L+NRP G L S+T  PK  G EHC  +TL+SGK L              P     + +D 
Subjt:  MTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPK--GKEHCHTLTLRSGKQL-------------LPCEPAPVYEDS

Query:  NKDTTR-------------------PNNVQEQEDN-------------------TEAAETKP-----------KKK-------------------EKRPK
         ++  +                   P   Q+Q+ +                    EA E  P           KK+                    K P 
Subjt:  NKDTTR-------------------PNNVQEQEDN-------------------TEAAETKP-----------KKK-------------------EKRPK

Query:  K-------------------------------------------SLKPTS------------------------PQFSSP-DFIILEYEADMEVPIILGR
        K                                            ++PT+                         +F  P DFI+L+YEAD EVPIILGR
Subjt:  K-------------------------------------------SLKPTS------------------------PQFSSP-DFIILEYEADMEVPIILGR

Query:  PFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVAC--------IDVCSPMEILQMIVLKAARKEEQDNQE-----------------
        PFLATG+TLIDVQ GELTM + D+KV FNVF A KYSD+ +    +          +D  S  + L+ ++L  + +EE DN E                 
Subjt:  PFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVAC--------IDVCSPMEILQMIVLKAARKEEQDNQE-----------------

Query:  -------------------------------------EGEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI
                                              G   +LPVIIS++L++++E  L+++L  H+ AIGWS+ADIKGI+PS CMHKI
Subjt:  -------------------------------------EGEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI

XP_023521781.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo]7.1e-3133.33Show/hide
Query:  QGATSSNSLETMTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTG--ALPSNTGAPKG----------------KEHCHTLTLR---
        QG  +S +       L S  KEYMAKNDAV+QSQ  SLRNLE+QVGQLA++L+NRP     +P+     K                  E C  +      
Subjt:  QGATSSNSLETMTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTG--ALPSNTGAPKG----------------KEHCHTLTLR---

Query:  ---------------SGKQL----------LPCEPAPVYEDSNKDTTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLKPTSPQFSSP-DFIILEYEADME
                        GK+L          +   P  +Y+       RP  V  Q    + + T P+ K     + +     +F+ P DFIIL+YEAD +
Subjt:  ---------------SGKQL----------LPCEPAPVYEDSNKDTTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLKPTSPQFSSP-DFIILEYEADME

Query:  VPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKY------------------SDEPKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQE
        VPIILGRPFL TG+TL+DV  G +T+ M DQKV FN+  + KY                  ++E    E+    D      I Q+ VL    +  +  + 
Subjt:  VPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKY------------------SDEPKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQE

Query:  E-------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI
        E                               G+ ++LP+IIS+ L+ +QE +L++ L +HKGAIGW+LADIKGI+PS CMHKI
Subjt:  E-------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI

XP_024022362.1 uncharacterized protein LOC112091881 [Morus notabilis]2.4e-3131.33Show/hide
Query:  KEYMAKND-------AVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKG------KEHCHTLTLRSGKQL-LPCEPAPVYEDSN---KDTTRP
        KEYMA+ND       A++QSQA SLR LE QVGQLA+ L NRP G+LPS+T  P+       KEHC  +TLR+G+++  P       E S+   ++  +P
Subjt:  KEYMAKND-------AVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKG------KEHCHTLTLRSGKQL-LPCEPAPVYEDSN---KDTTRP

Query:  NNVQEQ----EDNT--------EAAETKP-----------KK-------------------KEKRPKKSLKPTS--------------------------
            EQ    +D T        EA E  P           KK                   K + P K   P S                          
Subjt:  NNVQEQ----EDNT--------EAAETKP-----------KK-------------------KEKRPKKSLKPTS--------------------------

Query:  -----------------------------------------PQFSSP-DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTA
                                                  +F  P DFI+L+YEAD EVPIILGRPFLATG+TLIDVQ GELTM ++DQ+V FNVF A
Subjt:  -----------------------------------------PQFSSP-DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTA

Query:  FKYSDEPKGSETVACIDVCSPMEILQMIVLKAARKE----------------EQDNQEE-----------------------------------------
         +++DE         ++ CS M IL  +V     K                  +DN ++                                         
Subjt:  FKYSDEPKGSETVACIDVCSPMEILQMIVLKAARKE----------------EQDNQEE-----------------------------------------

Query:  --------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI
                      G+ ++LPVII+S LN  QE  L++VL + K AIGW++ADIKGI+PS CMHKI
Subjt:  --------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI

XP_024035546.1 uncharacterized protein LOC112096353 [Citrus clementina]2.3e-3433.42Show/hide
Query:  KEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKG--KEHCHTLTLRSGKQL----------LPCEPA------------PVYEDS
        KEY+AKN+A+VQSQA SLRNLE Q+GQLA  + NR   +LPSNT   +   KEHC  ++LRSGK +          + C  A              ++DS
Subjt:  KEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKG--KEHCHTLTLRSGKQL----------LPCEPA------------PVYEDS

Query:  N------------------KDTTRP--NNVQEQ-----EDNTEAAETKP-------KKKEKRPKKSLKPTS----------------PQFSSP-DFIILE
        +                  K+ T P   N+ +Q     E N +     P       +K++K+  K L+                    +F  P DFI+L 
Subjt:  N------------------KDTTRP--NNVQEQ-----EDNTEAAETKP-------KKKEKRPKKSLKPTS----------------PQFSSP-DFIILE

Query:  YEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDV---------CS-------PMEILQMIVLKAARKEEQ
        +EAD EVPIILG+PFLATG+TLIDVQ GELTM +NDQ+V FNV  A +  DE +    ++ +D+         CS       P E ++   + A + +  
Subjt:  YEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDV---------CS-------PMEILQMIVLKAARKEEQ

Query:  DNQEE--------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHK
        D Q+                                             G+  +LP+IISS L+  QE  L+ +L +++ AIGW++ DIKGI+PS CMHK
Subjt:  DNQEE--------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHK

Query:  I
        I
Subjt:  I

XP_038885994.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120076299 [Benincasa hispida]4.6e-3033.25Show/hide
Query:  ATSLRNLEIQVGQLASKLKNRPTGALPSNT---GAPKGKEHCHTLTLRSGKQL----------------LPC--------------------EPAPVYED
        A+S+RNLEIQVGQ+AS+LKNR  G LPSNT   G+  GK+ CH +TLRSG+ L                + C                    E  P Y  
Subjt:  ATSLRNLEIQVGQLASKLKNRPTGALPSNT---GAPKGKEHCHTLTLRSGKQL----------------LPC--------------------EPAPVYED

Query:  SNKD-TTRPNNVQE-------QEDNTEAAETKPKKKEKRPKKSLKPTS----------------------------------PQFSS-------------
          KD  T+   V E       QE N   + + P KK+K P     P+S                                  P F +             
Subjt:  SNKD-TTRPNNVQE-------QEDNTEAAETKPKKKEKRPKKSLKPTS----------------------------------PQFSS-------------

Query:  ---------------PDFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCSPME---ILQMIV
                        +FIIL+YEAD EVPIILG PFLATG+ LIDV   ELTM +++++V FNV  A K+ D    SE      +  P E   + +++ 
Subjt:  ---------------PDFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCSPME---ILQMIV

Query:  LKAARKE------------------EQDNQEE-------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI
        L+   KE                  E+ ++ E             G   +LPVIIS+ L   +E  L+Q+L +HK AIGW+LADI+GI+PSYCMHKI
Subjt:  LKAARKE------------------EQDNQEE-------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI

TrEMBL top hitse value%identityAlignment
A0A2G9H2I8 DNA-directed DNA polymerase8.7e-2737.86Show/hide
Query:  DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCS--------PMEILQMIVLKAARKEEQDN
        DF++L+ E D+EVPIILGRPFLATG+TLIDVQ GELTM + DQ++ FNVF A K+ +E      V+  D  +        P++ L+  +L    +E +++
Subjt:  DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCS--------PMEILQMIVLKAARKEEQDN

Query:  QEE---------------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPS
        +E                                                    GE ++LPVIIS  L+  Q   L++VL  HKGAIGW++ADIKGI+PS
Subjt:  QEE---------------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPS

Query:  YCMHKI
        +C+HKI
Subjt:  YCMHKI

A0A2G9HYA0 Reverse transcriptase1.8e-2738.83Show/hide
Query:  DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCS--------PMEILQMIVLKAARKEEQDN
        DF++L+ E D EVPIILGRPFLATG+TLIDVQ GELTM + DQ++ FNVF A K+ +E      V+  D  +        P++ L+  +L    +E +++
Subjt:  DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCS--------PMEILQMIVLKAARKEEQDN

Query:  QEE---------------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPS
         E                                                    GE ++LPVIISS L+  Q   L++VL  HKGAIGW++ADIKGI+PS
Subjt:  QEE---------------------------------------------------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPS

Query:  YCMHKI
        +CMHKI
Subjt:  YCMHKI

A0A2G9I8H7 Uncharacterized protein3.0e-2734.5Show/hide
Query:  SQATSLRNLEIQVGQLASKLKNRPTGALPSNTG---------APKGKE------HCHTLTLRSGKQL------LPCEPAPVYEDSNKDTTRPNNVQEQED
        S AT+ + +E ++GQLA+ + +RP   LPSNT           P+ K+       C   T  SG+ L      +   P  +Y        +P N+  Q  
Subjt:  SQATSLRNLEIQVGQLASKLKNRPTGALPSNTG---------APKGKE------HCHTLTLRSGKQL------LPCEPAPVYEDSNKDTTRPNNVQEQED

Query:  NTEAAETKPKKKEKRPKKSLKPTSPQFSSP-DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKY---SDE--------
        N      K   ++   K        +F  P DF+IL+ EAD E+ IILGRPF ATG+TL +VQ GELTMG+ DQ + FNVF A K+   SDE        
Subjt:  NTEAAETKPKKKEKRPKKSLKPTSPQFSSP-DFIILEYEADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKY---SDE--------

Query:  ------------------PKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQEE------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLAD
                          P+G E+   ++  +P ++L   + +    E +            E ++LP+IISS L+  Q   L++VL  HKGAIGW++AD
Subjt:  ------------------PKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQEE------GEGESLPVIISSKLNLSQESLLMQVLARHKGAIGWSLAD

Query:  IKGINPSYCMHKI
        IKGI+ S+CMHKI
Subjt:  IKGINPSYCMHKI

A0A2G9I9I8 DNA-directed DNA polymerase1.1e-2630.26Show/hide
Query:  SQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKGKEHCHTLTLRSGKQLLPCEPAPVYEDSNKDTTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLK
        S A + + +EIQ+GQLA+++ +RP G+LPSNT  P  ++   T  LR+G++L         +   K+     N +E E   E    K      +  K + 
Subjt:  SQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPKGKEHCHTLTLRSGKQLLPCEPAPVYEDSNKDTTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLK

Query:  PTSPQFSSPDFIILEYE---------------------------------------ADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTA
            +    + + L  E                                        D E+PIILGRPFLATG+TLIDVQ GELTM + DQ++ FNVF A
Subjt:  PTSPQFSSPDFIILEYE---------------------------------------ADMEVPIILGRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTA

Query:  FKYSDEPKGSETVACID--------VCSPMEILQMIVLKAARKEEQDNQEE--------------------------------------------GEGES
         K+ +E     +V+ +D           P++ L+  +L    +E +D+ E                                             GE ++
Subjt:  FKYSDEPKGSETVACID--------VCSPMEILQMIVLKAARKEEQDNQEE--------------------------------------------GEGES

Query:  LPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI
        LPVIISS L   Q   L++VL  HK AIGW++ DIKGI+PS+C+HKI
Subjt:  LPVIISSKLNLSQESLLMQVLARHKGAIGWSLADIKGINPSYCMHKI

A0A5B6VWJ0 Retroelement pol polyprotein-like1.1e-2635.42Show/hide
Query:  ATSSNSLETMTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPK--GKEHCHTLTLRSGKQLLPCEPAPVYEDSN-KD
        A +SNSLE++        K YMAKNDA++QSQA +L+NLE QVGQLA++L+NR  GALPS+T  P+  GKEHC  LTLRS K + P       E +N +D
Subjt:  ATSSNSLETMTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQLASKLKNRPTGALPSNTGAPK--GKEHCHTLTLRSGKQLLPCEPAPVYEDSN-KD

Query:  TTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLKPTS-------PQFSSPDFI-----------------------ILEYEADMEVPIILGRPFLATGQTL
                E   + E   TKP K    P  S +PT+       P+ + P+ +                         ++E D EVPIILGRPFLATG+T+
Subjt:  TTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLKPTS-------PQFSSPDFI-----------------------ILEYEADMEVPIILGRPFLATGQTL

Query:  IDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQEEGEGESLPVIISSKLNLSQESLLMQVLARHKGAI
        IDVQ GELTM +                                              ++   +Q +   E  P     KL L               AI
Subjt:  IDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQEEGEGESLPVIISSKLNLSQESLLMQVLARHKGAI

Query:  GWSLADIKGINPSYCMHKI
        GW++ADI+GI+PS CMHKI
Subjt:  GWSLADIKGINPSYCMHKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAAGCTTGCATTCAGGAGAAAGAGTAGAAGTGAAATCAAAGCTGAAAGGAGGACAAAGAACAAGAATGACCAAATATTTTCCAAGAGGTCGAGGACAAGGTCCAT
GGACGCCTCTCCTGCAGCTCCTTTTATCGTCTCACTTGCCAAGCCGAAGGCTAAATCACCCAAGGTTCCATCTCCTAAAAATCCATTCCCCGAGAGTTATCTCATAGTAC
AAGTGGCAGGAGTTCTGTGCTCACCCTTAGGAGGCTGTAGGGCCTTTAGGTCGAGAGTTTTACGCCGACCTGAGGGAGGAAAGCATAAGTATAGCGGTGGTGAGAGGCAA
GATGGTCAGCTTCTCTTCAGTAGACATCAACCGGGTGTACAGAATCAAGGCCCCCTACATCCAAGAGGGAATGATGTCATTAGGAACCCCTCGACCAAGAAGATGAAAGA
AGCACTAAAATTAGTGGCCAACAAGGGAGTTCAATGGAAAGAGTGCCAGACGAAGGTGAAGACTCTAGTGCCCATCGATCTAAAGTCAGAATCGGTAGTGTGGCTTCACA
TTCTGAAGTACCGATTAATGCCAACCACCCATGATAGCACTATCTCAGTAGATAGAGTTATGCTCCTCTACTGTATTATGAAGGGGTTGGAGATCAACATAGGGAGCATT
ATCAGGGAGGAGATTCTCGCTTGTGGAAGGAAAAGAGCGGGGAAGCTTTTCTTTGGATCATTGATGACCCAGCTTTGCCAGAGGGTGAAGATGGTTCCTGGAAAGGATGA
GGAGCGCCACTTCTTCAAGTCAACCATCGACCTATCCTTGATCGGGAAGCTTCAACAGAATAACCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGTCACTCCACCAT
CAGGGTTGAACATGGCTTCTCCATCCCATCACACTCCTTTTTCAAGGCCCTCACCATCAACTGAAGCCCTAGCAAATGCCTACAGACAGCTAGATCAAGTCAGGGACAAC
CTGATGACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAAAGTGCTGTCACTCTATCGCCCCGAGTATTGCTTCGATCTTTCCCAATTTCCCTCAATCGCT
GCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGGACAATTTCAGCGCACCACCAACATAAGTGGCAACAACCAAGGAGCAACATCATCAAACTCGCTCGAAACAA
TGACGAAGGAGTTAATGTCAAGTAAAAAAGAATACATGGCCAAAAACGATGCTGTAGTCCAAAGCCAAGCCACATCTTTACGAAATTTGGAGATACAAGTAGGACAACTT
GCATCAAAACTAAAGAACAGACCTACCGGAGCACTGCCAAGCAACACAGGAGCCCCAAAAGGAAAAGAACACTGCCACACACTGACCCTCCGTAGTGGAAAACAGCTTCT
GCCTTGTGAACCTGCGCCAGTGTATGAAGATTCAAATAAGGATACGACCAGACCAAATAATGTTCAGGAGCAGGAAGATAACACAGAGGCGGCTGAAACAAAGCCCAAGA
AGAAAGAAAAAAGGCCAAAAAAGAGCCTAAAGCCTACAAGCCCACAATTCTCTTCCCCTGACTTCATTATTCTTGAGTATGAAGCAGACATGGAGGTTCCTATTATCTTG
GGAAGACCTTTCCTAGCAACCGGCCAGACACTTATTGATGTTCAAACGGGAGAACTCACCATGGGCATGAATGACCAGAAAGTAGTCTTTAATGTGTTCACTGCATTCAA
ATACTCAGACGAGCCTAAAGGAAGCGAAACTGTTGCATGCATAGATGTTTGTTCACCGATGGAGATTTTGCAAATGATCGTGCTCAAAGCTGCGCGGAAGGAAGAACAGG
ACAACCAGGAAGAAGGCGAAGGAGAATCTCTGCCAGTGATCATCTCTTCAAAACTGAATTTATCTCAAGAGAGCTTATTAATGCAGGTGTTAGCTAGACACAAAGGTGCT
ATTGGATGGAGTCTTGCAGACATCAAGGGAATCAACCCTTCATACTGCATGCACAAAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAAGCTTGCATTCAGGAGAAAGAGTAGAAGTGAAATCAAAGCTGAAAGGAGGACAAAGAACAAGAATGACCAAATATTTTCCAAGAGGTCGAGGACAAGGTCCAT
GGACGCCTCTCCTGCAGCTCCTTTTATCGTCTCACTTGCCAAGCCGAAGGCTAAATCACCCAAGGTTCCATCTCCTAAAAATCCATTCCCCGAGAGTTATCTCATAGTAC
AAGTGGCAGGAGTTCTGTGCTCACCCTTAGGAGGCTGTAGGGCCTTTAGGTCGAGAGTTTTACGCCGACCTGAGGGAGGAAAGCATAAGTATAGCGGTGGTGAGAGGCAA
GATGGTCAGCTTCTCTTCAGTAGACATCAACCGGGTGTACAGAATCAAGGCCCCCTACATCCAAGAGGGAATGATGTCATTAGGAACCCCTCGACCAAGAAGATGAAAGA
AGCACTAAAATTAGTGGCCAACAAGGGAGTTCAATGGAAAGAGTGCCAGACGAAGGTGAAGACTCTAGTGCCCATCGATCTAAAGTCAGAATCGGTAGTGTGGCTTCACA
TTCTGAAGTACCGATTAATGCCAACCACCCATGATAGCACTATCTCAGTAGATAGAGTTATGCTCCTCTACTGTATTATGAAGGGGTTGGAGATCAACATAGGGAGCATT
ATCAGGGAGGAGATTCTCGCTTGTGGAAGGAAAAGAGCGGGGAAGCTTTTCTTTGGATCATTGATGACCCAGCTTTGCCAGAGGGTGAAGATGGTTCCTGGAAAGGATGA
GGAGCGCCACTTCTTCAAGTCAACCATCGACCTATCCTTGATCGGGAAGCTTCAACAGAATAACCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGTCACTCCACCAT
CAGGGTTGAACATGGCTTCTCCATCCCATCACACTCCTTTTTCAAGGCCCTCACCATCAACTGAAGCCCTAGCAAATGCCTACAGACAGCTAGATCAAGTCAGGGACAAC
CTGATGACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAAAGTGCTGTCACTCTATCGCCCCGAGTATTGCTTCGATCTTTCCCAATTTCCCTCAATCGCT
GCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGGACAATTTCAGCGCACCACCAACATAAGTGGCAACAACCAAGGAGCAACATCATCAAACTCGCTCGAAACAA
TGACGAAGGAGTTAATGTCAAGTAAAAAAGAATACATGGCCAAAAACGATGCTGTAGTCCAAAGCCAAGCCACATCTTTACGAAATTTGGAGATACAAGTAGGACAACTT
GCATCAAAACTAAAGAACAGACCTACCGGAGCACTGCCAAGCAACACAGGAGCCCCAAAAGGAAAAGAACACTGCCACACACTGACCCTCCGTAGTGGAAAACAGCTTCT
GCCTTGTGAACCTGCGCCAGTGTATGAAGATTCAAATAAGGATACGACCAGACCAAATAATGTTCAGGAGCAGGAAGATAACACAGAGGCGGCTGAAACAAAGCCCAAGA
AGAAAGAAAAAAGGCCAAAAAAGAGCCTAAAGCCTACAAGCCCACAATTCTCTTCCCCTGACTTCATTATTCTTGAGTATGAAGCAGACATGGAGGTTCCTATTATCTTG
GGAAGACCTTTCCTAGCAACCGGCCAGACACTTATTGATGTTCAAACGGGAGAACTCACCATGGGCATGAATGACCAGAAAGTAGTCTTTAATGTGTTCACTGCATTCAA
ATACTCAGACGAGCCTAAAGGAAGCGAAACTGTTGCATGCATAGATGTTTGTTCACCGATGGAGATTTTGCAAATGATCGTGCTCAAAGCTGCGCGGAAGGAAGAACAGG
ACAACCAGGAAGAAGGCGAAGGAGAATCTCTGCCAGTGATCATCTCTTCAAAACTGAATTTATCTCAAGAGAGCTTATTAATGCAGGTGTTAGCTAGACACAAAGGTGCT
ATTGGATGGAGTCTTGCAGACATCAAGGGAATCAACCCTTCATACTGCATGCACAAAATCTGA
Protein sequenceShow/hide protein sequence
MLKLAFRRKSRSEIKAERRTKNKNDQIFSKRSRTRSMDASPAAPFIVSLAKPKAKSPKVPSPKNPFPESYLIVQVAGVLCSPLGGCRAFRSRVLRRPEGGKHKYSGGERQ
DGQLLFSRHQPGVQNQGPLHPRGNDVIRNPSTKKMKEALKLVANKGVQWKECQTKVKTLVPIDLKSESVVWLHILKYRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSI
IREEILACGRKRAGKLFFGSLMTQLCQRVKMVPGKDEERHFFKSTIDLSLIGKLQQNNLQRKDKASTSQVTPPSGLNMASPSHHTPFSRPSPSTEALANAYRQLDQVRDN
LMTYWAYAKERDEAIRKCCHSIAPSIASIFPNFPQSLLPQEDKDSDEEEGQFQRTTNISGNNQGATSSNSLETMTKELMSSKKEYMAKNDAVVQSQATSLRNLEIQVGQL
ASKLKNRPTGALPSNTGAPKGKEHCHTLTLRSGKQLLPCEPAPVYEDSNKDTTRPNNVQEQEDNTEAAETKPKKKEKRPKKSLKPTSPQFSSPDFIILEYEADMEVPIIL
GRPFLATGQTLIDVQTGELTMGMNDQKVVFNVFTAFKYSDEPKGSETVACIDVCSPMEILQMIVLKAARKEEQDNQEEGEGESLPVIISSKLNLSQESLLMQVLARHKGA
IGWSLADIKGINPSYCMHKI