; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000534 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000534
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReplication factor C subunit 1
Genome locationchr4:9564377..9571512
RNA-Seq ExpressionLag0000534
SyntenyLag0000534
Gene Ontology termsGO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domainsIPR003593 - AAA+ ATPase domain
IPR003959 - ATPase, AAA-type, core
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR039309 - Biopterin transporter family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605957.1 Replication factor C subunit 1, partial [Cucurbita argyrosperma subsp. sororia]3.5e-4969.46Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI  IK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

XP_022995017.1 replication factor C subunit 1 isoform X1 [Cucurbita maxima]7.0e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

XP_022995018.1 replication factor C subunit 1 isoform X2 [Cucurbita maxima]7.0e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

XP_023532343.1 replication factor C subunit 1 isoform X1 [Cucurbita pepo subsp. pepo]5.3e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

XP_023532344.1 replication factor C subunit 1 isoform X2 [Cucurbita pepo subsp. pepo]5.3e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

TrEMBL top hitse value%identityAlignment
A0A6J1DHR4 Replication factor C subunit 12.3e-4666.06Show/hide
Query:  KGKPPTSNSSV----KVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK------------
        K +P  SN  +     VKQLHDWLAHWNENF D  SKKKGKKLNDS AKK V LCGGPGIGKTTSAKLVS+MLGY+AI+VNAS NR K            
Subjt:  KGKPPTSNSSV----KVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK------------

Query:  -------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                     L++R   PK PKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  -------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

A0A6J1H1H3 Replication factor C subunit 12.9e-4969.46Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PK PKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

A0A6J1H397 Replication factor C subunit 12.9e-4969.46Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PK PKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

A0A6J1JXK0 Replication factor C subunit 13.4e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

A0A6J1K6Q0 Replication factor C subunit 13.4e-5070.06Show/hide
Query:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------
        EK K K P     N S+ VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKK + LCGGPGIGKTTSAKLVS+MLGYEAI+VNAS NR K          
Subjt:  EKGKGKPPT---SNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK----------

Query:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                       L++R   PKHPKTVLIMDEVDGMSAGDRGGVADLI SIK SKIPIICICNDR
Subjt:  ---------------LNYR---PKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

SwissProt top hitse value%identityAlignment
O60182 Replication factor C subunit 19.5e-1842.07Show/hide
Query:  VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK---------------------LNYRP--
        V++L  WL  +++N     +K     L   G  K V L G PGIGKTT+A LV+K+ GY+ +++NAS  R K                         P  
Subjt:  VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK---------------------LNYRP--

Query:  -KHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
            + VLIMDE+DGMS+GDRGGV  L + IK S IPIICICNDR
Subjt:  -KHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

P35600 Replication factor C subunit 11.2e-1738.46Show/hide
Query:  IEKHAGEKGKGKPPTSNSSVKVKQLHDWLAHWNENFLDGGSKKK----GKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY
        ++KH     K     + ++  V +L +WL+ W  N  DG  K +      K +D    K   L G PGIGKTT+A LV K LG++A++ NAS  R K   
Subjt:  IEKHAGEKGKGKPPTSNSSVKVKQLHDWLAHWNENFLDGGSKKK----GKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY

Query:  RPK---------------------HPKTVLIMDEVDGMSAG-DRGGVADLIVSIKISKIPIICICNDRD
        + +                       K VLIMDEVDGM+   DRGG+ +LI  IK S IPIIC+CNDR+
Subjt:  RPK---------------------HPKTVLIMDEVDGMSAG-DRGGVADLIVSIKISKIPIICICNDRD

P38630 Replication factor C subunit 15.2e-1639.19Show/hide
Query:  VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNAS--------------------------YNRVKLN
        V +L +WLA+W EN      K  GK  + SG  +   L G PGIGKTT+A LV++ LGY+ ++ NAS                          +N    N
Subjt:  VKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNAS--------------------------YNRVKLN

Query:  YRPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDRD
           KH   V+IMDEVDGMS GDRGGV  L    + +  P+I ICN+R+
Subjt:  YRPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDRD

Q2R2B4 Replication factor C subunit 11.3e-3857.23Show/hide
Query:  EKGKGKPPTS--NSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK-----------
        EK + K P     +   VKQLHDWL  W + FL  G K KGKK  DSGAKK V L G PGIGKTT+AK+VS+MLG +AI+VNAS +R K           
Subjt:  EKGKGKPPTS--NSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVK-----------

Query:  --------------LNY---RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                      LNY   R K PK VL+MDEVDGMSAGDRGGVADLI SIK+SKIPIICICNDR
Subjt:  --------------LNY---RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

Q9C587 Replication factor C subunit 15.6e-4259.28Show/hide
Query:  EKGKGKPP---TSNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY-------
        EK + K P     N S+ V QLH+WL+HW++ F   GSK KGKKLND+G+KK V L G PGIGKTTSAKLVS+MLG++A++VNAS +R K N        
Subjt:  EKGKGKPP---TSNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY-------

Query:  ---------------------RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                             R KHPKTVLIMDEVDGMSAGDRGGVADLI SIKISKIPIICICNDR
Subjt:  ---------------------RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR

Arabidopsis top hitse value%identityAlignment
AT1G04730.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.1e-0824.81Show/hide
Query:  VVRWRVIVDKYRAQIVSKASTFVAKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEE--EEIEKHAGEKGKGKPPTSNSSVKVKQLHDWLAH-------
        V+  ++ VDKY        S+F  + L +++   E+LL   Q +  +   +I    E     +++H+      K  + ++  + KQ + W          
Subjt:  VVRWRVIVDKYRAQIVSKASTFVAKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEE--EEIEKHAGEKGKGKPPTSNSSVKVKQLHDWLAH-------

Query:  ---WNENFLD-GGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNAS-----------------YNRVKLNYRPKHPKTVLIMDEV
            N N  D      K  KL     +K++ LCG PG+GKTT A + +K  GY  +++NAS                  N V  + RPK     L++DE+
Subjt:  ---WNENFLD-GGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNAS-----------------YNRVKLNYRPKHPKTVLIMDEV

Query:  DGMSAGDRGGVADLIVSIKISK---------------------------IPIICICND
        DG + GD  G  D+I+ + +++                            P+ICICND
Subjt:  DGMSAGDRGGVADLIVSIKISK---------------------------IPIICICND

AT2G32040.1 Major facilitator superfamily protein1.2e-1572.58Show/hide
Query:  DFVPLFGYRRRSYLILSRLVGAFSLSFMATIVNSKYGTATCILLGSLSVAFSDVDSDSNTEE
        D VPLFGYRRRSYL+LS L+GAFS S MA  V+SKY  A CILLGSLSVAFSDV  DS   E
Subjt:  DFVPLFGYRRRSYLILSRLVGAFSLSFMATIVNSKYGTATCILLGSLSVAFSDVDSDSNTEE

AT5G22010.1 replication factor C13.9e-4359.28Show/hide
Query:  EKGKGKPP---TSNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY-------
        EK + K P     N S+ V QLH+WL+HW++ F   GSK KGKKLND+G+KK V L G PGIGKTTSAKLVS+MLG++A++VNAS +R K N        
Subjt:  EKGKGKPP---TSNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNY-------

Query:  ---------------------RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR
                             R KHPKTVLIMDEVDGMSAGDRGGVADLI SIKISKIPIICICNDR
Subjt:  ---------------------RPKHPKTVLIMDEVDGMSAGDRGGVADLIVSIKISKIPIICICNDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAATGAATTTGAGATGAGTATGATGGGAGAACTTAGCTTCTTCCTTGGACTTCAAATCAAAAAACTCAAGGATGGTATTTTCATAAGTCAAGAGAAACACACAAG
GGATTTGCTCAAAGGGTTCAAATTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACATCCAGTAAGCTTGACAAGGATGAAAAAGGTTTATGGTATCCTAGAAATG
TTGAATTTAATTTGGTAGGATATTCCGACGCGGATTTTGCAGCCGTCAAAACTGTCTCCCTTCTTCAACCTTCGGGTGCGAAAGGTTCAACCCCCATTCTTGAATCTTTC
TGGGTTGTTCTTAATCAAAAGAAAAAATCGTTCATTTTCTTCTTCGTAATTTTATTTTGCTTCTATGGAGGAGGTTCTGGTCGAGTCGGCCATAGGGGATTTAATTGGAG
AAACAGAGTTGTCCGTTGGCGAGTTATTGTTGACAAATATCGGGCACAAATCGTGTCGAAGGCTTCTACATTTGTCGCAAAGGCTCTCGAGGAAGACAAGGAAAAACTCG
AGATCCTCCTCTCCACAGCTCAAGGGGAGGTCATGTTGTTGTCGGATCAAATCGGTTATCTTGTGGAGGAGGAAGAAATAGAGAAACATGCTGGAGAGAAAGGGAAAGGG
AAACCACCGACTAGCAATTCCTCTGTGAAGGTCAAACAACTTCATGATTGGTTGGCACATTGGAATGAAAACTTCCTTGATGGTGGAAGCAAAAAGAAGGGTAAAAAGCT
CAACGATTCTGGTGCCAAAAAAGTTGTCTCGTTATGTGGAGGTCCTGGCATAGGTAAAACTACATCGGCTAAATTGGTTAGCAAGATGCTTGGTTATGAGGCTATAAAGG
TAAATGCCAGCTATAATCGGGTTAAGTTGAATTATAGGCCAAAACATCCCAAAACTGTGTTGATTATGGACGAGGTAGATGGAATGTCTGCTGGAGATAGGGGTGGAGTT
GCTGATCTGATTGTGAGCATTAAAATCTCCAAAATTCCAATTATTTGCATATGTAATGACCGTGACTTTGTTCCTCTTTTTGGTTACCGAAGAAGGTCCTACTTAATTTT
ATCAAGGCTCGTTGGTGCGTTCTCATTGAGCTTTATGGCTACCATTGTTAATAGCAAGTATGGTACTGCTACGTGTATACTTCTCGGGTCTCTTTCTGTGGCGTTTTCTG
ATGTTGATTCAGACTCTAACACTGAGGAACTTGAATTGTTAGATCCATCACCAGCTAATACAGTGGTCGGCAAAATTTTTGTGTCATGTTTTGTCGAGCGTCGCGACGCT
ATAGGGGTAGCGTCGCGACGCTACTGCCTGCCGCAGATTCTGGAATTGGAAAATTACAAGCGTCGAGACGCTATGGTCACAGCATCGAGACGCTGTGACCTGTTCGCGCC
AATTTCAAAGATGGAATGCCAGCGTCGAGAAGCCCAAAGAGCAGCGTCTCGATGCTGGTCTGCGGATGAGCCCCTTTCTCAATCTTCCAACTTGAATCTCTTTCGTTCCT
TTGGCTTCTTTTCGCATCCCGAACATCCATTTGACTCCAATTTCTCAATTATGACTTGGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAATGAATTTGAGATGAGTATGATGGGAGAACTTAGCTTCTTCCTTGGACTTCAAATCAAAAAACTCAAGGATGGTATTTTCATAAGTCAAGAGAAACACACAAG
GGATTTGCTCAAAGGGTTCAAATTCAATGAAGGTAAAATTGCAAAAACTCCTATGAGCACATCCAGTAAGCTTGACAAGGATGAAAAAGGTTTATGGTATCCTAGAAATG
TTGAATTTAATTTGGTAGGATATTCCGACGCGGATTTTGCAGCCGTCAAAACTGTCTCCCTTCTTCAACCTTCGGGTGCGAAAGGTTCAACCCCCATTCTTGAATCTTTC
TGGGTTGTTCTTAATCAAAAGAAAAAATCGTTCATTTTCTTCTTCGTAATTTTATTTTGCTTCTATGGAGGAGGTTCTGGTCGAGTCGGCCATAGGGGATTTAATTGGAG
AAACAGAGTTGTCCGTTGGCGAGTTATTGTTGACAAATATCGGGCACAAATCGTGTCGAAGGCTTCTACATTTGTCGCAAAGGCTCTCGAGGAAGACAAGGAAAAACTCG
AGATCCTCCTCTCCACAGCTCAAGGGGAGGTCATGTTGTTGTCGGATCAAATCGGTTATCTTGTGGAGGAGGAAGAAATAGAGAAACATGCTGGAGAGAAAGGGAAAGGG
AAACCACCGACTAGCAATTCCTCTGTGAAGGTCAAACAACTTCATGATTGGTTGGCACATTGGAATGAAAACTTCCTTGATGGTGGAAGCAAAAAGAAGGGTAAAAAGCT
CAACGATTCTGGTGCCAAAAAAGTTGTCTCGTTATGTGGAGGTCCTGGCATAGGTAAAACTACATCGGCTAAATTGGTTAGCAAGATGCTTGGTTATGAGGCTATAAAGG
TAAATGCCAGCTATAATCGGGTTAAGTTGAATTATAGGCCAAAACATCCCAAAACTGTGTTGATTATGGACGAGGTAGATGGAATGTCTGCTGGAGATAGGGGTGGAGTT
GCTGATCTGATTGTGAGCATTAAAATCTCCAAAATTCCAATTATTTGCATATGTAATGACCGTGACTTTGTTCCTCTTTTTGGTTACCGAAGAAGGTCCTACTTAATTTT
ATCAAGGCTCGTTGGTGCGTTCTCATTGAGCTTTATGGCTACCATTGTTAATAGCAAGTATGGTACTGCTACGTGTATACTTCTCGGGTCTCTTTCTGTGGCGTTTTCTG
ATGTTGATTCAGACTCTAACACTGAGGAACTTGAATTGTTAGATCCATCACCAGCTAATACAGTGGTCGGCAAAATTTTTGTGTCATGTTTTGTCGAGCGTCGCGACGCT
ATAGGGGTAGCGTCGCGACGCTACTGCCTGCCGCAGATTCTGGAATTGGAAAATTACAAGCGTCGAGACGCTATGGTCACAGCATCGAGACGCTGTGACCTGTTCGCGCC
AATTTCAAAGATGGAATGCCAGCGTCGAGAAGCCCAAAGAGCAGCGTCTCGATGCTGGTCTGCGGATGAGCCCCTTTCTCAATCTTCCAACTTGAATCTCTTTCGTTCCT
TTGGCTTCTTTTCGCATCCCGAACATCCATTTGACTCCAATTTCTCAATTATGACTTGGCAATAA
Protein sequenceShow/hide protein sequence
MHNEFEMSMMGELSFFLGLQIKKLKDGIFISQEKHTRDLLKGFKFNEGKIAKTPMSTSSKLDKDEKGLWYPRNVEFNLVGYSDADFAAVKTVSLLQPSGAKGSTPILESF
WVVLNQKKKSFIFFFVILFCFYGGGSGRVGHRGFNWRNRVVRWRVIVDKYRAQIVSKASTFVAKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEEEEIEKHAGEKGKG
KPPTSNSSVKVKQLHDWLAHWNENFLDGGSKKKGKKLNDSGAKKVVSLCGGPGIGKTTSAKLVSKMLGYEAIKVNASYNRVKLNYRPKHPKTVLIMDEVDGMSAGDRGGV
ADLIVSIKISKIPIICICNDRDFVPLFGYRRRSYLILSRLVGAFSLSFMATIVNSKYGTATCILLGSLSVAFSDVDSDSNTEELELLDPSPANTVVGKIFVSCFVERRDA
IGVASRRYCLPQILELENYKRRDAMVTASRRCDLFAPISKMECQRREAQRAASRCWSADEPLSQSSNLNLFRSFGFFSHPEHPFDSNFSIMTWQ