; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028010 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028010
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed DNA polymerase
Genome locationchr8:10312947..10317978
RNA-Seq ExpressionLag0028010
SyntenyLag0028010
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21706.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.5e-3133.99Show/hide
Query:  GKT-QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEK
        GKT QVKACG+C+ T+H T  C  LQE    + NA+GG+      QR Y+ Y NTYN GWRD PNFSYG    ++ N     +P           Q+ EK
Subjt:  GKT-QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEK

Query:  QKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEFL--------------------------------------------------------NLGIKV
        + L+           RK++VNI LL+ + QIP Y +FL                                                        +LG  +
Subjt:  QKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEFL--------------------------------------------------------NLGIKV

Query:  RLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTF
         + P +     ++  +K+ GV+IQLADRS + P G++E+VLV+V  L+FP +FYVL+MS   +S +S  +LLG+PF++T+R  IDV +G +++ F     
Subjt:  RLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTF

Query:  KFKVFD
        KF ++D
Subjt:  KFKVFD

XP_015385876.1 uncharacterized protein LOC107177106 [Citrus sinensis]1.5e-2730.03Show/hide
Query:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQ----------KPQVFNQNQATEPRESNDSDGHT
        Q++ CG+CS+  H T  C  LQE    + NA+ G+     +++ YN Y N YN GW+DHPNF YGNQ          +P  + Q +  +P +        
Subjt:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQ----------KPQVFNQNQATEPRESNDSDGHT

Query:  NQ-------QTEKQKLKKNSL--------------HSQRSIL---RKLQVNISLLEVVAQIPSYIEFL--------------NLGIKVRLSPRARRRESS
        NQ        T   ++K  S+                ++ IL   RK++VNI LL+ + Q+P Y +FL               + +   +S   +R+   
Subjt:  NQ-------QTEKQKLKKNSL--------------HSQRSIL---RKLQVNISLLEVVAQIPSYIEFL--------------NLGIKVRLSPRARRRESS

Query:  LTN----IKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD--
               ++++G++IQLADRS   P G++E+VLV+V  LVFP +FY+L+M    S + +  +LLGRPF+KTAR  IDV +G +++ F     +F +F+  
Subjt:  LTN----IKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD--

Query:  ----DVVS----SQVSTL--EYYVVDSDPVFDQSSNVKSVMVSDNETHSAVIR
            DV S      ++TL  +++ +  +  F+ + + K++   D++ H+ +I+
Subjt:  ----DVVS----SQVSTL--EYYVVDSDPVFDQSSNVKSVMVSDNETHSAVIR

XP_027064430.1 uncharacterized protein LOC113690635 [Coffea arabica]2.3e-2831.44Show/hide
Query:  SRGEEAAAFSFVGALLAKNG-----QVYNEGKTQVKACGLCSITSHTTYECLQLQ-ESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQK
        S G   AA + +G  L ++G     Q+     +Q K CG+C+   H+T  C  +Q ES+E   + G+A     ++PY+ Y NTYN GWRDH NFSYG  +
Subjt:  SRGEEAAAFSFVGALLAKNG-----QVYNEGKTQVKACGLCSITSHTTYECLQLQ-ESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQK

Query:  PQVFNQNQATEPRESNDSDGHTNQQTEKQKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF-----------------------------------
           F  N+           G+  QQ EK K K+++      +L K+++NI LL+ + Q+P Y +F                                   
Subjt:  PQVFNQNQATEPRESNDSDGHTNQQTEKQKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF-----------------------------------

Query:  ---------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLL
                             LNLG  + + P++     +L  +K+ G++IQLADR+   P  +V++VLVK+  LVF  +FYVLD+    S   S  LLL
Subjt:  ---------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLL

Query:  GRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD
        GRPF  TA+  +DV++G +S+ F ++   F +FD
Subjt:  GRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD

XP_031096917.1 uncharacterized protein LOC116001167 [Ipomoea triloba]8.2e-2629.33Show/hide
Query:  GKTQVKACGLCSITSHTTYEC--LQLQESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQV--FNQNQATEPRESNDSDGHTN----
        G  QVKACG+C+ T H T  C  LQ     + NA+GG+      QR Y+ Y NTYN GWRDHPNFSY   +PQ   F   Q  + + S+     +N    
Subjt:  GKTQVKACGLCSITSHTTYEC--LQLQESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQV--FNQNQATEPRESNDSDGHTN----

Query:  ------------------------------------------QQTEK--------QKLKKNS-------------------LHSQRSIL---RKLQVNIS
                                                  Q+ EK        QKL K S                      Q+ IL   RK++VNI 
Subjt:  ------------------------------------------QQTEK--------QKLKKNS-------------------LHSQRSIL---RKLQVNIS

Query:  LLEVVAQIPSYIEF--------------------------------------------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLI
        LL+ + QIP Y +F                                                        L+LG  + + P +     ++ ++K++GV++
Subjt:  LLEVVAQIPSYIEF--------------------------------------------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLI

Query:  QLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVF------DDVVSSQVSTLE
        QLADRS + P+G++E+VLV+V+GL+FP +FYVLDM    S +SS  +LLGRPF+KT++  IDV +  +++ F     KF ++      DDV  S VS ++
Subjt:  QLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVF------DDVVSSQVSTLE

Query:  ---------YYVVDSD
                  Y+ DSD
Subjt:  ---------YYVVDSD

XP_031282520.1 uncharacterized protein LOC116141123 [Pistacia vera]1.0e-2830.99Show/hide
Query:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKP-QVFNQNQATEPRESNDSDGHT---------
        QVKACG+C+   H T  CL L++      N  GG+   +  QR Y+ Y NTYN GWRDHPNFSYG + P Q F       P +   +   T         
Subjt:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKP-QVFNQNQATEPRESNDSDGHT---------

Query:  -------NQQTEKQ--------------------KLKKNSLHSQR--------------SILRKLQVNISLLEVVAQIPSYIEF----------------
                ++ EK+                     L KN   S R                 RK++VNI LL+ + Q+P Y +F                
Subjt:  -------NQQTEKQ--------------------KLKKNSLHSQR--------------SILRKLQVNISLLEVVAQIPSYIEF----------------

Query:  ----------------------------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEF
                                                L+LG  + + P +     ++  +K++GV+IQLADRS + P G+VE+VLV+V GLVFP +F
Subjt:  ----------------------------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEF

Query:  YVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFDDV
        YVLDM     SS+S  +LLGRPF+KT+R  IDV +G++++ F     KF ++D +
Subjt:  YVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFDDV

TrEMBL top hitse value%identityAlignment
A0A2G9HSE5 Retrotrans_gag domain-containing protein4.4e-2528.77Show/hide
Query:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGN--------------QKPQVFNQNQATEPRESNDS
        QVKACG+C+   H T  C  LQE      NA+GG+      QR Y+ Y NTYN GWRDHPN                   +K  +   +Q+ +P     S
Subjt:  QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGN--------------QKPQVFNQNQATEPRESNDS

Query:  DGHTNQQTEKQKLKKNSLHSQRSIL---RKLQVNISLLEVVAQIPSYIEFL-------------------------------------------------
            +    ++  K       + IL   RK++VNI LL+ + +IP Y +FL                                                 
Subjt:  DGHTNQQTEKQKLKKNSLHSQRSIL---RKLQVNISLLEVVAQIPSYIEFL-------------------------------------------------

Query:  -------NLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVD
               +LG  + + P +     ++  +K++GV+ QLADRS +   G+ E+VLV+V G VFP +F+VL+M T  +S  S  +LLGRPF++T+R  IDV 
Subjt:  -------NLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVD

Query:  EGVISVGFQDRTFKFKVFDDVVSSQVSTLEYYVVDSDPVFDQSSNVKSVMV
        EG +++ F     KF ++ D+  +  +    + VD    F Q +   ++ +
Subjt:  EGVISVGFQDRTFKFKVFDDVVSSQVSTLEYYVVDSDPVFDQSSNVKSVMV

A0A2G9HW16 DNA-directed DNA polymerase4.1e-3133.99Show/hide
Query:  GKT-QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEK
        GKT QVKACG+C+ T+H T  C  LQE    + NA+GG+      QR Y+ Y NTYN GWRD PNFSYG    ++ N     +P           Q+ EK
Subjt:  GKT-QVKACGLCSITSHTTYECLQLQESI--EVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEK

Query:  QKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEFL--------------------------------------------------------NLGIKV
        + L+           RK++VNI LL+ + QIP Y +FL                                                        +LG  +
Subjt:  QKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEFL--------------------------------------------------------NLGIKV

Query:  RLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTF
         + P +     ++  +K+ GV+IQLADRS + P G++E+VLV+V  L+FP +FYVL+MS   +S +S  +LLG+PF++T+R  IDV +G +++ F     
Subjt:  RLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTF

Query:  KFKVFD
        KF ++D
Subjt:  KFKVFD

A0A6I9UKS2 uncharacterized protein LOC1051791654.0e-2628.85Show/hide
Query:  QVKACGLCSITSHTTYECLQL-QESIE-VNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEKQKLK
        Q K CG+C+ + H T  C  L +ES E  +A+G +++    QR Y+ + NTYN GWRDHPN S    + ++         +  +   G T ++ EK K  
Subjt:  QVKACGLCSITSHTTYECLQL-QESIE-VNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEKQKLK

Query:  KNSLHS--------------------QRSI---LRKLQVNISLLEVVAQIPSYIEFL-------------------------------------------
         N + +                    +R I    RK++VNI LL+ + QIP Y +FL                                           
Subjt:  KNSLHS--------------------QRSI---LRKLQVNISLLEVVAQIPSYIEFL-------------------------------------------

Query:  -------------NLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTAR
                     +LG  + + P    +  ++  +K++GV+IQLADRS + P G++E+VLV+V  LVFP +F+V+DM    +S +S S+LLGRPF+KTAR
Subjt:  -------------NLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTAR

Query:  VVIDVDEGVISVGFQDRTFKFKVFDDVVSSQVSTLEYYVVDSDPVFDQSSNVKSVMVSDNETHS
          IDV  G +++ F     KF +++ +         +++    PV  + +N       DNE H+
Subjt:  VVIDVDEGVISVGFQDRTFKFKVFDDVVSSQVSTLEYYVVDSDPVFDQSSNVKSVMVSDNETHS

A0A6P6SA48 uncharacterized protein LOC1136890726.8e-2634.4Show/hide
Query:  QVKACGLCSITSHTTYECLQLQE--SIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEKQKLK
        Q K CG+C+   H T  CL LQE  + +VN   G       +R Y+ Y NTYN GWRD+PNFSYGN+    F           N   G      EK K K
Subjt:  QVKACGLCSITSHTTYECLQLQE--SIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEKQKLK

Query:  KNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF--------------------------------------------------------LNLGIKVRLSP
                 + RK+QVNI LL+ + Q+P Y +F                                                        L+LG+ +   P
Subjt:  KNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF--------------------------------------------------------LNLGIKVRLSP

Query:  RARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDV
        ++     +L  +K++G++IQLAD +   P GIVENVLV+V+GL+FPV+FYVL M    S+S+S+ ++LGRPF+ TA+  IDV
Subjt:  RARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDV

A0A6P6SE67 uncharacterized protein LOC1136906351.1e-2831.44Show/hide
Query:  SRGEEAAAFSFVGALLAKNG-----QVYNEGKTQVKACGLCSITSHTTYECLQLQ-ESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQK
        S G   AA + +G  L ++G     Q+     +Q K CG+C+   H+T  C  +Q ES+E   + G+A     ++PY+ Y NTYN GWRDH NFSYG  +
Subjt:  SRGEEAAAFSFVGALLAKNG-----QVYNEGKTQVKACGLCSITSHTTYECLQLQ-ESIEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQK

Query:  PQVFNQNQATEPRESNDSDGHTNQQTEKQKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF-----------------------------------
           F  N+           G+  QQ EK K K+++      +L K+++NI LL+ + Q+P Y +F                                   
Subjt:  PQVFNQNQATEPRESNDSDGHTNQQTEKQKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEF-----------------------------------

Query:  ---------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLL
                             LNLG  + + P++     +L  +K+ G++IQLADR+   P  +V++VLVK+  LVF  +FYVLD+    S   S  LLL
Subjt:  ---------------------LNLGIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLL

Query:  GRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD
        GRPF  TA+  +DV++G +S+ F ++   F +FD
Subjt:  GRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTTGACATTCTTGCTCTTGCTCTTAGATTCTCAAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTCCCACAAGTCTTTTGCCTCCTAGA
CTTAGAGACATACCGGTGTAACCTCTGTGGTTATTGTGTCATTCAAAGAAGAAAATTCCAGCGATCAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTCGTTGGAGCGTTGT
TGGCGAAGAACGGTCAAGTCTACAACGAAGGTAAAACACAGGTGAAGGCTTGTGGGTTGTGTTCAATAACCTCTCACACTACTTATGAATGTCTCCAGCTCCAGGAAAGC
ATCGAAGTCAATGCCATAGGAGGATACGCTAGAAACAACGGGAACCAAAGGCCTTACAACTCATACGGGAACACCTACAATTCGGGGTGGCGTGACCATCCAAATTTCAG
CTATGGGAATCAGAAGCCCCAGGTGTTCAATCAGAATCAAGCCACAGAACCTAGGGAGTCAAATGACTCAGATGGCCACACAAATCAGCAAACTGAAAAACAGAAGCTCA
AGAAAAACTCCTTGCATAGTCAGAGGTCAATCCTAAGGAAGTTGCAGGTGAACATCTCCCTTCTCGAGGTGGTAGCACAAATTCCATCCTATATCGAGTTTCTAAATCTT
GGTATCAAGGTAAGGTTAAGTCCTCGGGCAAGGAGGAGGGAATCAAGTTTAACAAACATAAAGAAATCAGGAGTTTTGATTCAATTAGCTGATAGGTCATGCATTAGACC
TTTAGGCATAGTTGAGAATGTCCTTGTTAAGGTTGAAGGCTTAGTATTTCCTGTCGAATTCTATGTGTTGGATATGTCTACTCATGCATCCTCCTCTTCATCTGCATCAT
TACTCTTGGGTCGTCCATTTATGAAGACAGCTAGAGTTGTGATAGATGTGGATGAGGGAGTAATTTCTGTAGGATTTCAGGATAGGACTTTCAAGTTTAAGGTTTTTGAT
GATGTTGTTAGTTCTCAAGTCTCAACTTTAGAATATTATGTGGTAGACTCAGATCCGGTGTTTGATCAATCTAGTAATGTTAAGTCTGTCATGGTTTCTGATAATGAGAC
TCATTCTGCTGTTATTAGGACCCCAGTTTTTTTATGGTTAGCAGCTTTGAAAGACAGACCCAAGCGCCATCCAACCTTTAGCTTCAGTTCAGGGGAGTTTCTTTTCTCCC
TTTTCTATTTATTGTCTCTGGTTATTCATATCTACTATTACATTGAGGGCAATGATGTTTTCGAGCTATTTTGGAGTCCCCGAGACGACGTGGAGCCAAAGTCATCGACC
AGAGCTCTAGAACACCGTCATTGGAGCCAAATGGACCGAAAAGAAATTGCACAAATTGGGCAAATTCGAGAGCAAGCGTCTCGACGCTATAATGGCAGCGTCGAGATGCT
GCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACTTGACATTCTTGCTCTTGCTCTTAGATTCTCAAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTCCCACAAGTCTTTTGCCTCCTAGA
CTTAGAGACATACCGGTGTAACCTCTGTGGTTATTGTGTCATTCAAAGAAGAAAATTCCAGCGATCAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTCGTTGGAGCGTTGT
TGGCGAAGAACGGTCAAGTCTACAACGAAGGTAAAACACAGGTGAAGGCTTGTGGGTTGTGTTCAATAACCTCTCACACTACTTATGAATGTCTCCAGCTCCAGGAAAGC
ATCGAAGTCAATGCCATAGGAGGATACGCTAGAAACAACGGGAACCAAAGGCCTTACAACTCATACGGGAACACCTACAATTCGGGGTGGCGTGACCATCCAAATTTCAG
CTATGGGAATCAGAAGCCCCAGGTGTTCAATCAGAATCAAGCCACAGAACCTAGGGAGTCAAATGACTCAGATGGCCACACAAATCAGCAAACTGAAAAACAGAAGCTCA
AGAAAAACTCCTTGCATAGTCAGAGGTCAATCCTAAGGAAGTTGCAGGTGAACATCTCCCTTCTCGAGGTGGTAGCACAAATTCCATCCTATATCGAGTTTCTAAATCTT
GGTATCAAGGTAAGGTTAAGTCCTCGGGCAAGGAGGAGGGAATCAAGTTTAACAAACATAAAGAAATCAGGAGTTTTGATTCAATTAGCTGATAGGTCATGCATTAGACC
TTTAGGCATAGTTGAGAATGTCCTTGTTAAGGTTGAAGGCTTAGTATTTCCTGTCGAATTCTATGTGTTGGATATGTCTACTCATGCATCCTCCTCTTCATCTGCATCAT
TACTCTTGGGTCGTCCATTTATGAAGACAGCTAGAGTTGTGATAGATGTGGATGAGGGAGTAATTTCTGTAGGATTTCAGGATAGGACTTTCAAGTTTAAGGTTTTTGAT
GATGTTGTTAGTTCTCAAGTCTCAACTTTAGAATATTATGTGGTAGACTCAGATCCGGTGTTTGATCAATCTAGTAATGTTAAGTCTGTCATGGTTTCTGATAATGAGAC
TCATTCTGCTGTTATTAGGACCCCAGTTTTTTTATGGTTAGCAGCTTTGAAAGACAGACCCAAGCGCCATCCAACCTTTAGCTTCAGTTCAGGGGAGTTTCTTTTCTCCC
TTTTCTATTTATTGTCTCTGGTTATTCATATCTACTATTACATTGAGGGCAATGATGTTTTCGAGCTATTTTGGAGTCCCCGAGACGACGTGGAGCCAAAGTCATCGACC
AGAGCTCTAGAACACCGTCATTGGAGCCAAATGGACCGAAAAGAAATTGCACAAATTGGGCAAATTCGAGAGCAAGCGTCTCGACGCTATAATGGCAGCGTCGAGATGCT
GCCCTGA
Protein sequenceShow/hide protein sequence
MTLDILALALRFSKTRPRILSSLPLRERLPQVFCLLDLETYRCNLCGYCVIQRRKFQRSRGEEAAAFSFVGALLAKNGQVYNEGKTQVKACGLCSITSHTTYECLQLQES
IEVNAIGGYARNNGNQRPYNSYGNTYNSGWRDHPNFSYGNQKPQVFNQNQATEPRESNDSDGHTNQQTEKQKLKKNSLHSQRSILRKLQVNISLLEVVAQIPSYIEFLNL
GIKVRLSPRARRRESSLTNIKKSGVLIQLADRSCIRPLGIVENVLVKVEGLVFPVEFYVLDMSTHASSSSSASLLLGRPFMKTARVVIDVDEGVISVGFQDRTFKFKVFD
DVVSSQVSTLEYYVVDSDPVFDQSSNVKSVMVSDNETHSAVIRTPVFLWLAALKDRPKRHPTFSFSSGEFLFSLFYLLSLVIHIYYYIEGNDVFELFWSPRDDVEPKSST
RALEHRHWSQMDRKEIAQIGQIREQASRRYNGSVEMLP