CuGenDBv2

Sed0007563 (gene) of Chayote v1 genome

Gene ID: Sed0007563
Organism: Sechium edule (Chayote v1)
Description: UBP1-associated proteins 1C-like isoform X3
Genome location: LG08:3903353..3905980
RNA-Seq Expression: Sed0007563
Synteny: Sed0007563
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
  IPR013087 - Zinc finger C2H2-type
  IPR036236 - Zinc finger C2H2 superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo] (e value: 6.2e-46; % identity: 45.87)
Query:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS
        FRAIDNK P  A AAS+SD P QDDS N EL KQ+IKEEI +REIV RRMLEAEIRREL++E+EL + RA G T+G L+F      R +++ +N I+D S
Subjt:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS

Query:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT
        S      V GS SSLN          LKEE KP E E  K   L +PDP  F  K++     +A AA       D  + I P     SK  A  +F+C  
Subjt:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT

Query:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK
        CN+  TSEI+F  H+ GKKHK KEGR  Q     ++ P+  E+ +K           K PAL+     KF CE+C  G   M VM +H  GRKH AR LK
Subjt:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK

Query:  LGQ
        L Q
Subjt:  LGQ

XP_010047067.2 uncharacterized protein LOC104436029 [Eucalyptus grandis] (e value: 1.5e-23; % identity: 33.92)
Query:  DPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQR----LNHIVDQSSRAPPVSGSRSSLNS
        DP    ++   E+ KQKI+EEI+  E+  R+MLE E+RREL+LE+++V LR+ G   GL F   S       R    L+ +  ++          ++++ 
Subjt:  DPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQR----LNHIVDQSSRAPPVSGSRSSLNS

Query:  LPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKHKYKE-
         P+   PE    E+KP E+ K +  +L +PDP +  A KRKA  +P  A      + +   S K     ++ C  C ++ TSE   K HLEGKKHK KE 
Subjt:  LPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKHKYKE-

Query:  ------GRALQTGAVLNQRPSPAEKGIKAPKQPALQN-----KGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQGN
              G+   T + +NQ    ++   +      L+N     K  F+FWC MCQ G+ S+ VM++H  G+KH+AR  +LGQ +
Subjt:  ------GRALQTGAVLNQRPSPAEKGIKAPKQPALQN-----KGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQGN

XP_022926958.1 uncharacterized protein LOC111433915 [Cucurbita moschata] (e value: 2.1e-25; % identity: 68.1)
Query:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS
        RA+DNKP +AA+++  SSD P +DDS NAELVKQ+IKEEI  RE  +RRMLEAEIRRELI+EQEL +LRATG T+GLAF E  A R+LD RLNHIVDQSS
Subjt:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS

Query:  RAP--PVSGSRSSLNS
              V GS SSLNS
Subjt:  RAP--PVSGSRSSLNS

XP_030456348.1 uncharacterized protein LOC115677338 [Syzygium oleosum] (e value: 6.7e-24; % identity: 34.67)
Query:  DNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQR----LNHIVDQSS
        + +P + +   +  DP    +S   E+ KQKI+EEI+  E+  R+MLE E+RREL+LE+++V LR  G   GL F   SA      R    L+ + ++++
Subjt:  DNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQR----LNHIVDQSS

Query:  RA--PPVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEI
              V  + + ++ LP+   PE +  E+KP E+ K +  +L +PDP +    KRKA  +P  A GE     +   S K     ++ C  C ++ TSE 
Subjt:  RA--PPVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEI

Query:  TFKTHLEGKKHKYKE-------GRALQTGAVLNQRPSPAEK--GIKAPKQPALQNKGG------FKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQ
          K HLEGKKHK KE       G+   T + ++Q    ++K  G+K  ++  L+N  G      F+FWCEMCQ G+ S  VM++H  G+KH+A   +LGQ
Subjt:  TFKTHLEGKKHKYKE-------GRALQTGAVLNQRPSPAEK--GIKAPKQPALQNKGG------FKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQ

XP_038880353.1 zinc finger protein 385B [Benincasa hispida] (e value: 7.3e-55; % identity: 49.83)
Query:  FRAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQ-RLNH-IVD
        FRA DNK P  A + S  S      DS NAEL+KQ++K+EIMIREI +RRMLEAEIRRELI+EQEL   R  G T+GL F +  + RLLDQ R+NH IVD
Subjt:  FRAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQ-RLNH-IVD

Query:  QSSRAPPVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSE
               V GS SS   LPV P P P  EE KP + +K K  +L KPDP  FE K++    A EV    D   P    SSK  A  +F+C  CN+  TSE
Subjt:  QSSRAPPVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSE

Query:  ITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAE--------KGIKAPKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLG-QGNLED
        I+F  HL+GKKH  KEGR+LQT     Q PSPAE        +   A K+ AL+NK  FKFWC++C+ G+  M +M +H  G+KH AR LKL  Q  L+D
Subjt:  ITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAE--------KGIKAPKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLG-QGNLED

Query:  Q
        Q
Subjt:  Q

TrEMBL top hits (e value, % identity, alignment)
A0A059CKW4 Uncharacterized protein (e value: 4.7e-23; % identity: 33.45)
Query:  DPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNH--IVDQSSRAPPVSGSRSSLNSLP
        DP    ++   E+ KQKI+EEI+  E+  R+MLE E+RREL+LE+++V+ RA     GL F E  +T        H  +    SRA     +    N++ 
Subjt:  DPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNH--IVDQSSRAPPVSGSRSSLNSLP

Query:  VRPF---PEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKHKYKE
        + P    PE    E+KP E+ K +  +L +PDP +  AK++ + ++ +  E       +   S K     ++ C  C ++ TSE   K HLEGKKHK KE
Subjt:  VRPF---PEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKHKYKE

Query:  -------GRALQTGAVLNQRPSPAEKGIKAPKQPALQN-----KGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQGN
               G+   T + +NQ    ++   +      L+N     K  F+FWC MCQ G+ S+ VM++H  G+KH+AR  +LGQ +
Subjt:  -------GRALQTGAVLNQRPSPAEKGIKAPKQPALQN-----KGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQGN

A0A1S3B9S7 uncharacterized protein LOC103487343 (e value: 3.0e-46; % identity: 45.87)
Query:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS
        FRAIDNK P  A AAS+SD P QDDS N EL KQ+IKEEI +REIV RRMLEAEIRREL++E+EL + RA G T+G L+F      R +++ +N I+D S
Subjt:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS

Query:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT
        S      V GS SSLN          LKEE KP E E  K   L +PDP  F  K++     +A AA       D  + I P     SK  A  +F+C  
Subjt:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT

Query:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK
        CN+  TSEI+F  H+ GKKHK KEGR  Q     ++ P+  E+ +K           K PAL+     KF CE+C  G   M VM +H  GRKH AR LK
Subjt:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK

Query:  LGQ
        L Q
Subjt:  LGQ

A0A5D3B800 UBP1-associated proteins 1C-like isoform X3 (e value: 3.0e-46; % identity: 45.87)
Query:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS
        FRAIDNK P  A AAS+SD P QDDS N EL KQ+IKEEI +REIV RRMLEAEIRREL++E+EL + RA G T+G L+F      R +++ +N I+D S
Subjt:  FRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQG-LAFHEPSATRLLDQRLNHIVDQS

Query:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT
        S      V GS SSLN          LKEE KP E E  K   L +PDP  F  K++     +A AA       D  + I P     SK  A  +F+C  
Subjt:  SRAP--PVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKR-----KAKAAPEVAEGEDHTEPIQPP---SSKNSATHKFICKT

Query:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK
        CN+  TSEI+F  H+ GKKHK KEGR  Q     ++ P+  E+ +K           K PAL+     KF CE+C  G   M VM +H  GRKH AR LK
Subjt:  CNITTTSEITFKTHLEGKKHKYKEGRALQTGAVLNQRPSPAEKGIK---------APKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLK

Query:  LGQ
        L Q
Subjt:  LGQ

A0A6J1EJN4 uncharacterized protein LOC111433915 (e value: 1.0e-25; % identity: 68.1)
Query:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS
        RA+DNKP +AA+++  SSD P +DDS NAELVKQ+IKEEI  RE  +RRMLEAEIRRELI+EQEL +LRATG T+GLAF E  A R+LD RLNHIVDQSS
Subjt:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS

Query:  RAP--PVSGSRSSLNS
              V GS SSLNS
Subjt:  RAP--PVSGSRSSLNS

A0A6J1HN47 uncharacterized protein LOC111465139 isoform X1 (e value: 2.0e-21; % identity: 62.07)
Query:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS
        RA+DN+P +AA+++  SSD P +DDS NAELVKQ+IK +I  REI +RRMLEAE R ELI+EQEL +LRATG T+GLAF E    R+LD RLN IVDQSS
Subjt:  RAIDNKPPVAAAAAS-SSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHIVDQSS

Query:  RAPPVS--GSRSSLNS
            ++  GS SSLNS
Subjt:  RAPPVS--GSRSSLNS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24030.1 zinc ion binding;nucleic acid binding (e value: 1.5e-05; % identity: 24.22)
Query:  FRAID-NKPPVAAAAASSSDPPPQ------------------DDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHE
        +RAID N+PP A     S    P                    ++   E+ K++I++EI+I E   +R L AE+ +E+ +E+E+ + R +     L   E
Subjt:  FRAID-NKPPVAAAAASSSDPPPQ------------------DDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHE

Query:  PSATRLLDQR---------LNHIVDQSSRAPPVSGSRSSLNSLPVRPFPE-PLKEEMKP------LEVEKKKCFILTKPDPEIFEAKKRKAKAA----PE
           T  ++QR          N+   Q         +  S NSL   P  + P  ++M        LE  K+   +L + D  I  AK +         P+
Subjt:  PSATRLLDQR---------LNHIVDQSSRAPPVSGSRSSLNSLPVRPFPE-PLKEEMKP------LEVEKKKCFILTKPDPEIFEAKKRKAKAA----PE

Query:  VAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSE---ITFKTHLEGKKHKYKEGRA----LQTGAVLNQRPSPAEKGIKAPKQPALQNKGGFKFWCEM
        + +  + T      S+K                 +E         L+ K+ K KE  A    L+TG +++ +  P    + + K+   + +   KFWCE+
Subjt:  VAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSE---ITFKTHLEGKKHKYKEGRA----LQTGAVLNQRPSPAEKGIKAPKQPALQNKGGFKFWCEM

Query:  CQQGSNSMVVMDTHYLGRKHMA
        C+ G+   +VM  H LG+KH A
Subjt:  CQQGSNSMVVMDTHYLGRKHMA

AT2G24030.2 zinc ion binding;nucleic acid binding (e value: 7.4e-05; % identity: 33.77)
Query:  LEGKKHKYKEGRA----LQTGAVLNQRPSPAEKGIKAPKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMA
        L+ K+ K KE  A    L+TG +++ +  P    + + K+   + +   KFWCE+C+ G+   +VM  H LG+KH A
Subjt:  LEGKKHKYKEGRA----LQTGAVLNQRPSPAEKGIKAPKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMA

AT5G61190.1 putative endonuclease or glycosyl hydrolase with C2H2-type zinc finger domain (e value: 1.0e-06; % identity: 28.48)
Query:  AKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKH--KYKEGRAL-------QTGAVLNQRPSPA----------------
        A+ + EV E  +  + +    S+  A  +F+C  CN+   S+I F +HL GKKH     +  AL       + G    ++PS                  
Subjt:  AKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLEGKKH--KYKEGRAL-------QTGAVLNQRPSPA----------------

Query:  --------EKGIKAPKQP----ALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMA
                EKG +   QP    AL+N    K+ C MC  G +S +V +TH  G+KH A
Subjt:  --------EKGIKAPKQP----ALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMA


Sequences
CDS sequence
ATGGGAGGCATATTAAAAGCCAACTATTTTTTTCAATTGATTTTGTTCCGCGCCATCGACAACAAGCCACCGGTCGCCGCCGCCGCCGCCTCCAGTTCCGATCCACCACC
GCAAGACGATTCTACCAACGCGGAGCTCGTGAAACAGAAGATTAAGGAAGAGATAATGATTCGAGAGATTGTGAACCGACGAATGCTCGAAGCGGAGATTCGGAGGGAGC
TCATCCTCGAGCAAGAACTGGTGATGCTTAGGGCTACAGGCCTGACGCAGGGGCTAGCATTTCACGAGCCATCGGCTACGCGATTGTTGGACCAGAGGCTGAATCACATT
GTTGATCAGTCGTCTCGGGCGCCGCCGGTTTCTGGTTCTAGGTCTTCGCTGAACTCGTTGCCGGTTCGTCCATTTCCGGAGCCTCTAAAGGAAGAAATGAAGCCTTTGGA
AGTTGAGAAGAAAAAGTGTTTCATTCTGACAAAACCAGACCCAGAAATATTCGAAGCAAAGAAGAGGAAAGCAAAGGCAGCACCAGAAGTGGCTGAAGGTGAAGATCACA
CAGAACCAATTCAACCGCCCAGTTCAAAAAACTCAGCCACCCATAAGTTCATTTGCAAAACGTGCAACATCACAACCACAAGTGAAATAACCTTCAAGACTCATTTAGAA
GGCAAGAAACACAAATACAAAGAGGGACGTGCCCTACAAACTGGGGCAGTCCTGAACCAACGACCATCCCCAGCAGAAAAGGGAATTAAAGCTCCTAAACAACCAGCCCT
ACAAAATAAGGGCGGGTTCAAATTTTGGTGCGAAATGTGTCAACAAGGATCTAATAGTATGGTCGTTATGGACACACATTACTTAGGAAGAAAGCATATGGCTCGTCGTT
TGAAACTTGGCCAAGGCAATTTGGAGGACCAAATTGAACCGCAAGGAAATGACGAGGCCACTGATTTAATCGCCTGA
mRNA sequence
ATGGGAGGCATATTAAAAGCCAACTATTTTTTTCAATTGATTTTGTTCCGCGCCATCGACAACAAGCCACCGGTCGCCGCCGCCGCCGCCTCCAGTTCCGATCCACCACC
GCAAGACGATTCTACCAACGCGGAGCTCGTGAAACAGAAGATTAAGGAAGAGATAATGATTCGAGAGATTGTGAACCGACGAATGCTCGAAGCGGAGATTCGGAGGGAGC
TCATCCTCGAGCAAGAACTGGTGATGCTTAGGGCTACAGGCCTGACGCAGGGGCTAGCATTTCACGAGCCATCGGCTACGCGATTGTTGGACCAGAGGCTGAATCACATT
GTTGATCAGTCGTCTCGGGCGCCGCCGGTTTCTGGTTCTAGGTCTTCGCTGAACTCGTTGCCGGTTCGTCCATTTCCGGAGCCTCTAAAGGAAGAAATGAAGCCTTTGGA
AGTTGAGAAGAAAAAGTGTTTCATTCTGACAAAACCAGACCCAGAAATATTCGAAGCAAAGAAGAGGAAAGCAAAGGCAGCACCAGAAGTGGCTGAAGGTGAAGATCACA
CAGAACCAATTCAACCGCCCAGTTCAAAAAACTCAGCCACCCATAAGTTCATTTGCAAAACGTGCAACATCACAACCACAAGTGAAATAACCTTCAAGACTCATTTAGAA
GGCAAGAAACACAAATACAAAGAGGGACGTGCCCTACAAACTGGGGCAGTCCTGAACCAACGACCATCCCCAGCAGAAAAGGGAATTAAAGCTCCTAAACAACCAGCCCT
ACAAAATAAGGGCGGGTTCAAATTTTGGTGCGAAATGTGTCAACAAGGATCTAATAGTATGGTCGTTATGGACACACATTACTTAGGAAGAAAGCATATGGCTCGTCGTT
TGAAACTTGGCCAAGGCAATTTGGAGGACCAAATTGAACCGCAAGGAAATGACGAGGCCACTGATTTAATCGCCTGA
Protein sequence
MGGILKANYFFQLILFRAIDNKPPVAAAAASSSDPPPQDDSTNAELVKQKIKEEIMIREIVNRRMLEAEIRRELILEQELVMLRATGLTQGLAFHEPSATRLLDQRLNHI
VDQSSRAPPVSGSRSSLNSLPVRPFPEPLKEEMKPLEVEKKKCFILTKPDPEIFEAKKRKAKAAPEVAEGEDHTEPIQPPSSKNSATHKFICKTCNITTTSEITFKTHLE
GKKHKYKEGRALQTGAVLNQRPSPAEKGIKAPKQPALQNKGGFKFWCEMCQQGSNSMVVMDTHYLGRKHMARRLKLGQGNLEDQIEPQGNDEATDLIA