CuGenDBv2

Sed0023941 (gene) of Chayote v1 genome

Gene ID: Sed0023941
Organism: Sechium edule (Chayote v1)
Description: zinc finger protein 830-like isoform X3
Genome location: LG03:46196245..46198163
RNA-Seq Expression: Sed0023941
Synteny: Sed0023941
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
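
The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed as shown here. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("LG03:46196245..46198163")

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 1919 bases on LG03.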


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAG6586051.1 Signal recognition particle receptor subunit alpha-like protein, partial [Cucurbita argyrosperma subsp. sororia] | 4.7e-29 | 40
Query:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERV----VLEKPN
        +M L+ LR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P   A +   S++ V  S N+   +E  ++ +R+    V   P 
Subjt:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERV----VLEKPN

Query:  LNAL-------------GQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKA
        +  L              ++   + EK DP V  +KRKAETP   T+D +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR +KA
Subjt:  LNAL-------------GQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKA

Query:  SNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTT
        SNV+    E LPNKRRKL     +SGS  A+  ESK G TLQC+KTG ++ +    +   T+D    + KFKFWCEKCKVGA  T
Subjt:  SNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTT

XP_022937456.1 uncharacterized protein LOC111443862 [Cucurbita moschata] | 1.8e-33 | 40.68
Query:  LLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERVVL-------------
        LLR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P   A +   S++ V  S N+   +E  ++ +R+ +             
Subjt:  LLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERVVL-------------

Query:  --EKPNLNALGQKRKD--VQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSR
          +K     L    ++  + EK DP V  +KRKAETP   T+D +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR +KASNV+ 
Subjt:  --EKPNLNALGQKRKD--VQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSR

Query:  DMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN
           E LPNKRRKL     +SG   A+  ESK G TLQC+KTG ++ +    +   T+D    + KFKFWCEKCKVGA  T  M  HLNGKKH+A+
Subjt:  DMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN

XP_022969583.1 uncharacterized protein LOC111468562 [Cucurbita maxima] | 7.5e-35 | 42.11
Query:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEP------PGTAAVAAAS---------ASAVAPSFNDG------E
        +M L+ LR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P      P +A V  +S          S+    F  G      +
Subjt:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEP------PGTAAVAAAS---------ASAVAPSFNDG------E

Query:  VLEADKTEERVVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLR
         L AD   ER VL+        ++   + EK DPNV  +KRKAETP   TND +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR
Subjt:  VLEADKTEERVVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLR

Query:  TRKASNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKK
         +KASNV+    E LPNKRRKL     +SG   A+  ESK G TLQC+KTG ++ +    +   T+D    + KFKFWCEKCKVGA  T  M  HLNGKK
Subjt:  TRKASNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKK

Query:  HRAN
        H+A+
Subjt:  HRAN

XP_023536654.1 uncharacterized protein LOC111797967 [Cucurbita pepo subsp. pepo] | 2.3e-36 | 41.67
Query:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERV----VLEKPN
        +M L+ LR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P   A ++  S++ V  S N+   +E  ++ +R+    V   P 
Subjt:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERV----VLEKPN

Query:  LNAL-------------GQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKA
        +  L              ++   + EK DPNV  +KRKAETP   T+D +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR +KA
Subjt:  LNAL-------------GQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKA

Query:  SNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATI---PNFMKPYATKDKQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN
        SNV+    E LPNKRRKL     +SGS  A+  ESK G TLQC+KTG ++ +    NF+       K+ KFKFWCEKCKVGA  T  M  HLNGKKH+A+
Subjt:  SNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATI---PNFMKPYATKDKQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN

XP_038890955.1 uncharacterized protein LOC120080381 isoform X2 [Benincasa hispida] | 1.0e-28 | 36.42
Query:  DAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRS----------EP-----PGTAAVAAASASA--------VAPSFNDGEVLEA
        D+  + L +M L+ LR+E+L+ EI+RQRFL ++AR   +L + E+AIR           +P     P +AA AA S S+        V  SF++   +E 
Subjt:  DAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRS----------EP-----PGTAAVAAASASA--------VAPSFNDGEVLEA

Query:  DKTEER-------------------VVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTND---HIQPFDKVKEWRCALCRVTVRNEGSFKQHL
         K+ +R                   +V +K  L  L   ++++ EK DPNV  +KRKA T  LSTND    I       EW C L +VT  N+  F QHL
Subjt:  DKTEER-------------------VVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTND---HIQPFDKVKEWRCALCRVTVRNEGSFKQHL

Query:  RGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRK-------LSSGSAGAKEHESKVGRTLQCDKT----GLNA-TIPNFMKPYATKDK-----------
         GKKH+ R+E  LR +K  N+SR   E L  KRRK       LSS + GA+  ESK   TLQ +K      +NA  +P+F+K    +D            
Subjt:  RGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRK-------LSSGSAGAKEHESKVGRTLQCDKT----GLNA-TIPNFMKPYATKDK-----------

Query:  ----------QHKFKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE
                  + KF FWCEKCKVGA  T  M  H+NGKKH+ N +E
Subjt:  ----------QHKFKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE
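
In BLAST-style alignments like those above, the middle line shows a letter where the aligned residues are identical, `+` where the substitution is conservative, and a space otherwise. A rough sketch of how a percent identity can be estimated from such a midline follows. The function name `percent_identity` is hypothetical, and the fragment is only the first 22 columns of the KAG6586051.1 alignment, so its value will not match the whole-alignment figure reported in the table.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity as identical columns / alignment length.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    The query line (gaps included) supplies the alignment length, because
    trailing spaces of the midline are often stripped in text dumps.
    """
    if not query:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)

# Fragment copied from the first alignment block on this page.
query_fragment = "QMALILLRQEELVMEIQRQRFL"
midline_fragment = "+M L+ LR+E+L++EI+RQ+FL"
print(round(percent_identity(query_fragment, midline_fragment), 2))
```

Gap columns count toward the alignment length but never toward the identical total, which is why long gapped alignments report lower identity than their conserved stretches suggest.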

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0LJ38 Uncharacterized protein | 2.5e-28 | 34.49
Query:  TKIMTMLNLIDPNDAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR-------------------------SEPPGTAAVA---A
        T I T+    + N+ +  HL +M  + LR+E+L+ EI+R+RFL ++AR    L + E+AIR                         + PP +A      +
Subjt:  TKIMTMLNLIDPNDAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR-------------------------SEPPGTAAVA---A

Query:  ASASAVAPSFNDGEVLEADKTEERV----VLEKPNL---------NALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK----EWRCALCR
         S + V  S+++ + +E  KT +R+    V  +P +          A  +++  V EK  PN   ++RKAET    +  HI P    K    EW CALC+
Subjt:  ASASAVAPSFNDGEVLEADKTEERV----VLEKPNL---------NALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK----EWRCALCR

Query:  VTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRKLSSGSA-----GAKEHESKVGRTLQCDKT----GLNATIPNFMKPYATKDKQ
        VT   E SF  HLRGKKH+ R+E  LR  K S VSR  HE L  KRRKL    A     GA+  E+K G     +K+     +NA IP F+K    + ++
Subjt:  VTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRKLSSGSA-----GAKEHESKVGRTLQCDKT----GLNATIPNFMKPYATKDKQ

Query:  H--------------KFKFWCEKCKVGANTTNNMFGHLNGKKHRA
        +              KF FWCEKCKVGA  T  M  H+NGK+H+A
Subjt:  H--------------KFKFWCEKCKVGANTTNNMFGHLNGKKHRA

A0A1S3BUI4 hepatoma-derived growth factor-related protein 2-like isoform X2 | 1.6e-22 | 31.91
Query:  TKIMTMLNLIDPNDAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRSEPPGTAA---------VAAASASAVAP-----------
        T I+T+     PN+   +   +M L+ LR+E+L+ EI+R+RFL ++AR    L + E+AIR   P             V   S++  AP           
Subjt:  TKIMTMLNLIDPNDAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRSEPPGTAA---------VAAASASAVAP-----------

Query:  SFNDGEVLEADKTEERVVL-----------------EKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK-----EWRCALCRVTV
        ++++   +E  KT  R+                   ++P +       +++ EK   N L +KRKAET   ST   +     VK     EW CALC+V+ 
Subjt:  SFNDGEVLEADKTEERVVL-----------------EKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK-----EWRCALCRVTV

Query:  RNEGSFKQHLRGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRK--------LSSGSAGAKEHESKVGRTLQCDKT----GLNATIPNFMKPYATKDKQ
         NE  F +HLRGKKH  R+E  LR RK S VS+   E LP KRRK           G+ G ++  +K G T   +KT     +NA IP F K    ++ +
Subjt:  RNEGSFKQHLRGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRK--------LSSGSAGAKEHESKVGRTLQCDKT----GLNATIPNFMKPYATKDKQ

Query:  HK----------------FKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE
         +                  FWCE+ KVGA+ T  M  H+NGKKH+A  +E
Subjt:  HK----------------FKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE

A0A6J1E048 uncharacterized protein LOC111024695 | 4.3e-28 | 37.79
Query:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRSEPPGTAAVA---------AASASAVAPSFNDGE------VLEADKTEERVVL-------
        ++ L+ LR+E+L++E++RQRFL ++AR   +L++ E+AIR    G A  A           S SA AP     E      +  +D+   R V        
Subjt:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRSEPPGTAAVA---------AASASAVAPSFNDGE------VLEADKTEERVVL-------

Query:  ------------EKPNLNALGQKRKD---VQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK----EWRCALCRVTVRNEGSFKQHLRGKKHKHREEDR
                    +KP      Q  K    + EK DP +  +KRKAE PL    D +QP    K    EW CALCRVTV +E +F QHL+GKKH+ R+E  
Subjt:  ------------EKPNLNALGQKRKD---VQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVK----EWRCALCRVTVRNEGSFKQHLRGKKHKHREEDR

Query:  LRTRKASNVSRDMHEQLPNKRRKLSSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKDKQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRANH
        LR +KASNVS+     +  KRRK+ SGSAGA+    K  ++ QC+KTG                   +FKFWC+ CKVGA  T  M  HLNGK+H+A +
Subjt:  LRTRKASNVSRDMHEQLPNKRRKLSSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKDKQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRANH

A0A6J1FAE1 uncharacterized protein LOC111443862 | 8.9e-34 | 40.68
Query:  LLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERVVL-------------
        LLR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P   A +   S++ V  S N+   +E  ++ +R+ +             
Subjt:  LLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERVVL-------------

Query:  --EKPNLNALGQKRKD--VQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSR
          +K     L    ++  + EK DP V  +KRKAETP   T+D +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR +KASNV+ 
Subjt:  --EKPNLNALGQKRKD--VQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSR

Query:  DMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN
           E LPNKRRKL     +SG   A+  ESK G TLQC+KTG ++ +    +   T+D    + KFKFWCEKCKVGA  T  M  HLNGKKH+A+
Subjt:  DMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRAN

A0A6J1I1E4 uncharacterized protein LOC111468562 | 3.6e-35 | 42.11
Query:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEP------PGTAAVAAAS---------ASAVAPSFNDG------E
        +M L+ LR+E+L++EI+RQ+FL + AR   +L + E+AIR              S P      P +A V  +S          S+    F  G      +
Subjt:  QMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIR--------------SEP------PGTAAVAAAS---------ASAVAPSFNDG------E

Query:  VLEADKTEERVVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLR
         L AD   ER VL+        ++   + EK DPNV  +KRKAETP   TND +QPF    +   EW C LC+VTV ++ +F QHL GKKHK R+E  LR
Subjt:  VLEADKTEERVVLEKPNLNALGQKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPF----DKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLR

Query:  TRKASNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKK
         +KASNV+    E LPNKRRKL     +SG   A+  ESK G TLQC+KTG ++ +    +   T+D    + KFKFWCEKCKVGA  T  M  HLNGKK
Subjt:  TRKASNVSRDMHEQLPNKRRKL-----SSGSAGAKEHESKVGRTLQCDKTGLNATIPNFMKPYATKD---KQHKFKFWCEKCKVGANTTNNMFGHLNGKK

Query:  HRAN
        H+A+
Subjt:  HRAN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAGACGAAGATCATGACAATGTTGAATCTGATTGATCCGAACGATGCTGTGGGAAAGCATTTGCATCAAATGGCGTTGATACTGTTGAGGCAGGAGGAATTGGTGAT
GGAGATCCAACGGCAGCGATTTCTTAGTCAGAAAGCCAGGTCACATCCGATCTTGTCCCAACCGGAGGTCGCGATTCGATCGGAACCGCCGGGTACGGCGGCGGTTGCGG
CGGCGTCGGCGTCGGCGGTGGCGCCGTCGTTCAATGACGGTGAAGTACTTGAGGCTGATAAAACAGAAGAACGAGTAGTACTGGAAAAACCCAACCTGAATGCATTAGGA
CAAAAGCGAAAGGATGTGCAGGAAAAACACGACCCGAATGTATTAGGACAAAAGCGAAAGGCCGAGACGCCATTGTTATCTACTAATGATCACATCCAACCATTTGATAA
GGTGAAAGAGTGGAGATGTGCACTGTGTCGAGTTACCGTGAGAAACGAAGGATCATTTAAGCAGCACCTTCGTGGCAAGAAGCACAAGCACAGGGAGGAAGATCGGCTGA
GGACTCGAAAGGCGAGCAACGTCTCCAGGGATATGCACGAGCAATTACCGAACAAAAGGAGGAAGCTAAGCTCTGGCTCTGCTGGTGCAAAAGAACATGAATCAAAAGTT
GGAAGAACCCTTCAATGTGATAAAACTGGCCTGAATGCAACAATCCCAAACTTTATGAAACCATATGCCACAAAGGATAAGCAACACAAGTTCAAGTTTTGGTGTGAAAA
GTGCAAAGTTGGTGCTAATACTACAAATAACATGTTTGGTCATCTTAATGGGAAGAAACACAGGGCAAATCATCAGGAATAA
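
As a quick consistency check, the CDS can be translated with the standard genetic code and compared against the protein sequence listed on this page. The sketch below translates only the first 30 and last 12 nucleotides of the CDS, copied from the lines above. The `translate` helper is illustrative and assumes the standard (nuclear) codon table with the sequence already in frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGAAGACGAAGATCATGACAATGTTGAAT"  # first 30 nt of the CDS
cds_end = "CATCAGGAATAA"                      # last 12 nt of the CDS

print(translate(cds_start))  # MKTKIMTMLN
print(translate(cds_end))    # HQE*
```

Both fragments agree with the start (`MKTKIMTMLN`) and end (`HQE`) of the protein sequence given for this gene, with the final `TAA` acting as the stop codon.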
mRNA sequence
TATTTTTCTCCGCGTTTTATTCACACTTTATTTTTATCACCTCATTATTTGGGACTATATATATAACAGATTTAAAAGAAAAATCCCTAATTCATTCTTCACTTTCACTG
TTAATTTCTCGATTTTCGATTTCGGAGTTCGATCCAGACGCGTTTGTGAGAATGAAGACGAAGATCATGACAATGTTGAATCTGATTGATCCGAACGATGCTGTGGGAAA
GCATTTGCATCAAATGGCGTTGATACTGTTGAGGCAGGAGGAATTGGTGATGGAGATCCAACGGCAGCGATTTCTTAGTCAGAAAGCCAGGTCACATCCGATCTTGTCCC
AACCGGAGGTCGCGATTCGATCGGAACCGCCGGGTACGGCGGCGGTTGCGGCGGCGTCGGCGTCGGCGGTGGCGCCGTCGTTCAATGACGGTGAAGTACTTGAGGCTGAT
AAAACAGAAGAACGAGTAGTACTGGAAAAACCCAACCTGAATGCATTAGGACAAAAGCGAAAGGATGTGCAGGAAAAACACGACCCGAATGTATTAGGACAAAAGCGAAA
GGCCGAGACGCCATTGTTATCTACTAATGATCACATCCAACCATTTGATAAGGTGAAAGAGTGGAGATGTGCACTGTGTCGAGTTACCGTGAGAAACGAAGGATCATTTA
AGCAGCACCTTCGTGGCAAGAAGCACAAGCACAGGGAGGAAGATCGGCTGAGGACTCGAAAGGCGAGCAACGTCTCCAGGGATATGCACGAGCAATTACCGAACAAAAGG
AGGAAGCTAAGCTCTGGCTCTGCTGGTGCAAAAGAACATGAATCAAAAGTTGGAAGAACCCTTCAATGTGATAAAACTGGCCTGAATGCAACAATCCCAAACTTTATGAA
ACCATATGCCACAAAGGATAAGCAACACAAGTTCAAGTTTTGGTGTGAAAAGTGCAAAGTTGGTGCTAATACTACAAATAACATGTTTGGTCATCTTAATGGGAAGAAAC
ACAGGGCAAATCATCAGGAATAAAAAGAAAGCTGATGTAGATATTGGATAGAAGAGGTTGGCTCGACGAACGACAGATGTCGCCTGGTAATTGTCTTAAAGTTTTACAGT
CTGTTCATCTTATGTAAATTTATGATGTGATTCTGGTCTTTAAAGTTAAACAGTTTGTTCATCTTAGGTAAATCTATGATGAGTAATTAATTCTATGCTGATGGATTGTT
CCAAAAGGAATATTGAACAATTTTGTACTAATTTTTTTTGAAAACGAAATGTCAATTCTTATTATCAATATTACGGTTATTTGGAACCAAAATGTTCATAAG
Protein sequence
MKTKIMTMLNLIDPNDAVGKHLHQMALILLRQEELVMEIQRQRFLSQKARSHPILSQPEVAIRSEPPGTAAVAAASASAVAPSFNDGEVLEADKTEERVVLEKPNLNALG
QKRKDVQEKHDPNVLGQKRKAETPLLSTNDHIQPFDKVKEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRLRTRKASNVSRDMHEQLPNKRRKLSSGSAGAKEHESKV
GRTLQCDKTGLNATIPNFMKPYATKDKQHKFKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE
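
The InterPro annotations for this gene describe C2H2-type zinc fingers, in which two cysteines and two histidines coordinate a zinc ion. The sketch below scans two fragments of the protein sequence above for a loose `C-x2-C ... H-x(3-6)-H` spacing. The regular expression and its spacing ranges are an illustrative assumption, not the InterPro signature, and a match here is not evidence of a functional domain.

```python
import re

# Hypothetical, deliberately loose pattern: C-x2-C, 8-14 residues, H, 3-6 residues, H.
ZNF_LIKE = re.compile(r"C.{2}C.{8,14}H.{3,6}H")

# Two fragments copied from the protein sequence on this page.
fragments = {
    "first": "KEWRCALCRVTVRNEGSFKQHLRGKKHKHREEDRL",
    "second": "KFKFWCEKCKVGANTTNNMFGHLNGKKHRANHQE",
}

def find_motifs(seq: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZNF_LIKE.finditer(seq)]

for name, seq in fragments.items():
    print(name, find_motifs(seq))
```

Each fragment yields one candidate with the cysteine pair followed by a histidine pair five residues apart. A profile-based search against the InterPro entries listed above would be the more reliable way to confirm domain boundaries.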