; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010698 (gene) of Snake gourd v1 genome

Gene IDTan0010698
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG10:12674025..12675177
RNA-Seq ExpressionTan0010698
SyntenyTan0010698
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48431.1 hypothetical protein EZV62_027725 [Acer yangbiense]9.5e-2834.43Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        ++ R+ + GPW F+GA+I+ ++P      + +    VDFWV + N+P+  + +++ K LG  IGEV ++D  P      +F R+R+ V++ KPLRR L++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG---------------VEFKKQGTFKQ
         +  DGE+  +P +YE+LP FCF  GLVGH+ +EC S   D       + YG  +R                V+ + QG F+Q
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG---------------VEFKKQGTFKQ

TXG60815.1 hypothetical protein EZV62_012178 [Acer yangbiense]1.6e-2733.71Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        ++ R++  GPW F+ ++++ +K N L +   +    V+FWVQ+HN PL C+ +E+ + LG+ IG+++ ID       + ++ R+R+ +DI++PL+R +++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECS--LPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSG
         ++ DG +  +  +YE+LP++CF YGLVGH    C   L       A E +YG  +R         FKQT  R  G+G
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECS--LPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSG

TXG73942.1 hypothetical protein EZV62_002521 [Acer yangbiense]6.2e-2729.04Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFE--GIWSRFARIRIRVDITKPLRRGL
        ++  +   GPW F+  +I+  KP  L +   +S   V+FWVQ+HNIP+ C+  ++ +L+  SIG V+    FP E    W +F R +IRVDI+KPL+RG+
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFE--GIWSRFARIRIRVDITKPLRRGL

Query:  KIKMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSE--NAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQR
         I ++++G       KYE+L +FC+ YGL+GH   EC       +  +  E  YG  +R     K       D    G  +   N   GF+K ++S ++ 
Subjt:  KIKMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSE--NAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQR

Query:  SNKDWRKNDALNTDPFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKNGGKVHENHLTFPFTKEFDISEGNESIRFSKLTEEDSSNLKMMEEDDNPSS
         N   R    +N  PF++  L +     ++   +  +V+K V +D      GKV E             ++   S  F+ +  +D    + ++E D  S+
Subjt:  SNKDWRKNDALNTDPFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKNGGKVHENHLTFPFTKEFDISEGNESIRFSKLTEEDSSNLKMMEEDDNPSS

Query:  VLA
         +A
Subjt:  VLA

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]3.2e-3130.52Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        EK+R++ SGPW FN ++++   P    QP D++     FW+Q+HNIP  C+  E+  +LG  +G+V +I+    +G    F R+R+++D++KPLRRG+K+
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQRSNKD
        K   DG+D W P +YEKLP+FC+  G +GH   EC   S V        YG  +R    KK  +  + +E +   G+     +V   +  +   +R +++
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQRSNKD

Query:  WRKNDALNTD---PFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKN
        WR  D   +      ++G   +  E+  + AE+     K+ ++ +++++
Subjt:  WRKNDALNTD---PFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKN

XP_031091148.1 uncharacterized protein LOC115996146 [Ipomoea triloba]6.2e-2736.13Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        E +RII  GPW +  +++L  K      P+ ++L H DFW+Q+H +P+ C  E VL+ +G+ +G ++++D   F+G    F RIR+ ++++KPL++G+++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMR
        K +DDGE   I  +YE+LP FCF  G++GH + +C+   +  E   E  + P++R
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMR

TrEMBL top hitse value%identityAlignment
A0A5C7GUN1 Uncharacterized protein4.6e-2834.43Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        ++ R+ + GPW F+GA+I+ ++P      + +    VDFWV + N+P+  + +++ K LG  IGEV ++D  P      +F R+R+ V++ KPLRR L++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG---------------VEFKKQGTFKQ
         +  DGE+  +P +YE+LP FCF  GLVGH+ +EC S   D       + YG  +R                V+ + QG F+Q
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG---------------VEFKKQGTFKQ

A0A5C7HVE7 Uncharacterized protein7.9e-2833.71Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        ++ R++  GPW F+ ++++ +K N L +   +    V+FWVQ+HN PL C+ +E+ + LG+ IG+++ ID       + ++ R+R+ +DI++PL+R +++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECS--LPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSG
         ++ DG +  +  +YE+LP++CF YGLVGH    C   L       A E +YG  +R         FKQT  R  G+G
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECS--LPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSG

A0A5C7IT66 CCHC-type domain-containing protein3.9e-2731.5Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        ++ R+++ GPW F+GA+I+ ++P      + +    VDFWVQ+ N+P+ C+ +++ K LG  IGEV ++D  P      +F R+R+ V++  PLRR L++
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG----VEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLE-----
         +  DGE+  +P +YE+LP FCF  GLVGH   EC S   D       + YG  +R     V++ +       D R    G    +P VG    E     
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC-SLPSDVSENAGELNYGPNMRG----VEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLE-----

Query:  --KSSNQR---SNKDWRKNDALNTDPFDKGS---LPLHMEKVSSKAELKGNVNK
          KS N R    + +   + + +  P DKG    + + ++ V S  + K  V K
Subjt:  --KSSNQR---SNKDWRKNDALNTDPFDKGS---LPLHMEKVSSKAELKGNVNK

A0A5C7IZA1 Uncharacterized protein3.0e-2729.04Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFE--GIWSRFARIRIRVDITKPLRRGL
        ++  +   GPW F+  +I+  KP  L +   +S   V+FWVQ+HNIP+ C+  ++ +L+  SIG V+    FP E    W +F R +IRVDI+KPL+RG+
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFE--GIWSRFARIRIRVDITKPLRRGL

Query:  KIKMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSE--NAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQR
         I ++++G       KYE+L +FC+ YGL+GH   EC       +  +  E  YG  +R     K       D    G  +   N   GF+K ++S ++ 
Subjt:  KIKMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSE--NAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQR

Query:  SNKDWRKNDALNTDPFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKNGGKVHENHLTFPFTKEFDISEGNESIRFSKLTEEDSSNLKMMEEDDNPSS
         N   R    +N  PF++  L +     ++   +  +V+K V +D      GKV E             ++   S  F+ +  +D    + ++E D  S+
Subjt:  SNKDWRKNDALNTDPFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKNGGKVHENHLTFPFTKEFDISEGNESIRFSKLTEEDSSNLKMMEEDDNPSS

Query:  VLA
         +A
Subjt:  VLA

A0A6J1D765 uncharacterized protein LOC1110179021.5e-3130.52Show/hide
Query:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI
        EK+R++ SGPW FN ++++   P    QP D++     FW+Q+HNIP  C+  E+  +LG  +G+V +I+    +G    F R+R+++D++KPLRRG+K+
Subjt:  EKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKI

Query:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQRSNKD
        K   DG+D W P +YEKLP+FC+  G +GH   EC   S V        YG  +R    KK  +  + +E +   G+     +V   +  +   +R +++
Subjt:  KMQDDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQRSNKD

Query:  WRKNDALNTD---PFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKN
        WR  D   +      ++G   +  E+  + AE+     K+ ++ +++++
Subjt:  WRKNDALNTD---PFDKGSLPLHMEKVSSKAELKGNVNKVVTEDMIQKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13450.1 unknown protein1.6e-0929.01Show/hide
Query:  PWFFNGAIILFDKPNMLAQPDDIS-----LIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQD
        PW +N   +        AQ  +++     L  ++ WVQ+  IPL  + EE    +   +GE++ +D          + R+RIR  IT  LR  L+I + D
Subjt:  PWFFNGAIILFDKPNMLAQPDDIS-----LIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQD

Query:  DGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC
         GE   I  +YE+L   C +   + H +N C
Subjt:  DGEDRWIPCKYEKLPEFCFTYGLVGHIQNEC

AT2G41590.1 unknown protein4.3e-1031.86Show/hide
Query:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR
        PW FN   +   +  +    + ++ I  D WVQ+  IPL  + EE +  + + +GEVL +D      I   + R+R+R  IT  LR   +I + D GE  
Subjt:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR

Query:  WIPCKYEKLPEFC
         I  +YE+L   C
Subjt:  WIPCKYEKLPEFC

AT3G31430.1 unknown protein3.9e-1129.86Show/hide
Query:  IIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSR-FARIRIRVDITKPLRRGLKIKMQ
        ++  GPW FN  +IL  +     +P       + FWVQ+  IP   L   V++ +G+++G+VL  D F  E +    FAR+ +  DIT PLR     +  
Subjt:  IIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSR-FARIRIRVDITKPLRRGLKIKMQ

Query:  DDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGE
          G +  +  +YE+L  FC   G++ H    C + +   E A +
Subjt:  DDGEDRWIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGE

AT5G25200.1 unknown protein6.2e-0930.09Show/hide
Query:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR
        PW FN   +   +      P    +  +D WVQ+  IPL  + EE +  + + +GE++ +D          F R+R+R  IT  LR   +I + D GE  
Subjt:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR

Query:  WIPCKYEKLPEFC
         I  +YE+L   C
Subjt:  WIPCKYEKLPEFC

AT5G36228.1 nucleic acid binding;zinc ion binding3.3e-1028.57Show/hide
Query:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR
        PW FN   I   +      P +  L  +D WV +  IPL  + E  ++++  ++GEV+ +D          F R+++R+D T+PLR   +++     E  
Subjt:  PWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR

Query:  WIPCKYEKLPEFCFTYGLVGHIQNEC
         I  +YEKL   C     V H  + C
Subjt:  WIPCKYEKLPEFCFTYGLVGHIQNEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGAATAGAATAATTGTAAGCGGTCCTTGGTTTTTCAATGGAGCCATAATCCTATTTGATAAGCCAAACATGCTTGCACAACCAGATGATATATCCCTAATTCA
TGTTGATTTTTGGGTACAGTTACACAACATTCCCCTATTTTGCCTAGAAGAAGAAGTTCTAAAACTCTTGGGCAAATCAATTGGTGAAGTTTTGCAAATAGATACTTTCC
CCTTCGAGGGTATTTGGAGTCGATTTGCTAGAATCAGAATTAGGGTTGATATAACCAAACCTCTTAGACGTGGTTTGAAAATCAAAATGCAAGATGATGGAGAAGATCGT
TGGATTCCATGTAAATATGAGAAACTTCCAGAATTTTGCTTCACATATGGATTGGTTGGGCACATTCAAAACGAATGTAGCCTTCCATCCGATGTTAGTGAGAATGCAGG
AGAACTGAACTATGGCCCTAACATGAGAGGAGTGGAATTTAAAAAACAGGGAACATTTAAACAAACTGATGAGAGGTATGTCGGCTCTGGAAAAGATAATGGAAATCCCA
AGGTTGGTTTCGACAAACTGGAAAAATCTTCAAATCAAAGGAGTAATAAAGATTGGAGGAAAAATGATGCATTAAATACTGACCCATTTGACAAAGGCTCCCTGCCATTA
CATATGGAGAAAGTCAGTAGTAAAGCAGAGTTAAAGGGTAATGTGAATAAGGTTGTGACTGAAGACATGATTCAAAAAAATGGGGGAAAAGTCCATGAGAACCATTTAAC
TTTTCCCTTTACAAAAGAATTTGATATATCAGAAGGAAATGAATCAATCAGATTTAGCAAGTTAACTGAAGAAGATAGTTCAAATTTAAAGATGATGGAGGAAGATGACA
ATCCTTCTAGTGTCCTTGCTAATCAAACATTAGAATAA
mRNA sequenceShow/hide mRNA sequence
CTGTATTGTTGAAAATCTATTCTTCTTTAAATTTAAAACCCTAATGGAAAAGAATAGAATAATTGTAAGCGGTCCTTGGTTTTTCAATGGAGCCATAATCCTATTTGATA
AGCCAAACATGCTTGCACAACCAGATGATATATCCCTAATTCATGTTGATTTTTGGGTACAGTTACACAACATTCCCCTATTTTGCCTAGAAGAAGAAGTTCTAAAACTC
TTGGGCAAATCAATTGGTGAAGTTTTGCAAATAGATACTTTCCCCTTCGAGGGTATTTGGAGTCGATTTGCTAGAATCAGAATTAGGGTTGATATAACCAAACCTCTTAG
ACGTGGTTTGAAAATCAAAATGCAAGATGATGGAGAAGATCGTTGGATTCCATGTAAATATGAGAAACTTCCAGAATTTTGCTTCACATATGGATTGGTTGGGCACATTC
AAAACGAATGTAGCCTTCCATCCGATGTTAGTGAGAATGCAGGAGAACTGAACTATGGCCCTAACATGAGAGGAGTGGAATTTAAAAAACAGGGAACATTTAAACAAACT
GATGAGAGGTATGTCGGCTCTGGAAAAGATAATGGAAATCCCAAGGTTGGTTTCGACAAACTGGAAAAATCTTCAAATCAAAGGAGTAATAAAGATTGGAGGAAAAATGA
TGCATTAAATACTGACCCATTTGACAAAGGCTCCCTGCCATTACATATGGAGAAAGTCAGTAGTAAAGCAGAGTTAAAGGGTAATGTGAATAAGGTTGTGACTGAAGACA
TGATTCAAAAAAATGGGGGAAAAGTCCATGAGAACCATTTAACTTTTCCCTTTACAAAAGAATTTGATATATCAGAAGGAAATGAATCAATCAGATTTAGCAAGTTAACT
GAAGAAGATAGTTCAAATTTAAAGATGATGGAGGAAGATGACAATCCTTCTAGTGTCCTTGCTAATCAAACATTAGAATAAGGAAAAAAGTTGCACGAAATTAAGGCAGG
GGAAGAGACAGTTCATTGAAGGTAGCCTGGAGGAAAATGTAGAGAATATTAAAAGAAAGAAATTGAACCTCGAAGATTTGGGAGTGGTTTCATGCTCTTTATTGGCGAAC
CCTGGTTCCTAGGGTCGCCATTCACCATGAAAATCATATGTTGGAATGTCAAG
Protein sequenceShow/hide protein sequence
MEKNRIIVSGPWFFNGAIILFDKPNMLAQPDDISLIHVDFWVQLHNIPLFCLEEEVLKLLGKSIGEVLQIDTFPFEGIWSRFARIRIRVDITKPLRRGLKIKMQDDGEDR
WIPCKYEKLPEFCFTYGLVGHIQNECSLPSDVSENAGELNYGPNMRGVEFKKQGTFKQTDERYVGSGKDNGNPKVGFDKLEKSSNQRSNKDWRKNDALNTDPFDKGSLPL
HMEKVSSKAELKGNVNKVVTEDMIQKNGGKVHENHLTFPFTKEFDISEGNESIRFSKLTEEDSSNLKMMEEDDNPSSVLANQTLE