CuGenDBv2

Tan0008717 (gene) of Snake gourd v1 genome

Gene ID: Tan0008717
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG04:23238161..23239222
RNA-Seq Expression: Tan0008717
Synteny: Tan0008717
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
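
The IPR025836 entry names the zinc knuckle spacing pattern CX2CX4HX4C (Cys, two residues, Cys, four residues, His, four residues, Cys). As a minimal sketch, that spacing can be checked against the protein sequence of this record with a regular expression. The function name `find_zinc_knuckles` and the choice of fragment are illustrative assumptions. A plain regex match is only a spacing check and is not how InterPro assigns domains.

```python
import re

# CX2CX4HX4C written as a regular expression:
# Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start, matched text) for each non-overlapping match.

    Hypothetical helper for illustration; it only tests residue spacing.
    """
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment copied from the protein sequence of Tan0008717 (see the
# "Protein sequence" section of this record).
fragment = "RYEKLPEFCFNCGRLGHTQKECDSCIEQGEG"

for start, motif in find_zinc_knuckles(fragment):
    print(start, motif)
```

Passing the full protein sequence instead of the fragment would report positions relative to the first methionine.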


Homology
GenBank top hits (e-value, % identity, alignment)
XP_015381281.1 uncharacterized protein LOC107174698 [Citrus sinensis] (e-value: 1.6e-34; identity: 42.33%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        M+N WR  K+  I+ +GEN F FKF    EK R++  GPW F+ A+++  EP+ +   +N   TH+ FWVQ +NIP+ CMN E ++ +G  LG+V  V+T
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC
         + G   G FAR R+ I+IT PL+ + + + + E    +P  YEKLP+FCF C  +GH  +EC
Subjt:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC

XP_015387877.1 uncharacterized protein LOC107177871 [Citrus sinensis] (e-value: 1.4e-35; identity: 42.94%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        M+N WR  KD  ++ +GEN F FKF S  EK R++  GPW F+ A+++  EP+ +   +N   TH+ FWVQ +NIP+ CMN E ++ +G  LG+V  V+T
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC
         + G   G FAR R+ I+IT PL+ + + + + E    +P  YEKLP+FCF C  +GH  +EC
Subjt:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] (e-value: 4.0e-38; identity: 42.42%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        MK+VWRVH  T  + +G N++   FKS+ EK+R++  GPW FN ++++   P    QP ++      FW+Q +NIP  C++ E+   +G+ LG+V  ++ 
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECD
            G  G F R+RVKID++KPL RG+K+K  D  +D W P RYEKLP+FC+ CG++GH+ +EC+
Subjt:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECD

XP_028102454.1 uncharacterized protein LOC114301689 [Camellia sinensis] (e-value: 7.1e-35; identity: 40.98%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        +  VWR  K      IG NLF F+F  + +K R++ +GPW F+  +IL  E +V  QP  + L+ A FW+Q +N+PL  M +EV   +G+ +G +++V+ 
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGE-GVGLS-RYGP
           G   GR+ R+RV I++ KPL RG+K+  V N E  W+  +YE+LP FC+ CG LGH++KEC++ + Q E G G + +YGP
Subjt:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGE-GVGLS-RYGP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e-value: 3.5e-34; identity: 38.01%)
Query:  NVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSS
        +VW+  K   +  IG+NLF F F  +++K R++ DGPW F+  +++  E +   QP +++LT   FWV   N+PL  MN++V   +G+++G+ +++D   
Subjt:  NVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSS

Query:  LGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGEG
         G +WGR  R+RV +D+ KPL RG+K+ +  + E  W+  +YE+LP +C+ CGRLGH+++ECD  +   +G
Subjt:  LGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGEG

TrEMBL top hits (e-value, % identity, alignment)
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) (e-value: 2.2e-34; identity: 41.57%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        M+ VWR  ++  I+ +GEN+F FKF S ++K  I+  GPW F+ A+I   EP  +   +    +H  FWVQ +++P+ CM++++   +G  +G+V  V+T
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDR-WIPCR--YEKLPEFCFNCGRLGHTQKEC
         + G  +G+F RLR+ +DITKPL+ + I++   EED   IP R  YE+LP+FCF CGR+GH  +EC
Subjt:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDR-WIPCR--YEKLPEFCFNCGRLGHTQKEC

A0A5E4G034 PREDICTED: DUF4283 domain-containing (e-value: 4.2e-33; identity: 39.11%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        M  +W   ++  +  IGENLF F F +  ++ R++  GPW F+ A++L + P+    P  + L +ADFW+Q +N+PL CM+  + R +G+S G  L+V  
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGEGVGLSRYG
           G   GRF RLRV +D++KPLR  K   + +    ++  RYE+LPEFCF CGRLGH  KEC    +  +      YG
Subjt:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGEGVGLSRYG

A0A6J1D765 uncharacterized protein LOC111017902 (e-value: 2.0e-38; identity: 42.42%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        MK+VWRVH  T  + +G N++   FKS+ EK+R++  GPW FN ++++   P    QP ++      FW+Q +NIP  C++ E+   +G+ LG+V  ++ 
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECD
            G  G F R+RVKID++KPL RG+K+K  D  +D W P RYEKLP+FC+ CG++GH+ +EC+
Subjt:  SSLGGIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECD

A0A6J1DU55 uncharacterized protein LOC111023135 (e-value: 5.0e-34; identity: 31.89%)
Query:  WRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLG
        W+V    +++ IG+NLF F F    + NR++  GPWFF+ A+I+ Q+P   +    L      FW+  +++P+  +N+ +   +G+++G  ++VD +  G
Subjt:  WRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLG

Query:  GIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDS--CIEQGEGVGLSRYGPNLHTVEFKRVLDK--KGEARI
          WG   R+RV IDITKPL RG+KI I       WIP +YE+LP+FC+ CG +GH+  +CD+     Q +    S YGP L  V  K    K  KG++  
Subjt:  GIWGRFARLRVKIDITKPL-RGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDS--CIEQGEGVGLSRYGPNLHTVEFKRVLDK--KGEARI

Query:  REDYEPLKSLKTVKENVKNNLNINKIKGFEWKKTEKKEV--EVGFLQK--AEATNDESKMASSNMFERNKEA---DHVLMVSANNL--TEKGKLVKGMEP
        RED     S+             +K +G E  K +  E   + GF  +  AE T D    AS+       E+   DH    S N+   + +G   K M  
Subjt:  REDYEPLKSLKTVKENVKNNLNINKIKGFEWKKTEKKEV--EVGFLQK--AEATNDESKMASSNMFERNKEA---DHVLMVSANNL--TEKGKLVKGMEP

Query:  LIMEEIMTDLVNEGETRSKGFDL
        ++ +    DL+N+  + ++  DL
Subjt:  LIMEEIMTDLVNEGETRSKGFDL

A0A6P5RUR1 uncharacterized protein At4g02000-like (e-value: 1.1e-33; identity: 41.46%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        M  +W   ++     +GENLF F F   L++NR++ +GPW F+ A++L +EPN    P  + L  A+FWVQ +N+PL  M  +  R +G+ +GE ++V  
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLR-GLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC
           G   GRF R+RVK+DITKPL+ G KI +   +++R +  RYE+LP+FC+NC R+GH    C
Subjt:  SSLGGIWGRFARLRVKIDITKPLR-GLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G01050.1 zinc ion binding;nucleic acid binding (e-value: 2.2e-13; identity: 26.38%)
Query:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT
        ++ +W+     ++  +    F  +F+   E    +  GPW   G  +L Q+ +    P    +     WV+  NIP    +  +L  +   LG  L VD 
Subjt:  MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDT

Query:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC
        +++    GRFAR+ +++++ KPL+G     V    DR+    YE L + C +CG  GH    C
Subjt:  SSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC

AT2G02103.1 unknown protein (e-value: 1.1e-12; identity: 32%)
Query:  PWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRW
        PW FN   +      V   P +  +T  D WVQ   IPL  ++EE +  +   LGE++++D          F R+RV+  IT  LR  +  I D+ E   
Subjt:  PWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRW

Query:  IPCRYEKLPEFCFNCGRLGHTQKEC
        I  +YE+L   C +C R  H +  C
Subjt:  IPCRYEKLPEFCFNCGRLGHTQKEC

AT2G41590.1 unknown protein (e-value: 1.3e-13; identity: 30.5%)
Query:  FKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKP
        F F++ ++   +    PW FN   +      V   P +  +T  D WVQ   IPL  ++EE +  +   LGEVL +D      I   + R+RV+  IT  
Subjt:  FKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKP

Query:  LRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC
        LR  +  + D+ E   I  +YE+L   C +C R  H +  C
Subjt:  LRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKEC

AT5G25200.1 unknown protein (e-value: 1.4e-12; identity: 32%)
Query:  PWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRW
        PW FN   +      V   P +  +T  D WVQ   IPL  ++EE +  +   LGE++++D          F R+RV+  IT  LR  +  I D+ E   
Subjt:  PWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRVKIDITKPLRGLKIKIVDNEEDRW

Query:  IPCRYEKLPEFCFNCGRLGHTQKEC
        I  +YE+L   C +C R  H +  C
Subjt:  IPCRYEKLPEFCFNCGRLGHTQKEC

AT5G36228.1 nucleic acid binding;zinc ion binding (e-value: 5.1e-15; identity: 30.13%)
Query:  IGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRV
        + +  F  +F+S ++    +   PW FN   I  Q     + P    LT  D WV    IPL  ++E  +  + S+LGEV+ +D +        F R++V
Subjt:  IGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRFARLRV

Query:  KIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGE
        ++D T+PLR  +     + E   I   YEKL   C NC R+ H    C   + Q E
Subjt:  KIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGE


Sequences
CDS sequence
ATGAAGAATGTTTGGAGGGTTCATAAGGATACATCTATTGATTGCATTGGGGAAAACCTATTTTTCTTTAAATTCAAATCAATTTTGGAGAAGAATAGGATTGTCTGTGA
CGGCCCTTGGTTTTTTAATGGAGCTATCATACTTTTTCAAGAGCCAAACGTTCTAGAACAACCTGAAAACTTGAGACTGACTCATGCTGACTTCTGGGTCCAATTTTATA
ATATCCCTTTATTTTGTATGAACGAAGAGGTTCTTAGATCGATGGGGAGCTCATTAGGAGAAGTCCTAAATGTGGATACTTCCTCGTTGGGAGGAATTTGGGGTAGATTT
GCTCGCTTACGAGTAAAGATTGATATTACTAAACCTCTAAGAGGGCTGAAAATCAAAATTGTCGATAATGAAGAGGATAGGTGGATTCCTTGTAGATATGAGAAGCTACC
CGAATTTTGCTTTAATTGTGGGCGATTAGGTCACACTCAAAAAGAATGTGATAGTTGCATTGAGCAAGGGGAGGGAGTTGGTCTTTCTAGATATGGCCCAAATCTACATA
CAGTAGAGTTTAAAAGAGTTCTGGATAAAAAAGGGGAAGCCAGAATTAGGGAGGATTATGAACCCTTAAAAAGTTTGAAAACAGTCAAGGAGAATGTGAAGAATAATTTA
AATATTAATAAGATTAAGGGTTTTGAATGGAAGAAGACTGAGAAGAAGGAGGTTGAGGTTGGTTTTCTACAGAAAGCTGAAGCTACTAATGACGAAAGTAAGATGGCTAG
TTCTAATATGTTTGAAAGGAATAAAGAGGCTGATCATGTATTGATGGTTAGTGCTAATAACCTAACTGAGAAAGGAAAATTAGTAAAAGGCATGGAGCCTTTGATTATGG
AGGAGATTATGACAGATCTAGTAAATGAAGGGGAAACTCGATCTAAGGGATTTGATTTGAATTCCTTGGAAGGTGAAGATTTGATGATCGAGAGACATTTGGAGTTAGAC
CCTAAGGAAAACAATAAATCTCATTCATCTAAAAAGAAGAAATGGCATAGGCTTTTCATTGATAAAGAATAA
mRNA sequence
ATGAAGAATGTTTGGAGGGTTCATAAGGATACATCTATTGATTGCATTGGGGAAAACCTATTTTTCTTTAAATTCAAATCAATTTTGGAGAAGAATAGGATTGTCTGTGA
CGGCCCTTGGTTTTTTAATGGAGCTATCATACTTTTTCAAGAGCCAAACGTTCTAGAACAACCTGAAAACTTGAGACTGACTCATGCTGACTTCTGGGTCCAATTTTATA
ATATCCCTTTATTTTGTATGAACGAAGAGGTTCTTAGATCGATGGGGAGCTCATTAGGAGAAGTCCTAAATGTGGATACTTCCTCGTTGGGAGGAATTTGGGGTAGATTT
GCTCGCTTACGAGTAAAGATTGATATTACTAAACCTCTAAGAGGGCTGAAAATCAAAATTGTCGATAATGAAGAGGATAGGTGGATTCCTTGTAGATATGAGAAGCTACC
CGAATTTTGCTTTAATTGTGGGCGATTAGGTCACACTCAAAAAGAATGTGATAGTTGCATTGAGCAAGGGGAGGGAGTTGGTCTTTCTAGATATGGCCCAAATCTACATA
CAGTAGAGTTTAAAAGAGTTCTGGATAAAAAAGGGGAAGCCAGAATTAGGGAGGATTATGAACCCTTAAAAAGTTTGAAAACAGTCAAGGAGAATGTGAAGAATAATTTA
AATATTAATAAGATTAAGGGTTTTGAATGGAAGAAGACTGAGAAGAAGGAGGTTGAGGTTGGTTTTCTACAGAAAGCTGAAGCTACTAATGACGAAAGTAAGATGGCTAG
TTCTAATATGTTTGAAAGGAATAAAGAGGCTGATCATGTATTGATGGTTAGTGCTAATAACCTAACTGAGAAAGGAAAATTAGTAAAAGGCATGGAGCCTTTGATTATGG
AGGAGATTATGACAGATCTAGTAAATGAAGGGGAAACTCGATCTAAGGGATTTGATTTGAATTCCTTGGAAGGTGAAGATTTGATGATCGAGAGACATTTGGAGTTAGAC
CCTAAGGAAAACAATAAATCTCATTCATCTAAAAAGAAGAAATGGCATAGGCTTTTCATTGATAAAGAATAA
Protein sequence
MKNVWRVHKDTSIDCIGENLFFFKFKSILEKNRIVCDGPWFFNGAIILFQEPNVLEQPENLRLTHADFWVQFYNIPLFCMNEEVLRSMGSSLGEVLNVDTSSLGGIWGRF
ARLRVKIDITKPLRGLKIKIVDNEEDRWIPCRYEKLPEFCFNCGRLGHTQKECDSCIEQGEGVGLSRYGPNLHTVEFKRVLDKKGEARIREDYEPLKSLKTVKENVKNNL
NINKIKGFEWKKTEKKEVEVGFLQKAEATNDESKMASSNMFERNKEADHVLMVSANNLTEKGKLVKGMEPLIMEEIMTDLVNEGETRSKGFDLNSLEGEDLMIERHLELD
PKENNKSHSSKKKKWHRLFIDKE
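
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. This is a minimal sketch: the helper names `translate` and `span_length` are hypothetical, and the locus coordinates are assumed to be 1-based and inclusive. It shows that the first and last codons of the CDS give the first and last residues of the protein, and computes the length of the genomic span listed under Genome location.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises ValueError if the length is not a multiple of 3 and KeyError
    on codons containing characters other than A, C, G, T.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def span_length(location):
    """Length of a 'chrom:start..end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(x) for x in location.split(":")[1].split(".."))
    return end - start + 1


# First 30 nt and last 12 nt of the CDS sequence of Tan0008717.
print(translate("ATGAAGAATGTTTGGAGGGTTCATAAGGAT"))
print(translate("GATAAAGAATAA"))
print(span_length("LG04:23238161..23239222"))
```

Pasting the whole CDS (line breaks included) into `translate` would allow a direct comparison with the full protein sequence.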