CuGenDBv2

Moc09g14540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g14540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr9:12545683..12546426
RNA-Seq Expression: Moc09g14540
Synteny: Moc09g14540
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
                  IPR036875 - Zinc finger, CCHC-type superfamily
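
The genome location field above can be turned into a locus length with a few lines of code. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location: str) -> int:
    """Locus length in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(span_length("chr9:12545683..12546426"))  # prints 744
```

Under the inclusive-coordinate assumption, the locus spans 744 bp.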


Homology
GenBank top hits | e value | %identity | Alignment
CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera] | 4.3e-55 | 49.19
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   +K HLPL G KPE MK E+W  LDR+VLG+IR TLS++V H+VV E TT  LMK LS +YEKP A NK  L+TKLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRGK
        +LKMA+     +HLNEFN I N+L S++++FD+E +A+I+L SLP+SWE M++  + S G  KLK  DI D   AEEIRRRD G T    S+   + RG+
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRGK

Query:  NIDNKPCKGKG-------NKGRGKGKQDIECYYCHKKGHIKANCRELK
          +    +G+        N+ + +  Q ++C+ C K GH K  C+  K
Subjt:  NIDNKPCKGKG-------NKGRGKGKQDIECYYCHKKGHIKANCRELK

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | 3.2e-58 | 51.45
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD
        I KFDGTDF +W+MQI+D++Y   KK H PL  KPE M +E+W  LDR+VLG+IR TLSKNV H+V  E TT GLMK LSD+YEKP A NK  L+ KLF 
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD

Query:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--
        LKM +G P   H+NEFN I+N+L S+++EFD+E +A+IL+ SLP+SWE M+   + S G  KLK VD+ D    EE+RR D G T T  S+F  + RG  
Subjt:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--

Query:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIKANC
           +N +    K +  KG+ K ++ +EC+ C K GH K+NC
Subjt:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIKANC

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | 7.2e-58 | 51.45
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD
        I KFDGTDF +W+MQI+D++Y   KK H PL  KPE M +E+W  LDR+VLG+IR TLSKNV H+V  E TT GLMK LSD+YEKP A NK  L+ KLF 
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD

Query:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--
        LKM +G P   H+NEFN I+N+L S+++EFD+E +A+ILL SLP+SWE M+   + S G  KLK VD+ D    EE+RR D G T +  S+F  + RG  
Subjt:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--

Query:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIKANC
           +N +    K +  KG+ K ++ +EC+ C K GH K+NC
Subjt:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIKANC

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | 1.8e-56 | 51.68
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD
        I KFDGTDF +W+MQI+D++Y   KK H PL  KPE M +E+W  LDR+VLG+IR TLSKNV H+V  E TT GLMK LSD+YEKP A NK  L+ KLF 
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLFD

Query:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--
        LKM +G P   H+NEFN I+N+L S+++EFD+E +A+ILL SLP+SWE M+   + S G  KLK VD+ D    EE+RR D G T T  S+F  + RG  
Subjt:  LKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRG--

Query:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIK
           +N +    K +  KG+ K ++ +EC+ C K GH K
Subjt:  ---KNIDNKPCKGKGNKGRGKGKQDIECYYCHKKGHIK

RVW25839.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 5.7e-55 | 48.79
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   +K HLPL G KPE MK E+W  LDR+VLG+IR TLS++V H+VV E TT  LMK LS +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRGK
        +LKMA+     +HLNEFN I N+L S++++FD+E +A+I+L SLP+SWE M++  + S G  KLK  DI D   AEEIRRRD G T    S+   + RG+
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRGK

Query:  NIDNKPCKGKG-------NKGRGKGKQDIECYYCHKKGHIKANCRELK
          +    +G+        N+ + +  Q ++C+ C K GH K  C+  K
Subjt:  NIDNKPCKGKG-------NKGRGKGKQDIECYYCHKKGHIKANCRELK

TrEMBL top hits | e value | %identity | Alignment
A0A2N9G6Q3 Uncharacterized protein | 2.4e-59 | 53.06
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   KK HLPL G KPEDM++ +W  LDR+VLG+IR TLS+ V H+VV E TT  LM  L  +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR
        +LKMA+G    +HLNEFN I N+L S+++EFD+E +A+I+L SLP+SWE M++  + S G GKLK  DI D    EE+RRRD G T +  S  + +A+GR
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR

Query:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK
        GK+ +    + K  KGR K K  + +EC+ C K GHI+ NC ELK
Subjt:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK

A0A2N9GHK9 Uncharacterized protein | 2.4e-59 | 53.06
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   KK HLPL G KPEDM++ +W  LDR+VLG+IR TLS+ V H+VV E TT  LM  L  +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR
        +LKMA+G    +HLNEFN I N+L S+++EFD+E +A+I+L SLP+SWE M++  + S G GKLK  DI D    EE+RRRD G T +  S  + +A+GR
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR

Query:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK
        GK+ +    + K  KGR K K  + +EC+ C K GHI+ NC ELK
Subjt:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK

A0A2N9IKI1 Uncharacterized protein | 2.4e-59 | 53.06
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   KK HLPL G KPEDM++ +W  LDR+VLG+IR TLS+ V H+VV E TT  LM  L  +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR
        +LKMA+G    +HLNEFN I N+L S+++EFD+E +A+I+L SLP+SWE M++  + S G GKLK  DI D    EE+RRRD G T +  S  + +A+GR
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR

Query:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK
        GK+ +    + K  KGR K K  + +EC+ C K GHI+ NC ELK
Subjt:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK

A0A2N9J3Y8 Uncharacterized protein | 2.4e-59 | 53.06
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   KK HLPL G KPEDM++ +W  LDR+VLG+IR TLS+ V H+VV E TT  LM  L  +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR
        +LKMA+G    +HLNEFN I N+L S+++EFD+E +A+I+L SLP+SWE M++  + S G GKLK  DI D    EE+RRRD G T +  S  + +A+GR
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR

Query:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK
        GK+ +    + K  KGR K K  + +EC+ C K GHI+ NC ELK
Subjt:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK

A0A2N9JBD5 Uncharacterized protein | 2.4e-59 | 53.06
Query:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF
        I+KFDGTDF YW+MQI+D++Y   KK HLPL G KPEDM++ +W  LDR+VLG+IR TLS+ V H+VV E TT  LM  L  +YEKP A NK  L+ KLF
Subjt:  IDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKG-KPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYA-NKFCLLTKLF

Query:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR
        +LKMA+G    +HLNEFN I N+L S+++EFD+E +A+I+L SLP+SWE M++  + S G GKLK  DI D    EE+RRRD G T +  S  + +A+GR
Subjt:  DLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPIS--SFKAKGR

Query:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK
        GK+ +    + K  KGR K K  + +EC+ C K GHI+ NC ELK
Subjt:  GKNIDNKPCKGKGNKGRGKGK--QDIECYYCHKKGHIKANCRELK

SwissProt top hits | e value | %identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.9e-25 | 31.15
Query:  FTIDKFDGTD-FDYWKMQIKDFIYVMDKKFHLPL---KGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIY-EKPYANKFCL
        + + KF+G + F  W+ +++D +  + +  H  L     KP+ MK EDW  LD +    IR  LS +VV+++++E T  G+   L  +Y  K   NK  L
Subjt:  FTIDKFDGTD-FDYWKMQIKDFIYVMDKKFHLPL---KGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIY-EKPYANKFCL

Query:  LTKLFDLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFA-EEIRRRDFGVTFTPISSFK
          +L+ L M++G  F+ HLN FN +I +L +L V+ +EE KAI+LL SLP S++   + TT   G   +++ D+  A    E++R++        I+  +
Subjt:  LTKLFDLKMADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFA-EEIRRRDFGVTFTPISSFK

Query:  AKGRGKNIDNKPCKGKGNKGRGKGKQDI-ECYYCHKKGHIKANC
         +   ++ +N    G   K + + K  +  CY C++ GH K +C
Subjt:  AKGRGKNIDNKPCKGKGNKGRGKGKQDI-ECYYCHKKGHIKANC

Arabidopsis top hits | e value | %identity | Alignment
AT3G29785.1 unknown protein | 3.5e-18 | 50
Query:  DKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYAN
        DK DGT + + +M+I+D++Y   KK H PL  K E M ++DW  L R+VL +IR T+SKN+ H+V  E +  GLMK LSDIY+KP  N
Subjt:  DKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYAN


Sequences
CDS sequence
ATGGCTAGCTTTGATGGATTTACTATTGACAAATTTGATGGGACAGACTTTGATTACTGGAAGATGCAGATAAAGGATTTTATATATGTCATGGACAAGAAATTTCATCT
ACCATTGAAAGGAAAACCAGAAGATATGAAAGAAGAAGATTGGAAGCATCTTGACAGAAAAGTACTAGGAATTATTCGCTCGACTTTGTCAAAGAACGTCGTTCACCATG
TGGTAAATGAGACAACGACGGTTGGTCTGATGAAAACCTTGTCAGACATATATGAGAAGCCCTATGCGAACAAGTTTTGTCTTTTGACAAAACTCTTTGATTTAAAAATG
GCCGATGGTGTGCCTTTTGTTGAACATTTGAATGAATTCAATAGGATAATCAACAAGTTAATCTCTCTGAAGGTTGAATTCGACGAAGAACCGAAAGCTATCATTTTGTT
GATATCTTTACCCGATAGTTGGGAAGTAATGAAAGTAACTACAACCTACTCTTTTGGTACTGGAAAGTTGAAAATTGTAGATATTATAGATGCAGCTTTTGCAGAAGAGA
TTCGTAGAAGGGATTTTGGTGTGACTTTTACTCCGATTTCAAGCTTCAAAGCCAAAGGGAGAGGGAAAAATATTGACAATAAGCCTTGTAAAGGCAAAGGCAACAAAGGT
AGAGGAAAAGGTAAGCAAGATATTGAGTGCTACTACTGCCACAAGAAGGGTCACATAAAAGCAAACTGTAGAGAGTTGAAATGA
mRNA sequence
ATGGCTAGCTTTGATGGATTTACTATTGACAAATTTGATGGGACAGACTTTGATTACTGGAAGATGCAGATAAAGGATTTTATATATGTCATGGACAAGAAATTTCATCT
ACCATTGAAAGGAAAACCAGAAGATATGAAAGAAGAAGATTGGAAGCATCTTGACAGAAAAGTACTAGGAATTATTCGCTCGACTTTGTCAAAGAACGTCGTTCACCATG
TGGTAAATGAGACAACGACGGTTGGTCTGATGAAAACCTTGTCAGACATATATGAGAAGCCCTATGCGAACAAGTTTTGTCTTTTGACAAAACTCTTTGATTTAAAAATG
GCCGATGGTGTGCCTTTTGTTGAACATTTGAATGAATTCAATAGGATAATCAACAAGTTAATCTCTCTGAAGGTTGAATTCGACGAAGAACCGAAAGCTATCATTTTGTT
GATATCTTTACCCGATAGTTGGGAAGTAATGAAAGTAACTACAACCTACTCTTTTGGTACTGGAAAGTTGAAAATTGTAGATATTATAGATGCAGCTTTTGCAGAAGAGA
TTCGTAGAAGGGATTTTGGTGTGACTTTTACTCCGATTTCAAGCTTCAAAGCCAAAGGGAGAGGGAAAAATATTGACAATAAGCCTTGTAAAGGCAAAGGCAACAAAGGT
AGAGGAAAAGGTAAGCAAGATATTGAGTGCTACTACTGCCACAAGAAGGGTCACATAAAAGCAAACTGTAGAGAGTTGAAATGA
Protein sequence
MASFDGFTIDKFDGTDFDYWKMQIKDFIYVMDKKFHLPLKGKPEDMKEEDWKHLDRKVLGIIRSTLSKNVVHHVVNETTTVGLMKTLSDIYEKPYANKFCLLTKLFDLKM
ADGVPFVEHLNEFNRIINKLISLKVEFDEEPKAIILLISLPDSWEVMKVTTTYSFGTGKLKIVDIIDAAFAEEIRRRDFGVTFTPISSFKAKGRGKNIDNKPCKGKGNKG
RGKGKQDIECYYCHKKGHIKANCRELK
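
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame. The name `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS listed above.
    print(translate("ATGGCTAGCTTTGATGGATTT"))  # prints MASFDGF
```

Passing the full CDS, with or without its line breaks, should reproduce the listed protein sequence if the record is internally consistent.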