CuGenDBv2

Lag0020132 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020132
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr5:48351174..48352175
RNA-Seq Expression: Lag0020132
Synteny: Lag0020132
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like
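
The InterPro entry IPR025836 names the zinc knuckle pattern CX2CX4HX4C. As a rough illustration, that pattern can be written as a regular expression and searched for in a protein fragment. The sketch below uses a C-terminal fragment copied from the alignments in this record; the function name `find_zinc_knuckles` is hypothetical, and a regex hit is only a hint, not a substitute for the InterPro domain models.

```python
import re

# CX2CX4HX4C: Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_zinc_knuckles(protein):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(protein)]

# Fragment of the Lag0020132 protein, copied from the alignments in this record.
fragment = "YERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPR"
hits = find_zinc_knuckles(fragment)
print(hits)
```

Because `finditer` reports non-overlapping matches only, overlapping candidates would need a lookahead-based pattern instead.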


Homology
GenBank top hits (e value, % identity, alignment)
OMO70721.1 Zinc finger, CCHC-type [Corchorus olitorius]; e value: 2.5e-37; % identity: 33.87
Query:  EELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD
        E+L + WK+F L  +E+ + I+ D+ +   + DQ    +VGKLL+ +     A  N ++  WK  K  ++ +  +N+F FKF  + D+  V+   PWTF 
Subjt:  EELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD

Query:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTIR
        N+L++      + R  +  F  A+FWIR+ +L +G R R +A  IG  IG+ +D  +  +   W   +R+RV +D+TKPL R  ++K  D  G     + 
Subjt:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTIR

Query:  YERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPR
        YER P  C+ CG+IGHV++DC   K  KE   +  ++G W+   S  R
Subjt:  YERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]; e value: 3.0e-43; % identity: 36.69
Query:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTR-KNFKVESTGKNIFNFKFECQEDRNWVMCNGP
        M    L+ EWKNF L  +E    + +D   +      ++  ++ KLLS R I+   +KN L+ AWK   K F V+  G NIF F F    DRN ++  GP
Subjt:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTR-KNFKVESTGKNIFNFKFECQEDRNWVMCNGP

Query:  WTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW
        WTFD +LI+++ P++  + ++M+F+N   W+   +L +   N+ +A+++GN+IG F D         WG  +R+RVR D+ KPL RG  L      G CW
Subjt:  WTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW

Query:  VTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQ
        + I+YERLP+  + CG++ H+ KDC       +S     ++G WLRFQ
Subjt:  VTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQ

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]; e value: 4.8e-49; % identity: 41.13
Query:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPW
        MD E L+ +W+ F L  +E    + +D + +   +  + + +VGKLL+ RII+   +   L  AWK      VES GKN+F F F  + D N VM  GPW
Subjt:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPW

Query:  TFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEE-LVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW
         FD +LIVL+ P +++   E+EF    FWI L +LPM + N+ +A ++GN+IG+F+D  +C+E+   WG S+RIRV +DITKPL RG  +      G CW
Subjt:  TFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEE-LVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW

Query:  VTIRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRF
        + I+YERLP+ C+ CG IGH + DC++     ++ +    E+G WLRF
Subjt:  VTIRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value: 4.1e-40; % identity: 35.11
Query:  ELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKN-FKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD
        +L+ EWKNF L  +E+   I +D         +++  +VGKL   R I    +KN +R AWK   N F+V+S G N+F F F    DRN +  +GPWTFD
Subjt:  ELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKN-FKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD

Query:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDE-ELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTI
         +L+++  P+A     E++F     W+R  +LP+G   R +A ++GN++G F +  +CD+    WG ++R+RV LDI+KPL RG  L      G  W+ I
Subjt:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDE-ELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTI

Query:  RYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPRFTKKPDSPTNSNYNQDFQKDQSGEGGKDREIGP
        +YERLP+ C+ CG                 S+  K ++G+WLR+Q     T KP  P      +D   D+SG         P
Subjt:  RYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPRFTKKPDSPTNSNYNQDFQKDQSGEGGKDREIGP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]; e value: 1.7e-38; % identity: 35.47
Query:  MEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTFDNSLIVLEYPMAN
        +  E+   +++  E  + +  + D C+VGKLL+ R     A+KN L   W+  K  +V   G N+F F F    D+  V+ +GPWTFD  L++L     N
Subjt:  MEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTFDNSLIVLEYPMAN

Query:  QRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTIRYERLPELCFLCG
         +  +++     FW+ + NLP+   N+KV   +GN++G FID    D  +VWG++MRIRV LD+ KPL RG  L     +   WV  +YERLP  C+ CG
Subjt:  QRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTIRYERLPELCFLCG

Query:  KIGHVAKDCESGKGRKE-SNMDKWEFGNWLRFQS
        ++GH  ++C+      + S +D  ++G WLR  +
Subjt:  KIGHVAKDCESGKGRKE-SNMDKWEFGNWLRFQS

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FNT0 RNase H domain-containing protein; e value: 2.4e-38; % identity: 36.29
Query:  VEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTF
        +E + + WKNF+L +KE G D+     D+     Q ++ +  K L++R++   A+    +  WKTR++F V+  G N   F FE   D   V+ N PWT+
Subjt:  VEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTF

Query:  DNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPD-IKGECWVT
        D  L+V +    ++   +  F +  FW++L NLP+  R  + A  IG SIG        ++E      MR+R+RL+I +PL RG ++K  + IKG  WV 
Subjt:  DNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPD-IKGECWVT

Query:  IRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRFQS
         RYERLP  C+ CG + H  KDC+ G + R+ SN  +++FG WLR  S
Subjt:  IRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRFQS

A0A2N9HYS7 Uncharacterized protein; e value: 3.2e-38; % identity: 36.29
Query:  VEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTF
        +E + + WKNF+L +KE G D+     D+     Q ++ +  K L++R++   A+    +  WKTR++F V+  G N   F FE   D   V+ N PWT+
Subjt:  VEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTF

Query:  DNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPD-IKGECWVT
        D  L+V +    ++   +  F +  FW++L NLP+  R  + A  IG SIG        ++E      MR+R+RL+I +PL RG ++K  + IKG  WV 
Subjt:  DNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPD-IKGECWVT

Query:  IRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRFQS
         RYERLP  C+ CG + H  KDC+ G + R+ SN  +++FG WLR  S
Subjt:  IRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRFQS

A0A6J1BSZ1 uncharacterized protein LOC111005481; e value: 1.5e-43; % identity: 36.69
Query:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTR-KNFKVESTGKNIFNFKFECQEDRNWVMCNGP
        M    L+ EWKNF L  +E    + +D   +      ++  ++ KLLS R I+   +KN L+ AWK   K F V+  G NIF F F    DRN ++  GP
Subjt:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTR-KNFKVESTGKNIFNFKFECQEDRNWVMCNGP

Query:  WTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW
        WTFD +LI+++ P++  + ++M+F+N   W+   +L +   N+ +A+++GN+IG F D         WG  +R+RVR D+ KPL RG  L      G CW
Subjt:  WTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW

Query:  VTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQ
        + I+YERLP+  + CG++ H+ KDC       +S     ++G WLRFQ
Subjt:  VTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQ

A0A6J1DU55 uncharacterized protein LOC111023135; e value: 2.3e-49; % identity: 41.13
Query:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPW
        MD E L+ +W+ F L  +E    + +D + +   +  + + +VGKLL+ RII+   +   L  AWK      VES GKN+F F F  + D N VM  GPW
Subjt:  MDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPW

Query:  TFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEE-LVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW
         FD +LIVL+ P +++   E+EF    FWI L +LPM + N+ +A ++GN+IG+F+D  +C+E+   WG S+RIRV +DITKPL RG  +      G CW
Subjt:  TFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEE-LVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECW

Query:  VTIRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRF
        + I+YERLP+ C+ CG IGH + DC++     ++ +    E+G WLRF
Subjt:  VTIRYERLPELCFLCGKIGHVAKDCESG-KGRKESNMDKWEFGNWLRF

A0A6J1DX30 uncharacterized protein LOC111024874; e value: 2.0e-40; % identity: 35.11
Query:  ELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKN-FKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD
        +L+ EWKNF L  +E+   I +D         +++  +VGKL   R I    +KN +R AWK   N F+V+S G N+F F F    DRN +  +GPWTFD
Subjt:  ELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKN-FKVESTGKNIFNFKFECQEDRNWVMCNGPWTFD

Query:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDE-ELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTI
         +L+++  P+A     E++F     W+R  +LP+G   R +A ++GN++G F +  +CD+    WG ++R+RV LDI+KPL RG  L      G  W+ I
Subjt:  NSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDE-ELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTI

Query:  RYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPRFTKKPDSPTNSNYNQDFQKDQSGEGGKDREIGP
        +YERLP+ C+ CG                 S+  K ++G+WLR+Q     T KP  P      +D   D+SG         P
Subjt:  RYERLPELCFLCGKIGHVAKDCESGKGRKESNMDKWEFGNWLRFQSFPRFTKKPDSPTNSNYNQDFQKDQSGEGGKDREIGP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G31430.1 unknown protein; e value: 5.5e-11; % identity: 27.85
Query:  FNFKFECQEDRNWVMCNGPWTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDIT
        F+F F  +E    V+  GPW F++ +I+L+     +  + + F    FW+++  +P  + NR V   IG ++G  +D     E +      R+ +  DIT
Subjt:  FNFKFECQEDRNWVMCNGPWTFDNSLIVLEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDIT

Query:  KPL--LRGFMLKGPDIKGECWVTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMD
         PL   R F            +  RYERL   C +CG + H    C    G +E   D
Subjt:  KPL--LRGFMLKGPDIKGECWVTIRYERLPELCFLCGKIGHVAKDCESGKGRKESNMD
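
The % identity column reports the share of aligned positions at which the query and the hit carry the same residue. In BLAST-style pairwise output, the middle line of each block shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. A minimal sketch of tallying one block, assuming the midline has already been stripped of its left-hand indent (the function name `midline_stats` and the toy strings are illustrative and not taken from this record):

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block."""
    # Trailing spaces are often trimmed when text is copied; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Toy example, made up for illustration.
ident, pos, length = midline_stats("ACDEFG", "AC+ F")
percent_identity = 100.0 * ident / length
print(ident, pos, length, round(percent_identity, 2))
```

For a real hit, the counts would be summed over every block of the alignment before dividing. The blocks displayed on this page may not cover the full alignment, so such a total need not reproduce the tabulated figure.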


Sequences
CDS sequence
ATGAACATGGATGTTGAAGAGTTAGTAAATGAATGGAAGAATTTCAATCTAATGGAGAAAGAGAAAGGAAATGACATAAAGCTAGATGAAGAAGATATGACAGAAATCAA
AGATCAGATGGACCACTGCATCGTTGGAAAGCTTCTCTCCAATCGAATCATTGCCCCATTGGCCATCAAGAATGCTTTGAGAGGAGCTTGGAAAACAAGGAAAAATTTCA
AAGTTGAATCCACTGGGAAAAATATATTCAACTTCAAGTTTGAATGCCAAGAAGACAGAAACTGGGTGATGTGCAATGGTCCATGGACTTTTGACAATTCACTAATAGTT
CTTGAATACCCAATGGCTAATCAAAGATCCGTGGAGATGGAGTTCAAGAATGCTGTTTTTTGGATAAGGTTAATAAACCTACCTATGGGATACAGAAACAGGAAAGTGGC
AAGTAAAATAGGGAATAGTATTGGAGATTTTATTGATGGAGGAGAATGTGATGAAGAGCTTGTTTGGGGCCAAAGTATGCGCATAAGAGTAAGGTTAGACATAACTAAGC
CTCTGTTGAGAGGCTTTATGCTCAAAGGACCTGATATCAAAGGGGAATGCTGGGTTACTATACGTTATGAAAGGTTGCCTGAGCTCTGTTTTCTTTGTGGCAAGATTGGG
CACGTTGCAAAGGATTGTGAAAGTGGAAAAGGAAGAAAGGAAAGTAACATGGACAAATGGGAATTTGGGAACTGGTTACGGTTCCAATCTTTCCCAAGATTCACCAAGAA
ACCTGATTCACCAACCAATTCAAATTATAACCAAGATTTTCAGAAAGACCAGTCTGGGGAAGGAGGAAAAGACAGAGAGATAGGGCCAAGTTATGGGGAGAAAGGCAAAG
CGATTATTGGGTATGATTTAGAAGCTGAAGAAGATCTTTATGATTTGCGAAACAAAGGGGAGGAAGGTACATCCTCTTGGACTTATGATCTAAAACAGTGGCAAACATGG
AGACTGAATTAA
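
The length of the CDS can be cross-checked against the genome location listed for this gene (chr5:48351174..48352175). The sketch below parses that string and computes the span, assuming the coordinates are 1-based and inclusive at both ends; that convention is an assumption here, and `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr5:48351174..48352175")
span = end - start + 1  # assumes 1-based, inclusive coordinates
print(chrom, span)
```

If the gene has a single exon and no annotated untranslated regions, the span would be expected to equal the length of the CDS once line breaks are removed.
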
mRNA sequence
ATGAACATGGATGTTGAAGAGTTAGTAAATGAATGGAAGAATTTCAATCTAATGGAGAAAGAGAAAGGAAATGACATAAAGCTAGATGAAGAAGATATGACAGAAATCAA
AGATCAGATGGACCACTGCATCGTTGGAAAGCTTCTCTCCAATCGAATCATTGCCCCATTGGCCATCAAGAATGCTTTGAGAGGAGCTTGGAAAACAAGGAAAAATTTCA
AAGTTGAATCCACTGGGAAAAATATATTCAACTTCAAGTTTGAATGCCAAGAAGACAGAAACTGGGTGATGTGCAATGGTCCATGGACTTTTGACAATTCACTAATAGTT
CTTGAATACCCAATGGCTAATCAAAGATCCGTGGAGATGGAGTTCAAGAATGCTGTTTTTTGGATAAGGTTAATAAACCTACCTATGGGATACAGAAACAGGAAAGTGGC
AAGTAAAATAGGGAATAGTATTGGAGATTTTATTGATGGAGGAGAATGTGATGAAGAGCTTGTTTGGGGCCAAAGTATGCGCATAAGAGTAAGGTTAGACATAACTAAGC
CTCTGTTGAGAGGCTTTATGCTCAAAGGACCTGATATCAAAGGGGAATGCTGGGTTACTATACGTTATGAAAGGTTGCCTGAGCTCTGTTTTCTTTGTGGCAAGATTGGG
CACGTTGCAAAGGATTGTGAAAGTGGAAAAGGAAGAAAGGAAAGTAACATGGACAAATGGGAATTTGGGAACTGGTTACGGTTCCAATCTTTCCCAAGATTCACCAAGAA
ACCTGATTCACCAACCAATTCAAATTATAACCAAGATTTTCAGAAAGACCAGTCTGGGGAAGGAGGAAAAGACAGAGAGATAGGGCCAAGTTATGGGGAGAAAGGCAAAG
CGATTATTGGGTATGATTTAGAAGCTGAAGAAGATCTTTATGATTTGCGAAACAAAGGGGAGGAAGGTACATCCTCTTGGACTTATGATCTAAAACAGTGGCAAACATGG
AGACTGAATTAA
Protein sequence
MNMDVEELVNEWKNFNLMEKEKGNDIKLDEEDMTEIKDQMDHCIVGKLLSNRIIAPLAIKNALRGAWKTRKNFKVESTGKNIFNFKFECQEDRNWVMCNGPWTFDNSLIV
LEYPMANQRSVEMEFKNAVFWIRLINLPMGYRNRKVASKIGNSIGDFIDGGECDEELVWGQSMRIRVRLDITKPLLRGFMLKGPDIKGECWVTIRYERLPELCFLCGKIG
HVAKDCESGKGRKESNMDKWEFGNWLRFQSFPRFTKKPDSPTNSNYNQDFQKDQSGEGGKDREIGPSYGEKGKAIIGYDLEAEEDLYDLRNKGEEGTSSWTYDLKQWQTW
RLN
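
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch using the standard genetic code; the helper name `translate` is hypothetical, and only the first and last codons of the CDS in this record are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    # Raises KeyError on ambiguous bases such as N.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

head = translate("ATGAACATGGATGTTGAAGAG")  # first 7 codons of the CDS above
tail = translate("AGACTGAATTAA")           # last 4 codons of the CDS above
print(head, tail)
```

The two fragments translate to the start (`MNMDVEE`) and end (`RLN` followed by a stop) of the listed protein sequence.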