; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031581 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031581
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:10357503..10358801
RNA-Seq ExpressionLag0031581
SyntenyLag0031581
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.4e-3433.99Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNI-VNGLIVEKLGANLFLFSLRTEAEQTRVMRQGP
        MA  +LL  W+ F LT+EE+   VD+D  A   T + L  SLI KLL+ R I   V++ T K AW +      V+ +G N+FLF+    +++ R++R GP
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNI-VNGLIVEKLGANLFLFSLRTEAEQTRVMRQGP

Query:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------
        W F++ L+ +   + + KP  M+FR    WVHF +L +   N++M  RLG+AIG F+D ++    + W   LRVRV  D+ +PL R              
Subjt:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------

Query:  ------------------DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKGRSSS
                          DH+ KDCS   +D  S +    YG W+ F+G  +S
Subjt:  ------------------DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKGRSSS

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]1.5e-4136.36Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW
        M  E+LLA+W+KF LT+EE+   +DVD  A  +  Q L +SL+ KLLA RII  DV+ R    AW + + L VE +G NLFLF    E +  RVM+ GPW
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW

Query:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR---------------
         F+K L+ L K       + +EF    FW+H  +LPM   N++M  RLG+AIG F D D   +G+ W  SLR+RV++D+T+PLRR               
Subjt:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR---------------

Query:  -----------------DHVSKDCSHLFL-DDGSSDGRSYYGMWMAF-----------KGRSSSVYRSPSTSPM-----GNNRIMMDTSPQQNPTPFQAQ
                          H S DC   +L     S   S YG W+ F           KG+S +   S  +S M     G        S Q N   FQ+Q
Subjt:  -----------------DHVSKDCSHLFL-DDGSSDGRSYYGMWMAF-----------KGRSSSVYRSPSTSPM-----GNNRIMMDTSPQQNPTPFQAQ

Query:  LPPEQRPD
           EQ  D
Subjt:  LPPEQRPD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.1e-3428.99Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLI-VEKLGANLFLFSLRTEAEQTRVMRQGP
        MA  DLL  W+ F LT+EEE T +DVD  A   T   L   L+ KL   R I   VM+ T + AW + N    V+ LG NLFLFS     ++ ++ + GP
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLI-VEKLGANLFLFSLRTEAEQTRVMRQGP

Query:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------
        W F++ L+ ++K + ++ P+ ++F     WV F +LP+    R M  RLG+A+G F++ D       W  +LRVRV+LD+++PLRR              
Subjt:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------

Query:  -----DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKG------------------RSSSVYRSPSTSPMGNNRIMMDTSPQQNPTPFQAQLPPEQRPDRNS
             + +   C H  L   SS  +  YG W+ ++G                  +S +   S STSP+G     + ++P   P     + P  + P + +
Subjt:  -----DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKG------------------RSSSVYRSPSTSPMGNNRIMMDTSPQQNPTPFQAQLPPEQRPDRNS

Query:  DHRNSTRSSLMTGSGSRPMEISPAMEENGLKILAENLPTLMPNQS
        +     +S ++   G + + +      N    L    P++ P+ S
Subjt:  DHRNSTRSSLMTGSGSRPMEISPAMEENGLKILAENLPTLMPNQS

XP_028071384.1 uncharacterized protein LOC114273772 [Camellia sinensis]4.2e-2335.68Show/hide
Query:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFS---LIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW
        + LL      SLT+EE+     V R     TS  +G S   L+ KLL  R    + M+ T    W    G+ V  +G NLF+F      ++ R++  GPW
Subjt:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFS---LIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW

Query:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR
         F+K+LL L +  P V+P+ ++  +  FWVH C LP+ L N+ + + +G+A+GQF D D    G  W  ++ +RV LD+ +PLRR
Subjt:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]1.9e-2335.16Show/hide
Query:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFN
        + LL      SLT+EE+   V +  ++  +        L+ KLL  R    + M+ T    W    G+ V  +G NLF+F      ++ RV+  GPW F+
Subjt:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFN

Query:  KYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR
        K+LL L +  P V+P+ ++     FWVH C LP+ L N+ + E +G+A+GQF D D    G  W  ++R+RV LD+ +PLRR
Subjt:  KYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR

TrEMBL top hitse value%identityAlignment
A0A1R3K847 Uncharacterized protein2.6e-2334.07Show/hide
Query:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFN
        E L A WE F+LT +EE  E+ V+      +       LI KLL+ R +  DVMR      W +  GL V ++G  L++F   +E E+ RV +QGPW FN
Subjt:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFN

Query:  KYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR
        K LL L      +    ++  +  FW+   +LP+     S+ + +GD+ G+  + D  G    W + LR+R  L++ +PLRR
Subjt:  KYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR

A0A2N9IPS8 Reverse transcriptase domain-containing protein4.5e-2334.91Show/hide
Query:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLG-FSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLF
        E+LL  W+KFSLT E+E+    +D   AM  S+ +G   L+ KL+  R    + ++      W +  G+ V+ +G NLF+F  +   E+ RVM   PWLF
Subjt:  EDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLG-FSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLF

Query:  NKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRRDHVSKDCSHLFLDDGSS
        + +LLAL++       + ++F    FWVHF  +P+    +   ER+G  +G   D D    G GW  SLRVR+ LD T+P+ R  +    S L L  G  
Subjt:  NKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRRDHVSKDCSHLFLDDGSS

Query:  DG--RSYYGMWM
         G     YG W+
Subjt:  DG--RSYYGMWM

A0A6J1BSZ1 uncharacterized protein LOC1110054816.7e-3533.99Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNI-VNGLIVEKLGANLFLFSLRTEAEQTRVMRQGP
        MA  +LL  W+ F LT+EE+   VD+D  A   T + L  SLI KLL+ R I   V++ T K AW +      V+ +G N+FLF+    +++ R++R GP
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNI-VNGLIVEKLGANLFLFSLRTEAEQTRVMRQGP

Query:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------
        W F++ L+ +   + + KP  M+FR    WVHF +L +   N++M  RLG+AIG F+D ++    + W   LRVRV  D+ +PL R              
Subjt:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------

Query:  ------------------DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKGRSSS
                          DH+ KDCS   +D  S +    YG W+ F+G  +S
Subjt:  ------------------DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKGRSSS

A0A6J1DU55 uncharacterized protein LOC1110231357.4e-4236.36Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW
        M  E+LLA+W+KF LT+EE+   +DVD  A  +  Q L +SL+ KLLA RII  DV+ R    AW + + L VE +G NLFLF    E +  RVM+ GPW
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPW

Query:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR---------------
         F+K L+ L K       + +EF    FW+H  +LPM   N++M  RLG+AIG F D D   +G+ W  SLR+RV++D+T+PLRR               
Subjt:  LFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR---------------

Query:  -----------------DHVSKDCSHLFL-DDGSSDGRSYYGMWMAF-----------KGRSSSVYRSPSTSPM-----GNNRIMMDTSPQQNPTPFQAQ
                          H S DC   +L     S   S YG W+ F           KG+S +   S  +S M     G        S Q N   FQ+Q
Subjt:  -----------------DHVSKDCSHLFL-DDGSSDGRSYYGMWMAF-----------KGRSSSVYRSPSTSPM-----GNNRIMMDTSPQQNPTPFQAQ

Query:  LPPEQRPD
           EQ  D
Subjt:  LPPEQRPD

A0A6J1DX30 uncharacterized protein LOC1110248741.5e-3428.99Show/hide
Query:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLI-VEKLGANLFLFSLRTEAEQTRVMRQGP
        MA  DLL  W+ F LT+EEE T +DVD  A   T   L   L+ KL   R I   VM+ T + AW + N    V+ LG NLFLFS     ++ ++ + GP
Subjt:  MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLI-VEKLGANLFLFSLRTEAEQTRVMRQGP

Query:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------
        W F++ L+ ++K + ++ P+ ++F     WV F +LP+    R M  RLG+A+G F++ D       W  +LRVRV+LD+++PLRR              
Subjt:  WLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRR--------------

Query:  -----DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKG------------------RSSSVYRSPSTSPMGNNRIMMDTSPQQNPTPFQAQLPPEQRPDRNS
             + +   C H  L   SS  +  YG W+ ++G                  +S +   S STSP+G     + ++P   P     + P  + P + +
Subjt:  -----DHVSKDCSHLFLDDGSSDGRSYYGMWMAFKG------------------RSSSVYRSPSTSPMGNNRIMMDTSPQQNPTPFQAQLPPEQRPDRNS

Query:  DHRNSTRSSLMTGSGSRPMEISPAMEENGLKILAENLPTLMPNQS
        +     +S ++   G + + +      N    L    P++ P+ S
Subjt:  DHRNSTRSSLMTGSGSRPMEISPAMEENGLKILAENLPTLMPNQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein6.6e-1133.05Show/hide
Query:  IVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYG
        +V+G I+E      F F    E     V+R+GPW FN +++ L +  P +      F F  FWV    +P    NR +VE +G A+GQ  D D       
Subjt:  IVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFNKYLLALSKRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYG

Query:  WKESLRVRVILDLTRPLR
          +  RV +  D+T PLR
Subjt:  WKESLRVRVILDLTRPLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTCGAAGACTTGCTCGCAAACTGGGAGAAATTCAGTCTAACAGCTGAAGAAGAAAACACTGAAGTTGATGTTGACCGCCAGGCGGCGATGGTGACAAGCCAATC
CCTTGGTTTCAGCTTGATCGAAAAGCTTCTAGCTCCTAGAATAATTTTTGGTGACGTAATGCGGCGAACATTCAAACCAGCTTGGAACATTGTCAACGGCCTGATTGTGG
AAAAATTAGGGGCAAACCTGTTCTTATTCTCTTTAAGGACGGAGGCAGAGCAAACTCGTGTTATGCGCCAAGGCCCGTGGCTCTTCAACAAATATCTTCTAGCTCTTTCC
AAGCGTATTCCGATGGTTAAACCCACGGCCATGGAATTCAGATTTGCTGTTTTCTGGGTACATTTCTGTGAGCTACCTATGGATCTCTATAATAGATCTATGGTTGAAAG
GTTGGGCGATGCGATCGGTCAGTTTCAAGACTACGATAATGGTGGTCGGGGTTATGGGTGGAAGGAAAGCCTCCGTGTCCGAGTAATCCTGGATCTAACGAGGCCTCTAC
GACGCGACCATGTTTCAAAAGATTGCAGTCATTTATTCTTGGATGATGGTTCTTCGGATGGGCGCTCGTATTATGGGATGTGGATGGCTTTTAAAGGTCGTTCCTCAAGT
GTCTATCGTTCACCAAGCACCAGCCCGATGGGAAATAATCGAATTATGATGGATACATCGCCACAACAGAATCCAACACCTTTTCAGGCTCAGTTACCTCCTGAACAGAG
ACCTGACCGGAACTCAGACCATCGGAATTCAACTCGATCATCTCTCATGACAGGTTCCGGCAGCCGACCAATGGAAATCTCGCCGGCAATGGAGGAAAACGGATTAAAGA
TACTGGCAGAGAATTTACCCACATTAATGCCCAATCAATCGACATTCAATGGCATTAATGTGGGGGATTTTTCAAAGGTGAAGAAAAAGCTCCAGTTTGAAACGGAAGGG
ACAGAGTGGGAGGACAAGCGTAAAGGGAAAGCGAGCGTTACGGAGATACCGATTAGCCATTCTTTCGAAGTGGGCCAGGGTTCGAATTCTGCTCCAGACTTACCGCAGGC
AGTTCCAACGACTCAAGTGATTTTTCAAAGTCATAATCCAAATATTCTTCAACCTGTAGCCAGGCTTTACTCGCTCTCGAGCCCATCAAACGGTGTGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATTCGAAGACTTGCTCGCAAACTGGGAGAAATTCAGTCTAACAGCTGAAGAAGAAAACACTGAAGTTGATGTTGACCGCCAGGCGGCGATGGTGACAAGCCAATC
CCTTGGTTTCAGCTTGATCGAAAAGCTTCTAGCTCCTAGAATAATTTTTGGTGACGTAATGCGGCGAACATTCAAACCAGCTTGGAACATTGTCAACGGCCTGATTGTGG
AAAAATTAGGGGCAAACCTGTTCTTATTCTCTTTAAGGACGGAGGCAGAGCAAACTCGTGTTATGCGCCAAGGCCCGTGGCTCTTCAACAAATATCTTCTAGCTCTTTCC
AAGCGTATTCCGATGGTTAAACCCACGGCCATGGAATTCAGATTTGCTGTTTTCTGGGTACATTTCTGTGAGCTACCTATGGATCTCTATAATAGATCTATGGTTGAAAG
GTTGGGCGATGCGATCGGTCAGTTTCAAGACTACGATAATGGTGGTCGGGGTTATGGGTGGAAGGAAAGCCTCCGTGTCCGAGTAATCCTGGATCTAACGAGGCCTCTAC
GACGCGACCATGTTTCAAAAGATTGCAGTCATTTATTCTTGGATGATGGTTCTTCGGATGGGCGCTCGTATTATGGGATGTGGATGGCTTTTAAAGGTCGTTCCTCAAGT
GTCTATCGTTCACCAAGCACCAGCCCGATGGGAAATAATCGAATTATGATGGATACATCGCCACAACAGAATCCAACACCTTTTCAGGCTCAGTTACCTCCTGAACAGAG
ACCTGACCGGAACTCAGACCATCGGAATTCAACTCGATCATCTCTCATGACAGGTTCCGGCAGCCGACCAATGGAAATCTCGCCGGCAATGGAGGAAAACGGATTAAAGA
TACTGGCAGAGAATTTACCCACATTAATGCCCAATCAATCGACATTCAATGGCATTAATGTGGGGGATTTTTCAAAGGTGAAGAAAAAGCTCCAGTTTGAAACGGAAGGG
ACAGAGTGGGAGGACAAGCGTAAAGGGAAAGCGAGCGTTACGGAGATACCGATTAGCCATTCTTTCGAAGTGGGCCAGGGTTCGAATTCTGCTCCAGACTTACCGCAGGC
AGTTCCAACGACTCAAGTGATTTTTCAAAGTCATAATCCAAATATTCTTCAACCTGTAGCCAGGCTTTACTCGCTCTCGAGCCCATCAAACGGTGTGAATTAA
Protein sequenceShow/hide protein sequence
MAFEDLLANWEKFSLTAEEENTEVDVDRQAAMVTSQSLGFSLIEKLLAPRIIFGDVMRRTFKPAWNIVNGLIVEKLGANLFLFSLRTEAEQTRVMRQGPWLFNKYLLALS
KRIPMVKPTAMEFRFAVFWVHFCELPMDLYNRSMVERLGDAIGQFQDYDNGGRGYGWKESLRVRVILDLTRPLRRDHVSKDCSHLFLDDGSSDGRSYYGMWMAFKGRSSS
VYRSPSTSPMGNNRIMMDTSPQQNPTPFQAQLPPEQRPDRNSDHRNSTRSSLMTGSGSRPMEISPAMEENGLKILAENLPTLMPNQSTFNGINVGDFSKVKKKLQFETEG
TEWEDKRKGKASVTEIPISHSFEVGQGSNSAPDLPQAVPTTQVIFQSHNPNILQPVARLYSLSSPSNGVN