CuGenDBv2

Tan0018043 (gene) of Snake gourd v1 genome

Gene ID: Tan0018043
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Rubredoxin-like domain-containing protein
Genome location: LG09:73841543..73845783
RNA-Seq Expression: Tan0018043
Synteny: Tan0018043
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
InterPro domains:
IPR024934 - Rubredoxin-like domain
IPR024935 - Rubredoxin domain
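
The genome location above is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the locus span can be derived as shown here. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("LG09:73841543..73845783")
# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(scaffold, span)  # LG09 4241
```

Under that assumption the locus covers 4,241 bp on LG09.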


Homology
GenBank top hits (e value, % identity, alignment)
KAG6587388.1 Mitochondrial phosphate carrier protein 2, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.7e-69, % identity: 86.05)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
        MASVSATSLRFQLPP  HS  KI QEDGGADR S+ LSLKSSFFSP   IPSL KQNSAV +APK SMRVASKQAYICRDCGYIYNDRTPF+KLPDKYFC
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC

Query:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVGAKPGNVLL--RIVSE
        PVCGAPKRRFRPYEQSVTKNANEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALV  KPGNVL   R+VSE
Subjt:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVGAKPGNVLL--RIVSE

XP_004138023.1 uncharacterized protein LOC101207574 isoform X1 [Cucumis sativus] (e value: 2.5e-63, % identity: 81.21)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD
        MAS+SA+SL F LPPKPH   K+NQEDGG DR+       SNRLSLKSSF SP   IPSL KQNS V +A PKFSMRVASKQAYICRDCGYIYNDRTPFD
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD

Query:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        KLPDKYFCPVCGAPKRRFRPYEQ+V+KN NEFDVRKARKAQIQKDE+IGKVLPIAAA+GIVALVG
Subjt:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

XP_008464376.1 PREDICTED: uncharacterized protein LOC103502280 isoform X1 [Cucumis melo] (e value: 1.2e-62, % identity: 80.61)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD
        MASVSA+SL F LPPKPH   K+NQEDGG DR+       SNRLSLKSSF SP   IPSL +QNS V +A PKFSMRVASKQAYICRDCGYIYNDRTPFD
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD

Query:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        KLPDKYFCPVCGAPKRRFRPYEQ+V KN NEFD+RKARKAQIQKDE+IGKVLPIAAA+GIVALVG
Subjt:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

XP_022927654.1 uncharacterized protein LOC111434474 [Cucurbita moschata] (e value: 6.3e-67, % identity: 88.54)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
        MASVSATSLRFQLPP  HS  KI QEDGGADR S+ LSLKSSFFSP   IPSL KQNSAV +APK SMRVASKQAYICRDCGYIYNDRTPF+KLPDKYFC
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC

Query:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        PVCGAPKRRFRPYEQSVTKNANEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALVG
Subjt:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

XP_023531329.1 uncharacterized protein LOC111793604 [Cucurbita pepo subsp. pepo] (e value: 2.1e-67, % identity: 89.17)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
        MASVSATSLRFQLPP  HS  KI QEDGGADR S+ LSLKSSFFSP   IPSL KQNSAV +APK SMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC

Query:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        PVCGAPKRRFRPYEQSVTKNANEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALVG
Subjt:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNF5 Rubredoxin-like domain-containing protein (e value: 1.2e-63, % identity: 81.21)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD
        MAS+SA+SL F LPPKPH   K+NQEDGG DR+       SNRLSLKSSF SP   IPSL KQNS V +A PKFSMRVASKQAYICRDCGYIYNDRTPFD
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD

Query:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        KLPDKYFCPVCGAPKRRFRPYEQ+V+KN NEFDVRKARKAQIQKDE+IGKVLPIAAA+GIVALVG
Subjt:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

A0A5A7UPF0 Rubredoxin family protein (e value: 5.9e-63, % identity: 80.61)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD
        MASVSA+SL F LPPKPH   K+NQEDGG DR+       SNRLSLKSSF SP   IPSL +QNS V +A PKFSMRVASKQAYICRDCGYIYNDRTPFD
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSA-PKFSMRVASKQAYICRDCGYIYNDRTPFD

Query:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        KLPDKYFCPVCGAPKRRFRPYEQ+V KN NEFD+RKARKAQIQKDE+IGKVLPIAAA+GIVALVG
Subjt:  KLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

A0A6J1EIL7 uncharacterized protein LOC111434474 (e value: 3.0e-67, % identity: 88.54)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
        MASVSATSLRFQLPP  HS  KI QEDGGADR S+ LSLKSSFFSP   IPSL KQNSAV +APK SMRVASKQAYICRDCGYIYNDRTPF+KLPDKYFC
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC

Query:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        PVCGAPKRRFRPYEQSVTKNANEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALVG
Subjt:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

A0A6J1IG97 uncharacterized protein LOC111473110 (e value: 3.0e-67, % identity: 88.54)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC
        MASVSATSLRFQLPP  HS  KI QEDGGADR S+ LSLKSSFFSP   IPSL KQNSAV +APK SMRVASKQAYICRDCGYIYNDRTPF+KLPDKYFC
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFC

Query:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        PVCGAPKRRFRPYEQSVTKNANEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALVG
Subjt:  PVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

A0A6J1JG21 uncharacterized protein LOC111484793 isoform X1 (e value: 5.9e-63, % identity: 81.1)
Query:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDK
        MAS SA+SL F LPP PHS  KINQEDGGADRD       SNR SLKSSF SP   IPS  KQ  AV +APKFS+RVASKQAYICRDCGYIYNDRTPF+K
Subjt:  MASVSATSLRFQLPPKPHSKIKINQEDGGADRD-------SNRLSLKSSFFSP---IPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDK

Query:  LPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG
        LPD YFCPVCGAPKRRFRPYEQSVTKN NEFD RKARKAQIQKDE+IGKVLPIAAAVGIVALVG
Subjt:  LPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVG

SwissProt top hits (e value, % identity, alignment)
O26258 Probable rubredoxin (e value: 4.1e-05, % identity: 39.29)
Query:  VASKQAYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPYE
        V++ + Y CR CGYIY+             TPF+ LP+ + CP CGA K+ F+P +
Subjt:  VASKQAYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPYE

P04170 Rubredoxin-1 (e value: 3.7e-06, % identity: 51.16)
Query:  QAYICRDCGYIY----NDRTPFDKLPDKYFCPVCGAPKRRFRP
        Q Y+C  CGY Y    +D  PFD+LPD + CPVCG  K +F P
Subjt:  QAYICRDCGYIY----NDRTPFDKLPDKYFCPVCGAPKRRFRP

P24297 Rubredoxin (e value: 7.1e-05, % identity: 40)
Query:  YICRDCGYIYND-----------RTPFDKLPDKYFCPVCGAPKRRFRPYE
        ++C+ CGYIY++            T F++LPD + CP+CGAPK  F   E
Subjt:  YICRDCGYIYND-----------RTPFDKLPDKYFCPVCGAPKRRFRPYE

P58992 Rubredoxin-1 (e value: 1.2e-04, % identity: 40)
Query:  AYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPY
        +++C +CGYIY+              PFDKLPD + CPVC  PK +F  +
Subjt:  AYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPY

Q9AL94 Rubredoxin (e value: 6.0e-04, % identity: 39.22)
Query:  YICRDCGYIY-----------NDRTPFDKLPDKYFCPVCGAPKRRFRPYEQ
        Y+C  CGYIY           N  T F+ +PD + CP+CG  K +F P E+
Subjt:  YICRDCGYIY-----------NDRTPFDKLPDKYFCPVCGAPKRRFRPYEQ
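
The % identity column can be related to the alignment rows. As a hedged sketch, assuming the usual BLAST-style convention in which a letter in the middle row marks an identical residue, `+` marks a conservative substitution, and identity is taken over the full alignment length including gaps, the figure for the P04170 hit can be recomputed from its alignment. The helper name `percent_identity` is hypothetical.

```python
def percent_identity(query, midline):
    """Return identical positions as a percentage of the alignment length."""
    # Letters in the midline mark identical residues; '+' marks a
    # conservative substitution and a space marks a mismatch or gap.
    identities = sum(1 for ch in midline if ch.isalpha())
    # The query row (gaps included) gives the alignment length, because
    # trailing spaces of the midline may have been stripped.
    return 100.0 * identities / len(query)

# Query and middle rows of the P04170 (Rubredoxin-1) alignment,
# with the 8-character "Query:  " prefix removed.
query   = "QAYICRDCGYIY----NDRTPFDKLPDKYFCPVCGAPKRRFRP"
midline = "Q Y+C  CGY Y    +D  PFD+LPD + CPVCG  K +F P"
print(round(percent_identity(query, midline), 2))  # 51.16
```

Here 22 identical positions over 43 alignment columns give 51.16, which agrees with the value listed for P04170.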

Arabidopsis top hits (e value, % identity, alignment)
AT5G17170.1 rubredoxin family protein (e value: 4.1e-08, % identity: 33.33)
Query:  PKFSMRVASKQ----AYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVGA
        P+F  ++   Q     +IC DCG+IY     FD+ PD Y CP C APK+RF  Y+ +  K                     G + PI   VG++A +GA
Subjt:  PKFSMRVASKQ----AYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVGA

AT5G51010.1 Rubredoxin-like superfamily protein (e value: 1.5e-39, % identity: 65.55)
Query:  SLKSSFFSPIPSL----SKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQK
        +L+ S F P  S+    +K++ +  SAP+FSMRV+SKQAYICRDCGYIYNDRTPFDKLPD YFCPVC APKRRFR Y   V+KN N+ DVRKARKA++Q+
Subjt:  SLKSSFFSPIPSL----SKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNANEFDVRKARKAQIQK

Query:  DESIGKVLPIAAAVGIVAL
        DE++GK LPI  AVG++AL
Subjt:  DESIGKVLPIAAAVGIVAL


Sequences
CDS sequence
ATGGCTTCTGTTTCAGCTACTTCTCTCAGGTTCCAGCTACCGCCGAAGCCACATTCCAAGATCAAGATTAACCAGGAAGACGGCGGCGCTGATAGGGACTCCAACCGCTT
ATCTTTGAAATCCTCCTTCTTCTCTCCCATTCCTTCTTTAAGCAAGCAAAACTCTGCTGTTGTATCTGCCCCCAAGTTCTCCATGCGCGTCGCTTCCAAACAAGCCTATA
TCTGTCGTGATTGCGGGTACATTTACAACGATAGAACTCCTTTTGACAAATTGCCTGATAAGTATTTCTGTCCTGTTTGTGGTGCTCCTAAGCGTCGATTTAGGCCTTAT
GAGCAATCTGTGACCAAAAATGCTAACGAATTTGACGTGAGGAAGGCGAGGAAGGCGCAGATCCAAAAAGATGAATCTATTGGGAAGGTGCTGCCTATTGCTGCTGCAGT
GGGAATCGTAGCACTAGTAGGCGCAAAACCAGGAAATGTGCTTCTAAGGATTGTTAGCGAGGCATTCTCAGAAGTGAACTTGGGTCCTGTAGGTTTCCCAAGTATCACTT
AG
mRNA sequence
GAAGATTGATTATCCGCCACTCATAATCATATAGTTTTGCATTCACTATTCACTCGACTTCTTCACCTTCGTCTTCACCCATTCTATTTCCTTCTTCCTTCTCTCTGATT
CTCCTCTTTCTCTACTCCATATTTGAACTCACTCTGCTTTGAATTGTTATTACCTCCCATGGCTTCTGTTTCAGCTACTTCTCTCAGGTTCCAGCTACCGCCGAAGCCAC
ATTCCAAGATCAAGATTAACCAGGAAGACGGCGGCGCTGATAGGGACTCCAACCGCTTATCTTTGAAATCCTCCTTCTTCTCTCCCATTCCTTCTTTAAGCAAGCAAAAC
TCTGCTGTTGTATCTGCCCCCAAGTTCTCCATGCGCGTCGCTTCCAAACAAGCCTATATCTGTCGTGATTGCGGGTACATTTACAACGATAGAACTCCTTTTGACAAATT
GCCTGATAAGTATTTCTGTCCTGTTTGTGGTGCTCCTAAGCGTCGATTTAGGCCTTATGAGCAATCTGTGACCAAAAATGCTAACGAATTTGACGTGAGGAAGGCGAGGA
AGGCGCAGATCCAAAAAGATGAATCTATTGGGAAGGTGCTGCCTATTGCTGCTGCAGTGGGAATCGTAGCACTAGTAGGCGCAAAACCAGGAAATGTGCTTCTAAGGATT
GTTAGCGAGGCATTCTCAGAAGTGAACTTGGGTCCTGTAGGTTTCCCAAGTATCACTTAGTTTAGCACATTGTTAATGTTGAGGAATCAGAGTTAAAGAAAGCAGTGAAT
TGCAACTTGGTGGTTGTTTTTTCATTGCCACGACATGTTTGGTTGTTAATCATACAAAAAATCTGCATCACAGAACTTGGGGTGGAGGCCTGTTATCAGAAACGACTTGT
CACAGTATAAGTGACCTTTGAGGTTGAGAGAATGGAGCCTCATGAGAGGCTGAGGCACCTCGAATGGTGGCAAGTGAAAGGATATTAGCATAGACTTGGGTTAACCTGCC
ACCTGAGACAGACGGGGAAAGGCTCAACTAATAATGGAATGGAGGCACAATTCGAAATCAGATTCAGGACAAAAATAGCCGGGTTGGTTACCTGCTTTCCCTGAAACCAA
GCACCGTATGGAACTCTAGTGTTAAGACCCAGCTCTGTTGCCAACGTATGAACCTGCCAGCGACTCCCATCAACGGAATTACTGAATCCTGATCTCCACTGAAACAGATG
GTGAAATAAGAACGAAATAGGCGTTGGCGTCGATTTATGGTTTTGATGAGGTAATGGTTAAAATTGTACCTACCCACCTGTAAATCAAGACTCGTATGCCGGCTTGACTA
GTGTCCCAACAATGGAGATGGTTGGAAATGGTATTTCCAGGTTAAGCTCATTGTCCAGAGCTCAGGTGTCATGTCAATAAGAGTCAAATGAGAATGGCCCGGTCCAGAAT
AATGTAAAATTCTAATACACAGTTGAAATACTCGCTGCAAACAGCCCAGTTGTGGACTCCAACAAGACGGGCATGATCGTGAAGAGCCTTTTTCACATCCTCTCGGTTCA
GATATTTGACGGTTTCATCTTCAATGACATATCGGGAATAGTTACAAGCAGAAGTGAACAATCTGTAAGTGGAGTCTGATATTAATCCATTCATGAGACCAGAAGAACTC
CGCCCTCGAGGAATGTAATGACCTGGTGGTGGTATAATTAGGAAAAGGATTTGGTTTTCTTAATTAAGAGAAATAGAAAGAGGGAGGGGAGGGGGAGAGGGAGAGATATA
AAAGACAGAAATGTGGTTATCTAATACCTGCATAGAAACAGATCTCTATGCCCGTAATGAGGGAACTTATTGAACCACCTTTGCAAGAATATGAGATTGTCTTTTGCTGA
AGATAGCAACAGGCACAAAACATGAATTAAAAACAAGCACATGCAACATGATTTGTTTTCTTTCCTTAGCCAAATCAATATCTATCTTCTGAAGAGAGAAAGAACAGAAG
GGATACAAGAATACCTGTGGCCTCATCATCCTTTGTCCCAAGAGAAGCGCCATCATCAAATTCACGCCCCAGCTCCCAGGGAAGAACAACCAGGCCCTGCAAGAAATCAG
AAAAGGTTCCAAGGACCTTGAGTGAATTAGTAGTAATATAGAGGAAGGCACCTCTATTCAGGCAGAGAACCAAAGGCTTCGAATCAGGGTTGGTTCCTGCTTCAACAAAA
TAGTAAAATAGTGCTCTGTGCTTTTGGTTATGTACATGAAGATAGCCTGAAAACTGATGAAACCCCAACGGGGGGGAGGCTGCCCTGGAAGGCTAATAATCTGGTCACAA
TGAGCAACCTCCTCGTAAATGCAAAGACGAAGAATAAGAGCAGCCATTACCATGGCCTTCAACCTTCATGGTTTAGAGAGAGAGTGTGTCCTAGACTCGAGAAAGTGCTT
TTTAAAAGTACTGTGATGATGCTTCCCTATCTTCTGTTCCCGTTCCCCAACAACAGAAAAGAGTTATCCACCGCACCGCCAGCGTCTGGCATTCTTTTGAGTAAAGAAGG
TGAGAGTGTGGCGTTCATTGTAAAGACAACTACTTCCTTCCCTCTTCTCAGATATCAGTTCTCTACAGTTATTCGTAAGACAAATTTAAACTCTTTCTATTAGATTTCTG
ACAGACTATTGGGGGATTTTAAAGAATCTCGACAGAATTTCAGGGGTTCTTTTTAGTTTTCGGGTAGACAGTTGGTTTTTCTTGTTAAATTTGTACATCTGCACTGGTCT
ACAGCAACTAATATAAGTTAAATCCCAAAGTTCACTTCAACTTCAAGGAAAATTTTGAAGGATTTCGAAGCATAGGTTTAAATGAAAATTATATCGAATCAGCAACGATA
TAGTTAGAAGGGACGGGAAGGAACACGAAATGGTCCTTCAAATAAGATAATCCTGAAGGCTTCAGCCAGCAACCATGACTTTTTACCTACACATCAACTTTTTCTGTGTT
TTCTTGTGGGAAACATTTTTTTTTTCTTAGAGC
Protein sequence
MASVSATSLRFQLPPKPHSKIKINQEDGGADRDSNRLSLKSSFFSPIPSLSKQNSAVVSAPKFSMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPY
EQSVTKNANEFDVRKARKAQIQKDESIGKVLPIAAAVGIVALVGAKPGNVLLRIVSEAFSEVNLGPVGFPSIT
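
The protein sequence corresponds to the CDS read codon by codon. The following is a minimal sketch, assuming the standard genetic code (the page does not say which translation table was used); the `translate` helper is hypothetical and is shown on the first and last few codons of the CDS listed above rather than on the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' denotes a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTTCTGTTTCAGCT"))  # first 18 bases of the CDS: MASVSA
print(translate("ATCACTTAG"))           # last 9 bases of the CDS: IT
```

Both fragments match the start (MASVSA) and end (IT) of the protein sequence shown above. Pasting the full CDS into `translate` should let the whole protein be compared in the same way.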