; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg09970 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg09970
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationCarg_Chr10:2294093..2297717
RNA-Seq ExpressionCarg09970
SyntenyCarg09970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589797.1 hypothetical protein SDJN03_15220, partial [Cucurbita argyrosperma subsp. sororia]4.1e-107100Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
        MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY

Query:  DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRS
        DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRS
Subjt:  DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRS

KAG7023468.1 hypothetical protein SDJN02_14493, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-117100Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
        MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY

Query:  DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRSVKFVLSFVPGF
        DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRSVKFVLSFVPGF
Subjt:  DPESYALNFDDGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRSVKFVLSFVPGF

Query:  SSLILLFHI
        SSLILLFHI
Subjt:  SSLILLFHI

XP_022921805.1 uncharacterized protein LOC111429948 [Cucurbita moschata]5.5e-6796.12Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
        MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQ+QQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGG F+RKKQRKNRFQY
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY

Query:  DPESYALNFDDGFDGEDDDRHPPIAFSTR
        DPESYALNFDDGFDGE DD HPPIAFSTR
Subjt:  DPESYALNFDDGFDGEDDDRHPPIAFSTR

XP_022987488.1 uncharacterized protein LOC111485032 [Cucurbita maxima]1.8e-5484.33Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK
        MATHPTRPREQDD QDIDEA+PSNGC GCFRL    FGFNRNGNYEGRSLLQQQ+G      MVRK KKLKEVSEMVGGPRWKNFIRKMGG  +RKKQRK
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK

Query:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
        NRFQYDPESYALNF+ GFDGEDDD HPPIAFSTR
Subjt:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

XP_023516738.1 uncharacterized protein LOC111780544 [Cucurbita pepo subsp. pepo]7.4e-6491.04Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK
        MATHPTRP+EQDDLQDIDEA+PSNGC GCFRL    FGFNRNGNYEGRSLLQQQ+GWEEESWMVRKLKKLK+VSEMVGGPRWKNFIRKMGG F+RKKQRK
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK

Query:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
        NRFQYDPESYALNFDDGFDGEDDD HPPIAFSTR
Subjt:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

TrEMBL top hitse value%identityAlignment
A0A0A0LTG0 Uncharacterized protein8.0e-4870.71Show/hide
Query:  MATHPTRP---------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFR
        MATH TRP          +QDDLQDID+++ SNGCGCF+LFGF  NRN NYEG +LLQQ+QG EEESWMV++LKK++EVSEMV GP+WKNFIRKMGG  +
Subjt:  MATHPTRP---------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFR

Query:  RKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
         KK+R NRFQYDPESYALNFD GFDGE+DD HPPI FS+R
Subjt:  RKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

A0A1S3B8N4 uncharacterized protein LOC1034873839.5e-4971.63Show/hide
Query:  MATHPTRP----------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLF
        MA+H TRP          ++QDDLQDID+++ SNGCGCF+LFGF  NRN NYEG +LLQQ+QG EEESWMV+KLKK+KEVSEMV GP+WKNFIRKMGG  
Subjt:  MATHPTRP----------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLF

Query:  RRKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
        + KKQR NRFQYDPESYALNFD GFDGE+DD HPPI FS+R
Subjt:  RRKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

A0A5A7U6Q0 Uncharacterized protein9.5e-4971.63Show/hide
Query:  MATHPTRP----------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLF
        MA+H TRP          ++QDDLQDID+++ SNGCGCF+LFGF  NRN NYEG +LLQQ+QG EEESWMV+KLKK+KEVSEMV GP+WKNFIRKMGG  
Subjt:  MATHPTRP----------REQDDLQDIDEAVPSNGCGCFRLFGF--NRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLF

Query:  RRKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
        + KKQR NRFQYDPESYALNFD GFDGE+DD HPPI FS+R
Subjt:  RRKKQRKNRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

A0A6J1E6U3 uncharacterized protein LOC1114299482.6e-6796.12Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY
        MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQ+QQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGG F+RKKQRKNRFQY
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQY

Query:  DPESYALNFDDGFDGEDDDRHPPIAFSTR
        DPESYALNFDDGFDGE DD HPPIAFSTR
Subjt:  DPESYALNFDDGFDGEDDDRHPPIAFSTR

A0A6J1JJK8 uncharacterized protein LOC1114850328.8e-5584.33Show/hide
Query:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK
        MATHPTRPREQDD QDIDEA+PSNGC GCFRL    FGFNRNGNYEGRSLLQQQ+G      MVRK KKLKEVSEMVGGPRWKNFIRKMGG  +RKKQRK
Subjt:  MATHPTRPREQDDLQDIDEAVPSNGC-GCFRL----FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRK

Query:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR
        NRFQYDPESYALNF+ GFDGEDDD HPPIAFSTR
Subjt:  NRFQYDPESYALNFDDGFDGEDDDRHPPIAFSTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)2.8e-0828.48Show/hide
Query:  THPTRP-REQDDLQDIDEAV-PSNGCGCFRL------FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKM----------
        +H + P  E D   D+ EA+    GC CF +          R G+   + +    +   +E W +R  ++++E SE+V GPRWK +IR+           
Subjt:  THPTRP-REQDDLQDIDEAV-PSNGCGCFRL------FGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKM----------

Query:  -----------GGLFRRKKQRKNRFQYDPESYALNFDDGFD-GEDDDRHPPIAFSTRLRPHQSPL
                   GG    +   + +F+YD  SY+LNFDDG   G  DD  P   +S R      P+
Subjt:  -----------GGLFRRKKQRKNRFQYDPESYALNFDDGFD-GEDDDRHPPIAFSTRLRPHQSPL

AT3G48020.1 unknown protein1.6e-0846.38Show/hide
Query:  EEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKK--QRKNRFQYDPESYALNFDDGFDGEDDD
        +E  W VR   K++E SE+V GPRWK FIR+     RR +     ++F+YDP SY L+F+D  D +DDD
Subjt:  EEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKK--QRKNRFQYDPESYALNFDDGFDGEDDD

AT5G14890.1 NHL domain-containing protein2.0e-0631.94Show/hide
Query:  EQDDLQDIDEAV-PSNGCGCFRL--FGFNR----NGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKM-------GGLFRRKKQRK
        E D   ++ EA+    GC CF L   G ++    NG+   + +    +   +E W V    K++E SE+V GP+WK FIR+        GG+     + +
Subjt:  EQDDLQDIDEAV-PSNGCGCFRL--FGFNR----NGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKM-------GGLFRRKKQRK

Query:  N-RFQYDPESYALNFDDGFD-GEDDDRHPPIAFSTRLRPHQSPL
        +  F+YD  SY+LNFDDG   G  +D  P   +S R      P+
Subjt:  N-RFQYDPESYALNFDDGFD-GEDDDRHPPIAFSTRLRPHQSPL

AT5G25240.1 unknown protein1.4e-1545.95Show/hide
Query:  DIDEAVPSNGCGCFRLFGFN--RNGNYEGRSLLQQQQGW----EEE---SWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQYDPESYA
        D +E     GCG FR F F   R G+ E RS    + GW    +EE   +W   KLK LKE+SE + GP+WKNFIR      R+K +R   F YD ++Y+
Subjt:  DIDEAVPSNGCGCFRLFGFN--RNGNYEGRSLLQQQQGW----EEE---SWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQYDPESYA

Query:  LNFDDGFDGED
        LNFDDG DG+D
Subjt:  LNFDDGFDGED

AT5G62865.1 unknown protein2.5e-0947.83Show/hide
Query:  EEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKK--QRKNRFQYDPESYALNFDDGFDGEDDD
        +E  W +R   K++E SE+V GPRWK FIR+     RR +      +FQYDP SY+LNFDD  D E+D+
Subjt:  EEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKK--QRKNRFQYDPESYALNFDDGFDGEDDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACCCATCCAACCAGACCAAGGGAACAGGATGACCTTCAGGACATTGACGAGGCCGTTCCCTCAAATGGGTGTGGCTGTTTTCGGCTATTCGGGTTCAATCGGAA
CGGTAATTACGAAGGTAGAAGTCTTCTGCAGCAGCAACAGGGCTGGGAAGAGGAGTCTTGGATGGTGAGGAAATTGAAGAAGTTGAAGGAGGTTTCAGAAATGGTGGGTG
GACCCAGATGGAAGAACTTCATCAGAAAAATGGGTGGCCTTTTTAGGAGGAAAAAACAGAGGAAAAACAGGTTTCAGTATGACCCTGAAAGCTATGCTCTCAATTTCGAC
GACGGTTTCGATGGGGAAGATGACGATCGTCATCCGCCAATTGCCTTTTCTACGAGACTTCGCCCTCACCAGTCACCGTTGCGTAAACACGCCGGGAAAGATCCAAATCC
TTCTCGCATTGCAATACTGAATTGTCAACTATTTCTCGATTTGAGAACTCGCGGTATAGGCAGCTACAGCCGCGAATTTCCTTCCTCCGCCGGAGATCGTCCTTCCAGTT
TTCTATGTCTCAGGTCAGTGAAGTTCGTTCTTTCGTTCGTTCCTGGATTTTCATCTCTTATTCTCTTATTTCATATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACCCATCCAACCAGACCAAGGGAACAGGATGACCTTCAGGACATTGACGAGGCCGTTCCCTCAAATGGGTGTGGCTGTTTTCGGCTATTCGGGTTCAATCGGAA
CGGTAATTACGAAGGTAGAAGTCTTCTGCAGCAGCAACAGGGCTGGGAAGAGGAGTCTTGGATGGTGAGGAAATTGAAGAAGTTGAAGGAGGTTTCAGAAATGGTGGGTG
GACCCAGATGGAAGAACTTCATCAGAAAAATGGGTGGCCTTTTTAGGAGGAAAAAACAGAGGAAAAACAGGTTTCAGTATGACCCTGAAAGCTATGCTCTCAATTTCGAC
GACGGTTTCGATGGGGAAGATGACGATCGTCATCCGCCAATTGCCTTTTCTACGAGACTTCGCCCTCACCAGTCACCGTTGCGTAAACACGCCGGGAAAGATCCAAATCC
TTCTCGCATTGCAATACTGAATTGTCAACTATTTCTCGATTTGAGAACTCGCGGTATAGGCAGCTACAGCCGCGAATTTCCTTCCTCCGCCGGAGATCGTCCTTCCAGTT
TTCTATGTCTCAGGTCAGTGAAGTTCGTTCTTTCGTTCGTTCCTGGATTTTCATCTCTTATTCTCTTATTTCATATCTGACTCTAATCGAGCTCGTACTCTTCAACAATG
GAGTTTTGGGGTGGATTTAGGGTCTGGGTTTTAACCGTGTGCTTGATATTTCAATCTGGATATGGGTTTTATCTTCCGGGGAGTTACCCTCTCAAACATGTTGTGGGCGA
TGAACTGTCGGTGAAGGTTAATTCCATAACCTCCATCGATACTGAAATGCCATTTAGCTATTATAGTTTGCCTTTTTGCAAACCTCAAGGGGGCGTTAAGGATAGTGCTG
AAAATCTTGGTGAGGTTCTTATGGGGGATCGGATTGAGAATTCGCCGTATCTGTTTAAGATGTATAAGAATCAGACAGATGTGTTCTTGTGCCAGACAGATCCATTGACT
GATGATCAGTTTAAGATCTTAAAGGAGAGGATTGATGAGATGTATCAGGTGAACTTGATCCTGGACAATTTACCGGCAATCCGGTATACCAAGAAAGAAGGATATCCATT
GCGTTGGACAGGATACCCTGTAGGAATCAATCTGAAGGGCTCCTATTATGTCTTTAACCATTTGAAATTTAAGGTTCTTGTTCACAAATACGAGGAGACGAACGTTGCAA
GCATAATGGGAACTGGTGATGCTGCAGGTGTGATCCCAACAGTCAGTAAACAGGAACTAGATGTTCCAGGATATATGGTTGTTGGATTTGAGGTTGTACCCTGCAGCCCT
TTGCACAATGTGGACTTAGTTAAGAACTTAAAGATGTATGAAAAGTATCCAAATCCTGTTCCATGTGACCCTGCTAGTGTATCAATGCAAATTAAGAAAGGCCAATCTAT
AGTGTTCACGTATGAGGTTACGTTTGAAGAGAGTGACATCAAGTGGCCATCCCGATGGGATGCGTATTTGAAGATGGAGGGTTCAAAAGTTCATTGGTTTTCAATCTTGA
ACTCTTTAATGGTGATAACGTTTCTCGCCGGTATTGTTCTTGTAATTTTCTTGAGGACTGTTCGACGAGATCTTACACGTTATGAGGAGCTTGACAAGGAGGCTCAAGCG
CAGATGAACGAGGAGTTATCTGGTTGGAAGCTTGTTGTGGGGGATGTTTTCCGTGCTCCAGCCAATCCTGCACTTTTGTGTATAATGGTTGGTGATGGGGTTCAGCTTCT
AGGGATGGGAATTGTGACCATATTGTTTGCTGCTCTTGGGTTCATGTCCCCAGCATCCCGTGGAACGCTTATTACAGGTATGCTATTTTTCTATATGATTCTCGGTGTTG
CAGCAGGTTATGTCGCTGTACGTCTTTGGAGAACGATCTGTTGTGGTGACCACAGAGGATGGGTTTCAGTCTCATGGAAGGCTGCTTGTTTCTTCCCAGGCATTGCCTTT
CTAATTCTTACCATACTGAATTTTCTATTATGGGGTAGTCAAAGCACGGGAGCGATTCCATTTTCGCTCTTTGTTATCCTACTTTTTCTGTGGTTCTGTATATCAGTCCC
TCTTACTCTTGTTGGTGGGTATTTTGGTGCCAAGGCACCTCACATTGAGTATCCTGTTAGAACCAATCAAATCCCACGGGAAATTCCGCCCCAGAAATACCCGTCATGGC
TTTTAGTCCTAGGCGCTGGCACACTTCCTTTCGGCACCTTGTTCATCGAACTCTTCTTTATCATGTCTAGCCTCTGGATGGGTCGTGTCTATTACGTTTTCGGGTTTCTC
TTCATAGTGCTTGTGCTTCTTGTTGTCGTTTGTGCTGAGGTATCTCTGGTTCTCACCTATATGCATCTATGCGTGGAAGACTGGAAATGGTGGTGGAAGTCTTTCTTTGC
TTCTGGTTCAGTTGCCTTGTACATCTTCTTGTACTCGATTAACTATCTCATCTTCGATCTCAAGAGCTTGAGCGGACCCGTCTCAGCCACTCTCTACCTTGGTTATTCAC
TCTTCATGGTTCTTGCAATCATGTTCACAACTGGAACGGTTGGATTCCTCTCGTCGTTCTGGTTCGTGCATTACTTGTTCTCTTCTGTGAAGCTGGACTGAGTAATCCCC
TCATGCCCAAGATCCAGAAACCATTAAGAAGTGCAGCAGCGTTCGAGGAGCAAAGACTGTAAGTTCTTGGCAGATGTTCTATCTTGTATTAGTCTTGTTTTTTCTCATTC
AAATCCAGTATGGATCTTCTAGATAATCTAGCAACAGCACTTGGTCAAAAGTTTTGCTTCCTCTTATTTCATGAACCTGAGTTTATTTACAGCTTTTGTTATAGAAAAGA
CCAAAATTAGTTCATTATCGGTGGATGTTGAAGAACCTTTTTGTGAGATCCCATCGGTCGGAGAGAGAACGAAATATTCTTGATAAGGGTGTGAAAACTTCTTAACGAAA
TATTCTTGATAAGGGTGTGAAAACTTCTTCCAGACGCATTTT
Protein sequenceShow/hide protein sequence
MATHPTRPREQDDLQDIDEAVPSNGCGCFRLFGFNRNGNYEGRSLLQQQQGWEEESWMVRKLKKLKEVSEMVGGPRWKNFIRKMGGLFRRKKQRKNRFQYDPESYALNFD
DGFDGEDDDRHPPIAFSTRLRPHQSPLRKHAGKDPNPSRIAILNCQLFLDLRTRGIGSYSREFPSSAGDRPSSFLCLRSVKFVLSFVPGFSSLILLFHI