CuGenDBv2

Tan0012163 (gene) of Snake gourd v1 genome

Gene ID: Tan0012163
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ribosome maturation factor RimP
Genome location: LG04:26854411..26855925
RNA-Seq Expression: Tan0012163
Synteny: Tan0012163
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | %identity | Alignment
KAG7013097.1 rimP, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.0e-80 | 75.23
Query:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        GRDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQFG D+ LYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRL
Subjt:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------
        DE GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                       
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------

Query:  -LPYENHKKVFLYL
         LP+ NHKKVFLYL
Subjt:  -LPYENHKKVFLYL

XP_022944994.1 uncharacterized protein LOC111449365 [Cucurbita moschata] | 6.7e-80 | 74.77
Query:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        GRDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQFG D+ LYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRL
Subjt:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------
        DE GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                       
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------

Query:  -LPYENHKKVFLYL
         +P+ NHKKVFLYL
Subjt:  -LPYENHKKVFLYL

XP_022967944.1 uncharacterized protein LOC111467332 isoform X1 [Cucurbita maxima] | 8.7e-80 | 75.12
Query:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD
        RDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQ G D+KLYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRLD
Subjt:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD

Query:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN------------------------
        E GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                        
Subjt:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN------------------------

Query:  LPYENHKKVFLYL
        LP+ NHKKVFLYL
Subjt:  LPYENHKKVFLYL

XP_023541150.1 uncharacterized protein LOC111801398 isoform X1 [Cucurbita pepo subsp. pepo] | 3.5e-81 | 75.7
Query:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        GRDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQFG D+KLYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRL
Subjt:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------
        DE GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                       
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------

Query:  -LPYENHKKVFLYL
         LP+ NHKKVFLYL
Subjt:  -LPYENHKKVFLYL

XP_038891543.1 uncharacterized protein LOC120080930 [Benincasa hispida] | 6.3e-78 | 72.69
Query:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        G DEG ETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL LA EVLLQFG D+KLYSFKTTPRGYIYVRLDKLSN +GCPSLEELESYSQEYKKRL
Subjt:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN-----------------------
        DE GALGNIPDDLALEVSSPGAERLLKVPD LFRFKA+P+            T+ND V+MLDRV+ E E CIWKLAN                       
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN-----------------------

Query:  -LPYENHKKVFLYLAC
         LPY NHKKVFLYL C
Subjt:  -LPYENHKKVFLYLAC

TrEMBL top hits | e value | %identity | Alignment
A0A0A0KW71 Uncharacterized protein | 1.3e-73 | 68.66
Query:  GRDEGEETTDGWEE-DDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKR
        GR++  ETTDGWEE DDD+EPELGDGG GGGVVLQGVPWGEHVL LA EVLLQFG D+KLYSFK TPRGYIYVRLDKLS+ +GCP+LEEL+SYS+EYKKR
Subjt:  GRDEGEETTDGWEE-DDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKR

Query:  LDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN----------------------
        LDE GALGNIPDDLALEVSSPGAERLLK+PD L RFKA P+            ++ND VFMLD ++LESESCIWKLAN                      
Subjt:  LDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN----------------------

Query:  --LPYENHKKVFLYLAC
          LPY NHKKVFLYL C
Subjt:  --LPYENHKKVFLYLAC

A0A1S3BF38 uncharacterized protein LOC103489189 isoform X3 | 5.9e-74 | 69.59
Query:  GRDEGEETTDGWEE-DDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKR
        GR++  ETTDGWEE DDD+EPELGDGG GGGVVLQGVPWGEHVL LA EVLLQFG D+KLYSFK TPRGYIYVRLDKLS+ +GCPS+EEL+SYS+EYKKR
Subjt:  GRDEGEETTDGWEE-DDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKR

Query:  LDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN----------------------
        LDE GALGNIPDDLALEVSSPGAERLLKVPD L RFKA P+            ++ND VFMLD V+LESESCIWKLAN                      
Subjt:  LDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN----------------------

Query:  --LPYENHKKVFLYLAC
          LPY NHKKVFLYL C
Subjt:  --LPYENHKKVFLYLAC

A0A6J1D5R7 uncharacterized protein LOC111017562 | 5.2e-78 | 71.16
Query:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD
        +DEG ETTDGWE+DDDMEPE+GDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFG D+KLYSFKTTPRGYIYVRLDKLSN +GCPSLE+LESY+QEYKKRL 
Subjt:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD

Query:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN------------------------
          GALG IPDDLALEVSSPGAERLLKVPD LFRFKAMP+            T+ND VF+LD V+ ES+SC+WKLAN                        
Subjt:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPI-----------CTKNDEVFMLDRVDLESESCIWKLAN------------------------

Query:  LPYENHKKVFLYLAC
        LPY NHKKVFLYL C
Subjt:  LPYENHKKVFLYLAC

A0A6J1FZQ6 uncharacterized protein LOC111449365 | 3.2e-80 | 74.77
Query:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        GRDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQFG D+ LYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRL
Subjt:  GRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------
        DE GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                       
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN-----------------------

Query:  -LPYENHKKVFLYL
         +P+ NHKKVFLYL
Subjt:  -LPYENHKKVFLYL

A0A6J1HY61 uncharacterized protein LOC111467332 isoform X1 | 4.2e-80 | 75.12
Query:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD
        RDEGEETTDGWEEDDD+EPELGDGGGGGGVVLQGVPWGE VL+LAHEVLLQ G D+KLYSFKTT RGYIYVRLDKLSN YGCPSLEELESYSQEYKKRLD
Subjt:  RDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD

Query:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN------------------------
        E GALGNIPDDLALEVSSPGAERLLKVPD LFRFKAMP  +C         T+ND VFM+D VDLESESC+WKLAN                        
Subjt:  EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMP--IC---------TKNDEVFMLDRVDLESESCIWKLAN------------------------

Query:  LPYENHKKVFLYL
        LP+ NHKKVFLYL
Subjt:  LPYENHKKVFLYL

SwissProt top hits | e value | %identity | Alignment
B0SH16 Ribosome maturation factor RimP | 6.6e-06 | 40.58
Query:  IYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAM
        I + LD L++  G  SLE+ E+ S+  K+ LD+ G       D  L+VSS GAER+L++P+ L RF+ +
Subjt:  IYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAM

Q04U33 Ribosome maturation factor RimP | 6.0e-07 | 40.91
Query:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI
        +KLYS K   R     I V LD L + YG  SL E E  S++ K+ L+        PD D  L+VSS GAER L +P+ + RF+ +P+
Subjt:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI

Q04ZJ3 Ribosome maturation factor RimP | 6.0e-07 | 40.91
Query:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI
        +KLYS K   R     I V LD L + YG  SL E E  S++ K+ L+        PD D  L+VSS GAER L +P+ + RF+ +P+
Subjt:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI

Q72NX1 Ribosome maturation factor RimP | 6.0e-07 | 43.18
Query:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI
        +KLYS K   R     I V LD L + YG  SL E E  S++ K+ L+        PD D  L+VSS GAER L +P  L RF+ +PI
Subjt:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI

Q8F7K3 Ribosome maturation factor RimP | 6.0e-07 | 43.18
Query:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI
        +KLYS K   R     I V LD L + YG  SL E E  S++ K+ L+        PD D  L+VSS GAER L +P  L RF+ +PI
Subjt:  MKLYSFKTTPR---GYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPGALGNIPD-DLALEVSSPGAERLLKVPDHLFRFKAMPI

Arabidopsis top hits | e value | %identity | Alignment
AT1G69210.1 Uncharacterised protein family UPF0090 | 2.6e-53 | 51.43
Query:  ETTDGWE--EDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPG
        E  + WE  ED+D+E +LGDGG GGG+VL+GV WGE VLS+A +VL Q   D++L++FKT+PRGYIYVRLDKLS  YGCP+++ELE +S+E+KKRLD+ G
Subjt:  ETTDGWE--EDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPG

Query:  ALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPICTKNDE-----------VFMLDRVDLESESCIWKLAN------------------------LPY
        A   IP+DLALEVSSPGAERLL+VP+ L RFK MP+     E           VF+L+ +D ES++C+WKLA+                        LP+
Subjt:  ALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPICTKNDE-----------VFMLDRVDLESESCIWKLAN------------------------LPY

Query:  ENHKKVFLYL
         +HKK+ LYL
Subjt:  ENHKKVFLYL

AT1G69210.2 Uncharacterised protein family UPF0090 | 1.4e-35 | 62.83
Query:  ETTDGWE--EDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPG
        E  + WE  ED+D+E +LGDGG GGG+VL+GV WGE VLS+A +VL Q   D++L++FKT+PRGYIYVRLDKLS  YGCP+++ELE +S+E+KKRLD+ G
Subjt:  ETTDGWE--EDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLDEPG

Query:  ALGNIPDDLALEV
        A   IP+DLALEV
Subjt:  ALGNIPDDLALEV

AT1G77122.1 Uncharacterised protein family UPF0090 | 5.5e-32 | 39.56
Query:  DEGEETTDGWEEDDDMEPEL--GDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL
        DE +E  D   E D+ E EL  GDGGGGGG+ L G  W +  L+LA +V   F  D+ +Y+FKT P   I VR+++L+N +G P++E++E++S  Y+ +L
Subjt:  DEGEETTDGWEEDDDMEPEL--GDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRL

Query:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPIC---------TKNDEVFMLDRVDLESESCIWKLANLPYENHK
         E     +IPD+++LEVSSPG ER++++P  L R+K  P+          T+ D +F L   D+E++ CIW +A++     K
Subjt:  DEPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPIC---------TKNDEVFMLDRVDLESESCIWKLANLPYENHK


Sequences
CDS sequence
ATGTACTATAGATTTGTTGCGGAAGCCGGGAGGGATGAAGGGGAAGAAACTACGGATGGATGGGAAGAAGACGATGATATGGAGCCTGAGCTTGGTGATGGAGGGGGTGG
TGGTGGTGTTGTTTTACAAGGCGTGCCATGGGGCGAACATGTTCTTTCTCTTGCTCATGAGGTCCTGCTGCAATTCGGCGTTGACATGAAACTCTATTCTTTTAAGACTA
CTCCACGTGGATATATCTATGTCAGACTAGACAAGCTCTCAAACGTATATGGGTGTCCCAGCTTGGAGGAACTTGAAAGTTATAGCCAAGAGTACAAGAAAAGATTAGAT
GAACCTGGGGCACTTGGAAATATTCCGGATGATTTGGCTCTTGAGGTATCATCTCCAGGTGCAGAGAGATTACTCAAGGTCCCAGATCATCTGTTTAGATTTAAAGCCAT
GCCAATATGCACTAAAAATGATGAAGTTTTTATGTTGGATCGTGTGGACTTGGAATCTGAGAGCTGTATCTGGAAATTGGCAAATTTACCCTATGAAAATCATAAGAAGG
TATTCCTTTATCTTGCATGCTGA
mRNA sequence
ATGTACTATAGATTTGTTGCGGAAGCCGGGAGGGATGAAGGGGAAGAAACTACGGATGGATGGGAAGAAGACGATGATATGGAGCCTGAGCTTGGTGATGGAGGGGGTGG
TGGTGGTGTTGTTTTACAAGGCGTGCCATGGGGCGAACATGTTCTTTCTCTTGCTCATGAGGTCCTGCTGCAATTCGGCGTTGACATGAAACTCTATTCTTTTAAGACTA
CTCCACGTGGATATATCTATGTCAGACTAGACAAGCTCTCAAACGTATATGGGTGTCCCAGCTTGGAGGAACTTGAAAGTTATAGCCAAGAGTACAAGAAAAGATTAGAT
GAACCTGGGGCACTTGGAAATATTCCGGATGATTTGGCTCTTGAGGTATCATCTCCAGGTGCAGAGAGATTACTCAAGGTCCCAGATCATCTGTTTAGATTTAAAGCCAT
GCCAATATGCACTAAAAATGATGAAGTTTTTATGTTGGATCGTGTGGACTTGGAATCTGAGAGCTGTATCTGGAAATTGGCAAATTTACCCTATGAAAATCATAAGAAGG
TATTCCTTTATCTTGCATGCTGA
Protein sequence
MYYRFVAEAGRDEGEETTDGWEEDDDMEPELGDGGGGGGVVLQGVPWGEHVLSLAHEVLLQFGVDMKLYSFKTTPRGYIYVRLDKLSNVYGCPSLEELESYSQEYKKRLD
EPGALGNIPDDLALEVSSPGAERLLKVPDHLFRFKAMPICTKNDEVFMLDRVDLESESCIWKLANLPYENHKKVFLYLAC
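
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; it is not the pipeline used by the database. The names `CODON_TABLE`, `translate`, `cds_start`, and `cds_end` are hypothetical, and the two DNA fragments are the first 33 and last 18 in-frame nucleotides copied from the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 18 in-frame nt of the CDS shown on this page.
cds_start = "ATGTACTATAGATTTGTTGCGGAAGCCGGGAGG"
cds_end = "CTTTATCTTGCATGCTGA"

print(translate(cds_start))  # MYYRFVAEAGR
print(translate(cds_end))    # LYLAC
```

Both results agree with the first and last residues of the protein sequence listed above. Passing the full CDS, with its line breaks removed, to `translate` would allow the same comparison over the whole protein.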