CuGenDBv2

Tan0021907 (gene) of Snake gourd v1 genome

Gene ID: Tan0021907
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CcmF_C domain-containing protein
Genome location: Contig00127:137480..138691
RNA-Seq Expression: Tan0021907
Synteny: Tan0021907
Gene Ontology terms:
  GO:0017004 - cytochrome complex assembly (biological process)
  GO:0005739 - mitochondrion (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR044955 - Cytochrome c biogenesis CcmF C-terminal-like mitochondrial protein
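
The genome location and Gene Ontology entries above follow regular patterns, so they can be turned into structured values with a few lines of Python. The sketch below is illustrative only: the function names `parse_location` and `parse_go_term` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {"contig": contig, "start": start, "end": end,
            "length": end - start + 1}


def parse_go_term(line):
    """Split 'GO:NNNNNNN - name (namespace)' into its three parts."""
    match = re.fullmatch(r"(GO:\d{7}) - (.+) \(([^()]+)\)", line.strip())
    if match is None:
        raise ValueError(f"unrecognised GO term line: {line!r}")
    return {"id": match.group(1), "name": match.group(2),
            "namespace": match.group(3)}


loc = parse_location("Contig00127:137480..138691")
print(loc["contig"], loc["length"])
print(parse_go_term("GO:0005739 - mitochondrion (cellular component)"))
```

Under the inclusive-coordinate assumption the gene spans 1212 positions on Contig00127.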


Homology
GenBank top hits (e value, % identity, alignment)
CAB4277251.1 unnamed protein product [Prunus armeniaca] | e value: 1.2e-102 | % identity: 87.38
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLV RTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPT HSICGN MIYKPSLTNDRL+FEHD+SLRAD L IN P
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW----------RRSSGCGASRHFFDGPRPPGLP
        ASY+NGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW           +SSGCGASRHFFDGPRPPGLP
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW----------RRSSGCGASRHFFDGPRPPGLP

Query:  ARSEEFMGRRASRT
        ARSEEFMGR+A  T
Subjt:  ARSEEFMGRRASRT
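
In BLAST-style alignments such as the one above, the middle line marks each column: a letter means the residues are identical, `+` marks a conservative substitution, and a blank marks a mismatch or gap. Percent identity is conventionally the number of identical columns divided by the alignment length. The following is a minimal sketch of that calculation; the function name `percent_identity` is hypothetical, it expects the sequence text with the `Query:` prefix and leading padding already removed, and it assumes the page's % identity column uses this convention.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    `query` is the aligned query row (gaps as '-'); `midline` is the row
    between the Query and Subjt rows.
    """
    # Trailing blanks in the midline are often lost in copied text,
    # so pad it back to the length of the query row.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# Last alignment segment of the first hit, taken from the rows above.
print(round(percent_identity("ARSEEFMGRRASRT", "ARSEEFMGR+A  T"), 2))
```

For a multi-segment alignment, the segments would be concatenated before calling the function so that the denominator is the full alignment length.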

GFS31153.1 hypothetical protein Acr_00g0015910 [Actinidia rufa] | e value: 7.0e-92 | % identity: 86.8
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIF--EHDKSLRADQLPIN
        MRQKLVSRTVRRPSPTPAV VRLRSTNTKKI+FTQRLPLGSEL+M KERC LRG+DHLHGPT HSICGNFMIYKPSLTNDRL+   EHD+SLRAD LPIN
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIF--EHDKSLRADQLPIN

Query:  FPASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGLPARSEE
        FPASYENGKLEHFLH+WM N EH NFWLTMF E R FRETTSTTEVAIHTNPFTDLYASIGT SSRT GW +SSGCG  RHFFDGPRPPGLPARSEE
Subjt:  FPASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGLPARSEE

KAB5511257.1 hypothetical protein DKX38_030052 [Salix brachista] | e value: 5.2e-87 | % identity: 94.64
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPTSHSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW
        ASYENGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW

KJB09756.1 hypothetical protein B456_001G162300 [Gossypium raimondii] | e value: 4.2e-129 | % identity: 79.49
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLV RTVRRPSPTPAVMVR RSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLH PT HSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGL-PARSEEFMGR
        ASYENGKLEHFLH+WM NR+HNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW +SSGCGASRHFFDGPRPP   PA    FMG 
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGL-PARSEEFMGR

Query:  RASRTRGRALLTKPSLV---------------LADRRW---------EGK-KGSLSSPDGRGQSPITFGASLWSFSLAFPRPLELELVDRPLSTSDALRF
        R +R  GRALLTKPSLV               L  R W         EG+ KGSLSSPD RG+SPITFGA LWSFSLAFPRP  LELVDRPLSTSDA RF
Subjt:  RASRTRGRALLTKPSLV---------------LADRRW---------EGK-KGSLSSPDGRGQSPITFGASLWSFSLAFPRPLELELVDRPLSTSDALRF

Query:  HLGFRTSSPIKQ
        HLGF TSSPIKQ
Subjt:  HLGFRTSSPIKQ

TKS02629.1 hypothetical protein D5086_0000160300 [Populus alba] | e value: 3.0e-87 | % identity: 92.49
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPTSHSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSG
        ASYENGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW   +G
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSG

TrEMBL top hits (e value, % identity, alignment)
A0A0D2PZN2 Uncharacterized protein | e value: 2.0e-129 | % identity: 79.49
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLV RTVRRPSPTPAVMVR RSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLH PT HSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGL-PARSEEFMGR
        ASYENGKLEHFLH+WM NR+HNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW +SSGCGASRHFFDGPRPP   PA    FMG 
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGL-PARSEEFMGR

Query:  RASRTRGRALLTKPSLV---------------LADRRW---------EGK-KGSLSSPDGRGQSPITFGASLWSFSLAFPRPLELELVDRPLSTSDALRF
        R +R  GRALLTKPSLV               L  R W         EG+ KGSLSSPD RG+SPITFGA LWSFSLAFPRP  LELVDRPLSTSDA RF
Subjt:  RASRTRGRALLTKPSLV---------------LADRRW---------EGK-KGSLSSPDGRGQSPITFGASLWSFSLAFPRPLELELVDRPLSTSDALRF

Query:  HLGFRTSSPIKQ
        HLGF TSSPIKQ
Subjt:  HLGFRTSSPIKQ

A0A4U5PZ81 Uncharacterized protein | e value: 1.5e-87 | % identity: 92.49
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPTSHSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSG
        ASYENGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW   +G
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSG

A0A5N5J813 CcmF_C domain-containing protein | e value: 2.5e-87 | % identity: 94.64
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPTSHSICGN MIYKPSLTNDRL+FEHD+SLRAD L INFP
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW
        ASYENGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTN FTDLYASIGTGSSRTGGW
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW

A0A6J5UM80 Uncharacterized protein | e value: 5.6e-103 | % identity: 87.38
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP
        MRQKLV RTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRG+DHLHGPT HSICGN MIYKPSLTNDRL+FEHD+SLRAD L IN P
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFP

Query:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW----------RRSSGCGASRHFFDGPRPPGLP
        ASY+NGKLEHFLH+WM NREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW           +SSGCGASRHFFDGPRPPGLP
Subjt:  ASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW----------RRSSGCGASRHFFDGPRPPGLP

Query:  ARSEEFMGRRASRT
        ARSEEFMGR+A  T
Subjt:  ARSEEFMGRRASRT

A0A7J0DAS0 Uncharacterized protein | e value: 3.4e-92 | % identity: 86.8
Query:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIF--EHDKSLRADQLPIN
        MRQKLVSRTVRRPSPTPAV VRLRSTNTKKI+FTQRLPLGSEL+M KERC LRG+DHLHGPT HSICGNFMIYKPSLTNDRL+   EHD+SLRAD LPIN
Subjt:  MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIF--EHDKSLRADQLPIN

Query:  FPASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGLPARSEE
        FPASYENGKLEHFLH+WM N EH NFWLTMF E R FRETTSTTEVAIHTNPFTDLYASIGT SSRT GW +SSGCG  RHFFDGPRPPGLPARSEE
Subjt:  FPASYENGKLEHFLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGLPARSEE

SwissProt top hits (e value, % identity, alignment)
P38451 Uncharacterized mitochondrial protein ymf2 | e value: 8.5e-32 | % identity: 47.51
Query:  TNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSL-----------------------------------
        +N KKIQFT + PLGSELH+GK R  LRGID LHGPT HSICGNF+IYKPSL     +FEHD S                                    
Subjt:  TNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSL-----------------------------------

Query:  -RADQLPINFPASYENGKLEHFLHQWMTNREHNNF--WLTMFPEKRYF--RETTSTTEVAIHTNPFTDLYASIGTGSSRTG
         R  ++ + FP +    + + F     T  EHN+F  WLTMFPEKR++   + TSTT+VAIHTN FTDLYA IGTGS  TG
Subjt:  -RADQLPINFPASYENGKLEHFLHQWMTNREHNNF--WLTMFPEKRYF--RETTSTTEVAIHTNPFTDLYASIGTGSSRTG

P93286 Cytochrome c biogenesis CcmF C-terminal-like mitochondrial protein | e value: 4.6e-70 | % identity: 83.89
Query:  MVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFPASYENGKLEHFLHQWMTNR
        M+ +  +NTKKIQFTQRLPLG ELHMGKERCCLRG+DHLHGPT HSICGN MIYK SLTNDRL+FEHD+SL AD L INFPASY+NGKLEHFLH WM NR
Subjt:  MVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFPASYENGKLEHFLHQWMTNR

Query:  EHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW
        +HNNFWLTMFPEKRYFRE TST EVAIHTN FTDLYA IGTGSSRTGGW
Subjt:  EHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW

Arabidopsis top hits (e value, % identity, alignment)
ATMG00180.1 cytochrome C biogenesis 452 | e value: 7.8e-73 | % identity: 85.23
Query:  MVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFPASYENGKLEHFLHQWMTNR
        M+ +  +NTKKIQFTQRLPLG ELHMGKERCCLRG+DHLHGPT HSICGN MIYKPSLTNDRL+FEHD+SL AD L INFPASY+NGKLEHFLH WM NR
Subjt:  MVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFPASYENGKLEHFLHQWMTNR

Query:  EHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW
        +HNNFWLTMFPEKRYFRE TST EVAIHTN FTDLYASIGTGSSRTGGW
Subjt:  EHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGW


Sequences
CDS sequence
ATGAGGCAGAAACTCGTCTCACGTACGGTTCGGAGGCCGAGCCCCACCCCAGCAGTAATGGTGCGGCTTAGGTCAACTAACACAAAGAAGATACAGTTCACTCAACGATT
GCCTTTGGGTTCCGAACTCCATATGGGGAAGGAGCGTTGTTGTTTGCGAGGTATCGATCATTTACATGGACCCACTTCTCATTCCATTTGTGGTAATTTTATGATCTATA
AACCGTCCCTAACGAACGATCGGCTCATCTTTGAGCATGATAAATCACTTCGTGCCGACCAGTTGCCAATAAACTTTCCGGCCTCATATGAGAATGGAAAACTGGAGCAT
TTTCTGCATCAGTGGATGACGAATCGCGAACATAATAATTTCTGGTTGACCATGTTCCCAGAAAAAAGATACTTTCGAGAAACGACGAGCACGACTGAAGTAGCTATACA
TACAAATCCATTTACGGATCTATATGCTTCGATTGGAACTGGAAGTTCCAGAACGGGCGGCTGGAGGAGAAGTAGTGGCTGCGGCGCGTCAAGGCACTTCTTCGACGGTC
CCCGTCCGCCCGGCCTGCCGGCCCGCTCTGAAGAATTCATGGGCCGGCGAGCCTCCAGGACAAGGGGGCGGGCACTACTAACCAAGCCAAGCCTAGTTCTCGCCGATAGG
CGCTGGGAAGGGAAAAAGGGGTCTCTTTCCTCGCCCGATGGTAGAGGTCAATCCCCTATCACCTTCGGTGCCTCCTTGTGGTCCTTCTCTTTAGCTTTTCCTCGGCCTTT
AGAGTTAGAGCTAGTTGATAGGCCACTTTCCACATCTGATGCATTGAGATTTCATCTTGGATTTCGAACCTCAAGCCCTATCAAGCAAGGGATTCTTCGTCGGATTCGTT
CAAGCCATTGCGAATAA
mRNA sequence
ATGAGGCAGAAACTCGTCTCACGTACGGTTCGGAGGCCGAGCCCCACCCCAGCAGTAATGGTGCGGCTTAGGTCAACTAACACAAAGAAGATACAGTTCACTCAACGATT
GCCTTTGGGTTCCGAACTCCATATGGGGAAGGAGCGTTGTTGTTTGCGAGGTATCGATCATTTACATGGACCCACTTCTCATTCCATTTGTGGTAATTTTATGATCTATA
AACCGTCCCTAACGAACGATCGGCTCATCTTTGAGCATGATAAATCACTTCGTGCCGACCAGTTGCCAATAAACTTTCCGGCCTCATATGAGAATGGAAAACTGGAGCAT
TTTCTGCATCAGTGGATGACGAATCGCGAACATAATAATTTCTGGTTGACCATGTTCCCAGAAAAAAGATACTTTCGAGAAACGACGAGCACGACTGAAGTAGCTATACA
TACAAATCCATTTACGGATCTATATGCTTCGATTGGAACTGGAAGTTCCAGAACGGGCGGCTGGAGGAGAAGTAGTGGCTGCGGCGCGTCAAGGCACTTCTTCGACGGTC
CCCGTCCGCCCGGCCTGCCGGCCCGCTCTGAAGAATTCATGGGCCGGCGAGCCTCCAGGACAAGGGGGCGGGCACTACTAACCAAGCCAAGCCTAGTTCTCGCCGATAGG
CGCTGGGAAGGGAAAAAGGGGTCTCTTTCCTCGCCCGATGGTAGAGGTCAATCCCCTATCACCTTCGGTGCCTCCTTGTGGTCCTTCTCTTTAGCTTTTCCTCGGCCTTT
AGAGTTAGAGCTAGTTGATAGGCCACTTTCCACATCTGATGCATTGAGATTTCATCTTGGATTTCGAACCTCAAGCCCTATCAAGCAAGGGATTCTTCGTCGGATTCGTT
CAAGCCATTGCGAATAA
Protein sequence
MRQKLVSRTVRRPSPTPAVMVRLRSTNTKKIQFTQRLPLGSELHMGKERCCLRGIDHLHGPTSHSICGNFMIYKPSLTNDRLIFEHDKSLRADQLPINFPASYENGKLEH
FLHQWMTNREHNNFWLTMFPEKRYFRETTSTTEVAIHTNPFTDLYASIGTGSSRTGGWRRSSGCGASRHFFDGPRPPGLPARSEEFMGRRASRTRGRALLTKPSLVLADR
RWEGKKGSLSSPDGRGQSPITFGASLWSFSLAFPRPLELELVDRPLSTSDALRFHLGFRTSSPIKQGILRRIRSSHCE
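
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal, dependency-free illustration rather than the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), the names `CODON_TABLE` and `translate` are hypothetical, and only short fragments copied from the start and end of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace left over from wrapped text.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing unexpected characters are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS, then its last four codons (ending in TAA).
print(translate("ATGAGGCAGAAACTCGTCTCACGTACGGTT"))  # MRQKLVSRTV
print(translate("CATTGCGAATAA"))                    # HCE
```

Passing the full CDS text, line breaks included, to `translate` should give a string that can be compared directly with the listed protein sequence once its line breaks are removed.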