; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g34510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g34510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:25207749..25210507
RNA-Seq ExpressionMoc08g34510
SyntenyMoc08g34510
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]7.2e-4663.01Show/hide
Query:  IDKPKDN-MLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNA
        I KP D  +L AW CNND++  WI+NSVS++IAAS++Y  S  +IW+ELR RF+QSNGP IYQLRKEFV       TIE YYTKLKT+WQ L+EY  +N 
Subjt:  IDKPKDN-MLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNA

Query:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN
        CTCGGLK  ++H  SEY+M FLMGLN+SYA VRAQIL M P+P IN
Subjt:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN

KAA8519610.1 hypothetical protein F0562_013945 [Nyssa sinensis]5.3e-4130.39Show/hide
Query:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNAC-TC--
        +N+L++W  NN+++I WI+NSVS++I+AS++++ SA +IW +LRDRFQQ NGPRI+QL++E +       ++  Y+TKLKT+W++LS Y  + +C  C  
Subjt:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNAC-TC--

Query:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINKSAFFA-----------------SLSTSRYGQLIDLLHTHLSAAKSESITAMSSVSH
        G +K + +H   EY+M FLMGL++S++ VR Q+L MDPMPPIN   F                   S  T  +    D+  ++ S +++   +  S+  +
Subjt:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINKSAFFA-----------------SLSTSRYGQLIDLLHTHLSAAKSESITAMSSVSH

Query:  VDK-----LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT
          +        K++G TV  +R Y +                            S  SN+F +   V+  F + VLP+ S        + FS++     T
Subjt:  VDK-----LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT

Query:  SSVPLNGPLSGDEDTAANTINVDINSPT-----SYCLTDSIVQPFNVQPFEP--STELRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSY
        S VP + P+   +   +N++ V  + P+     S    + +V   +     P  S  L++S +  + P +L+DY CNLL+ S     +   +P+ NY+SY
Subjt:  SSVPLNGPLSGDEDTAANTINVDINSPT-----SYCLTDSIVQPFNVQPFEP--STELRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSY

Query:  KDFSAAHR
           S +H+
Subjt:  KDFSAAHR

KAA8543184.1 hypothetical protein F0562_021321 [Nyssa sinensis]2.7e-4532.92Show/hide
Query:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLS---NACTC
        +N+L++W  NN+++I WI+NSVS++I+AS++++ SA +IW +LRDRFQQ N PRI+QL++E +       ++  Y+TKLKT+W+ELS Y L+     C+C
Subjt:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLS---NACTC

Query:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN------------KSAFFASLSTSRYGQLIDLLHTHLS----AAKSESITAMSSVSHV
        GG+K + +H   EY+M FLMGL++S++ VR Q+L MDPMPPIN            +    +S S++  G +   + T ++    +    S  + SS S  
Subjt:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN------------KSAFFASLSTSRYGQLIDLLHTHLS----AAKSESITAMSSVSHV

Query:  DK------LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT
         K      +  K++G TV  +R Y +                            S  SN+F +   V   F D VLP+ S        +  S++     T
Subjt:  DK------LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT

Query:  SSVPLNGPLSGDEDTAANTINVDINSPTSYCLTDSIVQPFNVQP-FEPSTE----LRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSYKD
        S VP + P+    D+ +  +     SP+S   T +   P +  P   PS      LR+  R  + P +L+DYHCNLL+ S     +   +P+ NY+SY  
Subjt:  SSVPLNGPLSGDEDTAANTINVDINSPTSYCLTDSIVQPFNVQP-FEPSTE----LRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSYKD

Query:  FSAAHRQ
         S +H+Q
Subjt:  FSAAHRQ

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]7.0e-4965.99Show/hide
Query:  TIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNA
        TI KP  N+L+AWKCNND+I  WI+NSVS++IAAS++Y+ SA DIW+EL++RFQQS+ PRI+QLRKE VT       IEAYYTKLKTVWQEL++Y  +  
Subjt:  TIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNA

Query:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
        CTC GLK +   F SEYVM FLMGLNESYA +RAQIL MDP+PP+NK
Subjt:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK

XP_022155284.1 uncharacterized protein LOC111022420 [Momordica charantia]4.5e-88100Show/hide
Query:  MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT
        MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT
Subjt:  MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT

Query:  VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
        VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
Subjt:  VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK

TrEMBL top hitse value%identityAlignment
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 83.5e-4663.01Show/hide
Query:  IDKPKDN-MLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNA
        I KP D  +L AW CNND++  WI+NSVS++IAAS++Y  S  +IW+ELR RF+QSNGP IYQLRKEFV       TIE YYTKLKT+WQ L+EY  +N 
Subjt:  IDKPKDN-MLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNA

Query:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN
        CTCGGLK  ++H  SEY+M FLMGLN+SYA VRAQIL M P+P IN
Subjt:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN

A0A5J4ZLU4 RING-type E3 ubiquitin transferase2.6e-4130.39Show/hide
Query:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNAC-TC--
        +N+L++W  NN+++I WI+NSVS++I+AS++++ SA +IW +LRDRFQQ NGPRI+QL++E +       ++  Y+TKLKT+W++LS Y  + +C  C  
Subjt:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLSNAC-TC--

Query:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINKSAFFA-----------------SLSTSRYGQLIDLLHTHLSAAKSESITAMSSVSH
        G +K + +H   EY+M FLMGL++S++ VR Q+L MDPMPPIN   F                   S  T  +    D+  ++ S +++   +  S+  +
Subjt:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINKSAFFA-----------------SLSTSRYGQLIDLLHTHLSAAKSESITAMSSVSH

Query:  VDK-----LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT
          +        K++G TV  +R Y +                            S  SN+F +   V+  F + VLP+ S        + FS++     T
Subjt:  VDK-----LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT

Query:  SSVPLNGPLSGDEDTAANTINVDINSPT-----SYCLTDSIVQPFNVQPFEP--STELRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSY
        S VP + P+   +   +N++ V  + P+     S    + +V   +     P  S  L++S +  + P +L+DY CNLL+ S     +   +P+ NY+SY
Subjt:  SSVPLNGPLSGDEDTAANTINVDINSPT-----SYCLTDSIVQPFNVQPFEP--STELRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSY

Query:  KDFSAAHR
           S +H+
Subjt:  KDFSAAHR

A0A5J5BKC2 Uncharacterized protein1.3e-4532.92Show/hide
Query:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLS---NACTC
        +N+L++W  NN+++I WI+NSVS++I+AS++++ SA +IW +LRDRFQQ N PRI+QL++E +       ++  Y+TKLKT+W+ELS Y L+     C+C
Subjt:  DNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFV-------TIEAYYTKLKTVWQELSEYHLS---NACTC

Query:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN------------KSAFFASLSTSRYGQLIDLLHTHLS----AAKSESITAMSSVSHV
        GG+K + +H   EY+M FLMGL++S++ VR Q+L MDPMPPIN            +    +S S++  G +   + T ++    +    S  + SS S  
Subjt:  GGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPIN------------KSAFFASLSTSRYGQLIDLLHTHLS----AAKSESITAMSSVSHV

Query:  DK------LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT
         K      +  K++G TV  +R Y +                            S  SN+F +   V   F D VLP+ S        +  S++     T
Subjt:  DK------LQLKMIGRTVCLNRLYLL----------------------------SCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNI----TT

Query:  SSVPLNGPLSGDEDTAANTINVDINSPTSYCLTDSIVQPFNVQP-FEPSTE----LRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSYKD
        S VP + P+    D+ +  +     SP+S   T +   P +  P   PS      LR+  R  + P +L+DYHCNLL+ S     +   +P+ NY+SY  
Subjt:  SSVPLNGPLSGDEDTAANTINVDINSPTSYCLTDSIVQPFNVQP-FEPSTE----LRRSQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSYKD

Query:  FSAAHRQ
         S +H+Q
Subjt:  FSAAHRQ

A0A6J1CXR2 uncharacterized protein LOC1110152393.4e-4965.99Show/hide
Query:  TIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNA
        TI KP  N+L+AWKCNND+I  WI+NSVS++IAAS++Y+ SA DIW+EL++RFQQS+ PRI+QLRKE VT       IEAYYTKLKTVWQEL++Y  +  
Subjt:  TIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNA

Query:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
        CTC GLK +   F SEYVM FLMGLNESYA +RAQIL MDP+PP+NK
Subjt:  CTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK

A0A6J1DPT8 uncharacterized protein LOC1110224202.2e-88100Show/hide
Query:  MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT
        MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT
Subjt:  MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKT

Query:  VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
        VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK
Subjt:  VWQELSEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.8e-1631.79Show/hide
Query:  PKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNACTCG
        P   +   W+  N +++ W+MNS++  +  S++Y+++A  +W +LR  F      +IYQLR+   T       +E Y+ KL  VW ELSEY     C CG
Subjt:  PKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVT-------IEAYYTKLKTVWQELSEYHLSNACTCG

Query:  G-----LKQVLNHFNSEYVMMFLMG--LNESYAGVRAQILFMDPMPPINKS
        G      K+       E    FLMG  LN+ +  V  +I+F  P P ++++
Subjt:  G-----LKQVLNHFNSEYVMMFLMG--LNESYAGVRAQILFMDPMPPINKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCGTGCAATGCTAATTGCACTTTCAGCAAAAAACAAGGTCGGATTTATCACAGAACTATTGACAAGCCAAAGGATAACATGCTGTCTGCTTGGAAATGC
AACAACGATGTCATTATTTTATGGATAATGAACTCTGTATCTCGTGACATTGCTGCTAGTCTCGTCTATTCAGATTCAGCGAGTGACATATGGAATGAACTTCGA
GATCGTTTTCAACAGAGTAATGGACCGAGAATTTATCAATTACGAAAGGAATTTGTTACTATAGAGGCTTACTATACAAAACTAAAAACTGTTTGGCAAGAGTTG
AGCGAATACCATCTTTCGAACGCTTGTACTTGCGGAGGTTTGAAGCAAGTTTTGAATCATTTCAATTCTGAATATGTCATGATGTTCCTAATGGGACTCAATGAA
TCCTATGCCGGAGTTCGGGCACAAATCTTGTTTATGGATCCTATGCCTCCTATAAATAAGTCGGCTTTCTTTGCAAGCTTGAGCACTAGTCGATATGGTCAACTA
ATAGATCTTCTTCACACACATTTATCTGCTGCGAAATCTGAAAGTATTACTGCAATGTCATCCGTGTCTCATGTGGACAAGTTACAATTGAAGATGATTGGTAGG
ACTGTGTGCTTGAACAGATTATATTTGCTGTCGTGTTCCTCAAATGCTTTCACCAATAATACATATGTTGTCTTTGAGTTTTCTGATTTTGTACTCCCCTACCCT
TCACCGAACATGGCTCTTACTGACCCTATGGGATTTTCTAATATCACTACCTCATCCGTACCCTTGAACGGTCCACTTTCTGGTGATGAGGACACTGCTGCTAAT
ACCATTAATGTTGATATTAATTCACCTACTAGCTACTGCCTTACTGACTCAATTGTGCAGCCTTTTAATGTGCAGCCATTTGAACCTTCTACTGAGCTTCGGCGT
TCTCAACGTGCACGCTACCCACCTGGTTTTTTACGAGATTACCATTGCAATCTCTTATCTTCTTCTCCAGCTATTTTTGGGTCTTTCATTCCCCATCCTCTTGAG
AATTATCTTTCATACAAAGACTTCTCAGCTGCACATCGTCAAACTCAAAGTCATTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCGTGCAATGCTAATTGCACTTTCAGCAAAAAACAAGGTCGGATTTATCACAGAACTATTGACAAGCCAAAGGATAACATGCTGTCTGCTTGGAAATGC
AACAACGATGTCATTATTTTATGGATAATGAACTCTGTATCTCGTGACATTGCTGCTAGTCTCGTCTATTCAGATTCAGCGAGTGACATATGGAATGAACTTCGA
GATCGTTTTCAACAGAGTAATGGACCGAGAATTTATCAATTACGAAAGGAATTTGTTACTATAGAGGCTTACTATACAAAACTAAAAACTGTTTGGCAAGAGTTG
AGCGAATACCATCTTTCGAACGCTTGTACTTGCGGAGGTTTGAAGCAAGTTTTGAATCATTTCAATTCTGAATATGTCATGATGTTCCTAATGGGACTCAATGAA
TCCTATGCCGGAGTTCGGGCACAAATCTTGTTTATGGATCCTATGCCTCCTATAAATAAGTCGGCTTTCTTTGCAAGCTTGAGCACTAGTCGATATGGTCAACTA
ATAGATCTTCTTCACACACATTTATCTGCTGCGAAATCTGAAAGTATTACTGCAATGTCATCCGTGTCTCATGTGGACAAGTTACAATTGAAGATGATTGGTAGG
ACTGTGTGCTTGAACAGATTATATTTGCTGTCGTGTTCCTCAAATGCTTTCACCAATAATACATATGTTGTCTTTGAGTTTTCTGATTTTGTACTCCCCTACCCT
TCACCGAACATGGCTCTTACTGACCCTATGGGATTTTCTAATATCACTACCTCATCCGTACCCTTGAACGGTCCACTTTCTGGTGATGAGGACACTGCTGCTAAT
ACCATTAATGTTGATATTAATTCACCTACTAGCTACTGCCTTACTGACTCAATTGTGCAGCCTTTTAATGTGCAGCCATTTGAACCTTCTACTGAGCTTCGGCGT
TCTCAACGTGCACGCTACCCACCTGGTTTTTTACGAGATTACCATTGCAATCTCTTATCTTCTTCTCCAGCTATTTTTGGGTCTTTCATTCCCCATCCTCTTGAG
AATTATCTTTCATACAAAGACTTCTCAGCTGCACATCGTCAAACTCAAAGTCATTGCTGA
Protein sequenceShow/hide protein sequence
MESCNANCTFSKKQGRIYHRTIDKPKDNMLSAWKCNNDVIILWIMNSVSRDIAASLVYSDSASDIWNELRDRFQQSNGPRIYQLRKEFVTIEAYYTKLKTVWQEL
SEYHLSNACTCGGLKQVLNHFNSEYVMMFLMGLNESYAGVRAQILFMDPMPPINKSAFFASLSTSRYGQLIDLLHTHLSAAKSESITAMSSVSHVDKLQLKMIGR
TVCLNRLYLLSCSSNAFTNNTYVVFEFSDFVLPYPSPNMALTDPMGFSNITTSSVPLNGPLSGDEDTAANTINVDINSPTSYCLTDSIVQPFNVQPFEPSTELRR
SQRARYPPGFLRDYHCNLLSSSPAIFGSFIPHPLENYLSYKDFSAAHRQTQSHC