; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g30370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g30370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr8:21770758..21777296
RNA-Seq ExpressionMoc08g30370
SyntenyMoc08g30370
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147186.1 uncharacterized protein LOC111016198 [Momordica charantia]1.4e-6356.3Show/hide
Query:  GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV---------------------------------
        GSFTIP+SI GKNVGH LCDLGA INL+PL VYQKLGIGEAR TT+TLQLA RSITHPEGK EDVLV                                 
Subjt:  GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV---------------------------------

Query:  -------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEPLE
                           QVT  +FNSIK+P D+EEC LLR++DDL+S+E+Q EELL+Q+++E+T+IF+ ++ EAKLIQ +PNE ++DR+YKK FE  E
Subjt:  -------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEPLE

Query:  LKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR
        LKD  QA LQ SV KA KLELK LP HL YAYLG+AETL + IAA+LAE+KE R
Subjt:  LKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]2.3e-5555.47Show/hide
Query:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV-------------------------------
        +PGSFTIP+ I GKNVGHALCDLGASINLMPLSVYQKLGIGEAR  T+TLQLA RSIT+ EGKIEDVLV                               
Subjt:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV-------------------------------

Query:  ---------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEP
                             QVTLS+FNSIKYP DVEEC  LRI+DDL SDEIQ EELLNQ++DELTRI                              
Subjt:  ---------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEP

Query:  LELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR
          +KD VQAPLQPSVVKA KLELK LP+HL YAYLGE ETL V IAA+LAEEKE R
Subjt:  LELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR

XP_022158490.1 uncharacterized protein LOC111024970 [Momordica charantia]2.2e-4231.2Show/hide
Query:  MDQPPRLPNPIVEELYGGDMAEPVVVPPQNAILLADNGDRAIRAYAVLALHGFHPVIAGPEIEAKRFELKPVMFQMLQIVGQFYGIPSEDPHLQ------
        MDQ PRLPNP+VE +  G++      PP N ILL D+G+R IRAYA  A+HGFHPVIAGP IEA+RFELK +MFQMLQ VGQF+G PSEDPHL       
Subjt:  MDQPPRLPNPIVEELYGGDMAEPVVVPPQNAILLADNGDRAIRAYAVLALHGFHPVIAGPEIEAKRFELKPVMFQMLQIVGQFYGIPSEDPHLQ------

Query:  ----------------------RISSNNYHW-------------------------------SDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMS
                               +S     W                               SDS+A+N+R N+ A  N  MA+L DQIANL NMVKNM+
Subjt:  ----------------------RISSNNYHW-------------------------------SDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMS

Query:  AAIASSTNPG-----------------------------------------SSKVMAISCSYC-------------------------------------
         A  SS +PG                                         S+ +  +   Y                                      
Subjt:  AAIASSTNPG-----------------------------------------SSKVMAISCSYC-------------------------------------

Query:  EGDHTYDSYLGN-----------------------PASWY---------------YSENP----------------------------------------
        + +H  +  L N                       PA                  Y   P                                        
Subjt:  EGDHTYDSYLGN-----------------------PASWY---------------YSENP----------------------------------------

Query:  ------------------------------------------GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHP
                                                  GSF IP+SI GKNVG+ALCDL ASINLMPLS+ +KL IG+AR TTITLQLA RSITHP
Subjt:  ------------------------------------------GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHP

Query:  EGKIEDVLVQVTLSVF
        EGKIEDVLVQV   +F
Subjt:  EGKIEDVLVQVTLSVF

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]3.6e-4034.49Show/hide
Query:  LQRISSNNYHWSDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMSAAIAS------STNPGSSKVMAISCSYCEGDHTYDSYLGNPASWYY-----
        L+RI+SNN  W+D R+   R   G      ++S+  Q+A++ N+++N++    S       T    ++  A SC YC  +HT+D    NPAS +Y     
Subjt:  LQRISSNNYHWSDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMSAAIAS------STNPGSSKVMAISCSYCEGDHTYDSYLGNPASWYY-----

Query:  -------------------------------SENPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDV
                                        ++PGSFTIPISI GK +G ALCDLG+SINLMPLS+Y+KLGIGEAR TT+TLQLA RS T+PEGKIED+
Subjt:  -------------------------------SENPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDV

Query:  LVQVTLSVFNS----IKYPTDVEECYLLR---ISDDLKSDEIQKEELLNQMKDELTRI-------FKARDNEAKLIQRQPNEPAT---------------
        L+QV   +F +    + Y  D +   +L    +       ++ K  +  +M D+           + A   E   +     +PAT               
Subjt:  LVQVTLSVFNS----IKYPTDVEECYLLR---ISDDLKSDEIQKEELLNQMKDELTRI-------FKARDNEAKLIQRQPNEPAT---------------

Query:  --DRI--------YKKTFEPLELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKE
          DRI        + +TFE LE +    +P++PS+ +A +L+LK LP +L YAYLG+ +TL +II+A L+  +E
Subjt:  --DRI--------YKKTFEPLELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKE

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]4.0e-3942.47Show/hide
Query:  ENPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQ-----------------------------
        ++PGSFTIP SI G++VG ALCDLGASINLMP+S+++KLGIGEAR TT+TLQLA RS+ HPEGKIEDVLVQ                             
Subjt:  ENPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQ-----------------------------

Query:  -----------------------VTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKE----ELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYK
                               VT +VFN++++P ++EEC  + + D + +++  KE    E      DEL  + +  DN+   +  +P +P     +K
Subjt:  -----------------------VTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKE----ELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYK

Query:  KTFEPLELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKE
        K FE LELK+    P +PS  +  KLELK LP+HL YAYLGE +TL VIIA+ L  E E
Subjt:  KTFEPLELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKE

TrEMBL top hitse value%identityAlignment
A0A6J1D1L0 uncharacterized protein LOC1110161986.5e-6456.3Show/hide
Query:  GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV---------------------------------
        GSFTIP+SI GKNVGH LCDLGA INL+PL VYQKLGIGEAR TT+TLQLA RSITHPEGK EDVLV                                 
Subjt:  GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV---------------------------------

Query:  -------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEPLE
                           QVT  +FNSIK+P D+EEC LLR++DDL+S+E+Q EELL+Q+++E+T+IF+ ++ EAKLIQ +PNE ++DR+YKK FE  E
Subjt:  -------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEPLE

Query:  LKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR
        LKD  QA LQ SV KA KLELK LP HL YAYLG+AETL + IAA+LAE+KE R
Subjt:  LKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR

A0A6J1DUN1 uncharacterized protein LOC1110245242.3e-3764.38Show/hide
Query:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQVTLSVFNS----IKYPTDVEECYLL-----
        +PG FTIP+SI GKNVGHALCDLG SINLMPLSVYQKLGIGEARL TITL+L  RSITHP+GKIEDVLVQV   +F +    + Y  D E   +L     
Subjt:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQVTLSVFNS----IKYPTDVEECYLL-----

Query:  --------RISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLI
                 I+DDLKSD IQKEELLNQ++ ELTRI KA D EAK+I
Subjt:  --------RISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLI

A0A6J1DV77 uncharacterized protein LOC1110238181.1e-5555.47Show/hide
Query:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV-------------------------------
        +PGSFTIP+ I GKNVGHALCDLGASINLMPLSVYQKLGIGEAR  T+TLQLA RSIT+ EGKIEDVLV                               
Subjt:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLV-------------------------------

Query:  ---------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEP
                             QVTLS+FNSIKYP DVEEC  LRI+DDL SDEIQ EELLNQ++DELTRI                              
Subjt:  ---------------------QVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFEP

Query:  LELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR
          +KD VQAPLQPSVVKA KLELK LP+HL YAYLGE ETL V IAA+LAEEKE R
Subjt:  LELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQR

A0A6J1DVZ9 uncharacterized protein LOC1110249701.1e-4231.2Show/hide
Query:  MDQPPRLPNPIVEELYGGDMAEPVVVPPQNAILLADNGDRAIRAYAVLALHGFHPVIAGPEIEAKRFELKPVMFQMLQIVGQFYGIPSEDPHLQ------
        MDQ PRLPNP+VE +  G++      PP N ILL D+G+R IRAYA  A+HGFHPVIAGP IEA+RFELK +MFQMLQ VGQF+G PSEDPHL       
Subjt:  MDQPPRLPNPIVEELYGGDMAEPVVVPPQNAILLADNGDRAIRAYAVLALHGFHPVIAGPEIEAKRFELKPVMFQMLQIVGQFYGIPSEDPHLQ------

Query:  ----------------------RISSNNYHW-------------------------------SDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMS
                               +S     W                               SDS+A+N+R N+ A  N  MA+L DQIANL NMVKNM+
Subjt:  ----------------------RISSNNYHW-------------------------------SDSRAINDRGNYGATSNTKMASLKDQIANLNNMVKNMS

Query:  AAIASSTNPG-----------------------------------------SSKVMAISCSYC-------------------------------------
         A  SS +PG                                         S+ +  +   Y                                      
Subjt:  AAIASSTNPG-----------------------------------------SSKVMAISCSYC-------------------------------------

Query:  EGDHTYDSYLGN-----------------------PASWY---------------YSENP----------------------------------------
        + +H  +  L N                       PA                  Y   P                                        
Subjt:  EGDHTYDSYLGN-----------------------PASWY---------------YSENP----------------------------------------

Query:  ------------------------------------------GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHP
                                                  GSF IP+SI GKNVG+ALCDL ASINLMPLS+ +KL IG+AR TTITLQLA RSITHP
Subjt:  ------------------------------------------GSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHP

Query:  EGKIEDVLVQVTLSVF
        EGKIEDVLVQV   +F
Subjt:  EGKIEDVLVQVTLSVF

A0A6J1DYF9 uncharacterized protein LOC1110246747.5e-3665.91Show/hide
Query:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQVTLSVFNSIKYPTDVEECYLLRISDDLKSD
        +P SFTIP+SI GKNVG+ALCDLGASINL+ LS+YQK  IG+AR TTITLQLA RSIT  EGKIEDVL+QVT SVFN+IKY  ++EEC LLRI+DDL ++
Subjt:  NPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQKLGIGEARLTTITLQLAYRSITHPEGKIEDVLVQVTLSVFNSIKYPTDVEECYLLRISDDLKSD

Query:  EIQKEELLNQMKDELTRIFKARDNEAKLIQRQ
        E+Q EELL+Q+++EL  IF+ +   +K+ Q Q
Subjt:  EIQKEELLNQMKDELTRIFKARDNEAKLIQRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAACCACCAAGGCTCCCCAATCCAATTGTGGAAGAACTATATGGAGGAGACATGGCTGAACCAGTAGTTGTGCCCCCTCAAAATGCCATTCTATTGGCAGACAA
TGGAGATAGAGCTATTAGGGCTTATGCTGTGCTAGCACTTCATGGTTTTCATCCAGTCATAGCAGGTCCAGAGATAGAAGCTAAACGATTTGAATTGAAACCTGTGATGT
TTCAAATGCTGCAAATAGTGGGACAATTCTATGGAATTCCATCTGAAGACCCACATCTCCAGAGAATTTCTTCCAATAACTACCATTGGTCAGACTCTAGAGCTATAAAT
GACAGAGGTAATTATGGAGCCACTAGCAATACGAAAATGGCATCCCTGAAGGATCAAATAGCAAACCTGAACAACATGGTAAAGAACATGAGTGCTGCTATAGCATCGTC
CACTAACCCAGGGTCAAGCAAAGTAATGGCAATCTCATGTTCTTACTGTGAAGGTGATCATACTTATGACTCTTACCTTGGAAACCCTGCATCATGGTATTATTCGGAAA
ATCCTGGGAGTTTTACCATTCCTATTTCTATAAGAGGGAAGAATGTAGGGCATGCATTGTGTGACCTGGGCGCTAGCATCAACTTGATGCCCTTATCAGTGTATCAAAAG
TTAGGGATTGGTGAAGCAAGACTAACAACCATCACTTTACAGTTGGCATATAGGTCTATTACACATCCGGAAGGCAAGATAGAGGATGTTTTGGTTCAGGTGACATTATC
TGTTTTTAACTCCATTAAATATCCTACTGATGTGGAAGAATGTTATTTGTTAAGGATTTCAGATGATTTGAAGAGTGATGAAATACAAAAGGAAGAGTTGTTGAACCAAA
TGAAGGACGAGCTAACCAGAATTTTCAAAGCAAGAGACAATGAGGCCAAGCTGATCCAACGCCAGCCGAATGAACCTGCTACTGATAGAATCTATAAGAAAACGTTTGAA
CCGCTGGAATTGAAGGACATGGTACAAGCGCCACTGCAGCCATCCGTGGTGAAGGCCCTCAAGTTAGAGCTGAAATTCCTACCAGCACACCTTAACTATGCTTATCTGGG
AGAAGCAGAAACACTGCTAGTCATCATTGCAGCAGAATTAGCAGAAGAAAAAGAGCAAAGGAAAGACTTTACCAAAAAGTCGCCAGAGCTAGCAAAAGCCCCGCCTCAAC
TCGACCCGTTACCCGAAAATGCTGCCCCAAAAAGAGAAATCCCTTCCCCGCCTAAAAAGAAACCTGCTGCCAAGAGAGGAAAGAAAGTCACGAAAGGAAAGAAGCAGCCG
CCTGTTCTAGCAGAAGAGGAGGAGCAGCTAGTTCAGGGTAGCCAAACCGAGGCAGTCGGTCAGACTGCTGAGCATAAAGAAGTACAGCATGCCGAGATAGAGGCAGAAAG
AGGCCCAGAGCTGTCGCCTGAAGCCAGATGCAGAGTCAAGGATTTTATGCAAGCTCTTCCAAGCGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCAACCACCAAGGCTCCCCAATCCAATTGTGGAAGAACTATATGGAGGAGACATGGCTGAACCAGTAGTTGTGCCCCCTCAAAATGCCATTCTATTGGCAGACAA
TGGAGATAGAGCTATTAGGGCTTATGCTGTGCTAGCACTTCATGGTTTTCATCCAGTCATAGCAGGTCCAGAGATAGAAGCTAAACGATTTGAATTGAAACCTGTGATGT
TTCAAATGCTGCAAATAGTGGGACAATTCTATGGAATTCCATCTGAAGACCCACATCTCCAGAGAATTTCTTCCAATAACTACCATTGGTCAGACTCTAGAGCTATAAAT
GACAGAGGTAATTATGGAGCCACTAGCAATACGAAAATGGCATCCCTGAAGGATCAAATAGCAAACCTGAACAACATGGTAAAGAACATGAGTGCTGCTATAGCATCGTC
CACTAACCCAGGGTCAAGCAAAGTAATGGCAATCTCATGTTCTTACTGTGAAGGTGATCATACTTATGACTCTTACCTTGGAAACCCTGCATCATGGTATTATTCGGAAA
ATCCTGGGAGTTTTACCATTCCTATTTCTATAAGAGGGAAGAATGTAGGGCATGCATTGTGTGACCTGGGCGCTAGCATCAACTTGATGCCCTTATCAGTGTATCAAAAG
TTAGGGATTGGTGAAGCAAGACTAACAACCATCACTTTACAGTTGGCATATAGGTCTATTACACATCCGGAAGGCAAGATAGAGGATGTTTTGGTTCAGGTGACATTATC
TGTTTTTAACTCCATTAAATATCCTACTGATGTGGAAGAATGTTATTTGTTAAGGATTTCAGATGATTTGAAGAGTGATGAAATACAAAAGGAAGAGTTGTTGAACCAAA
TGAAGGACGAGCTAACCAGAATTTTCAAAGCAAGAGACAATGAGGCCAAGCTGATCCAACGCCAGCCGAATGAACCTGCTACTGATAGAATCTATAAGAAAACGTTTGAA
CCGCTGGAATTGAAGGACATGGTACAAGCGCCACTGCAGCCATCCGTGGTGAAGGCCCTCAAGTTAGAGCTGAAATTCCTACCAGCACACCTTAACTATGCTTATCTGGG
AGAAGCAGAAACACTGCTAGTCATCATTGCAGCAGAATTAGCAGAAGAAAAAGAGCAAAGGAAAGACTTTACCAAAAAGTCGCCAGAGCTAGCAAAAGCCCCGCCTCAAC
TCGACCCGTTACCCGAAAATGCTGCCCCAAAAAGAGAAATCCCTTCCCCGCCTAAAAAGAAACCTGCTGCCAAGAGAGGAAAGAAAGTCACGAAAGGAAAGAAGCAGCCG
CCTGTTCTAGCAGAAGAGGAGGAGCAGCTAGTTCAGGGTAGCCAAACCGAGGCAGTCGGTCAGACTGCTGAGCATAAAGAAGTACAGCATGCCGAGATAGAGGCAGAAAG
AGGCCCAGAGCTGTCGCCTGAAGCCAGATGCAGAGTCAAGGATTTTATGCAAGCTCTTCCAAGCGAATAG
Protein sequenceShow/hide protein sequence
MDQPPRLPNPIVEELYGGDMAEPVVVPPQNAILLADNGDRAIRAYAVLALHGFHPVIAGPEIEAKRFELKPVMFQMLQIVGQFYGIPSEDPHLQRISSNNYHWSDSRAIN
DRGNYGATSNTKMASLKDQIANLNNMVKNMSAAIASSTNPGSSKVMAISCSYCEGDHTYDSYLGNPASWYYSENPGSFTIPISIRGKNVGHALCDLGASINLMPLSVYQK
LGIGEARLTTITLQLAYRSITHPEGKIEDVLVQVTLSVFNSIKYPTDVEECYLLRISDDLKSDEIQKEELLNQMKDELTRIFKARDNEAKLIQRQPNEPATDRIYKKTFE
PLELKDMVQAPLQPSVVKALKLELKFLPAHLNYAYLGEAETLLVIIAAELAEEKEQRKDFTKKSPELAKAPPQLDPLPENAAPKREIPSPPKKKPAAKRGKKVTKGKKQP
PVLAEEEEQLVQGSQTEAVGQTAEHKEVQHAEIEAERGPELSPEARCRVKDFMQALPSE