; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015449 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015449
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold10:15718663..15720855
RNA-Seq ExpressionSpg015449
SyntenySpg015449
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]1.6e-2226.67Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        M  S S++ D+ NV I  + +    RG T M EL  ++N  +R  +EYN++GQ VG NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG-------------------------------------
        L++DP +RA LWKEARK KNN  FDD T E   RI                       G+I G                                     
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG-------------------------------------

Query:  ----------------------SQS----KKSSEVESKASRTKTKGKEIVDEPEEVL------ESEEVLEAQCMNLGVG-----------------CPTI
                              SQS    KK+ E + +  +   KGK +V +PEE+L      E E +L+    +L +G                 CPTI
Subjt:  ----------------------SQS----KKSSEVESKASRTKTKGKEIVDEPEEVL------ESEEVLEAQCMNLGVG-----------------CPTI

Query:  HGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        HG+PLGA+N++V VD+I  EDV +PIP+ GEIETL+QA G+FVAWPR+LVIL  +KKA S A  +     TQSS++TD H
Subjt:  HGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.9e-3130.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.5e-3130.25Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  --------------------------------------------------------------------------------------------------EL
                                                                                                          +L
Subjt:  --------------------------------------------------------------------------------------------------EL

Query:  ANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASRT
        ++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+ 
Subjt:  ANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASRT

Query:  KTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPR
        K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWPR
Subjt:  KTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPR

Query:  ELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
         LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  ELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.9e-3130.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.9e-3130.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.7e-2226.67Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        M  S S++ D+ NV I  E +    RG T M EL  ++N  +R  +EYN++GQ VG NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG-------------------------------------
        L++DP +RA LWKEARK KNN  FDD TRE   RI                       G+I G                                     
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG-------------------------------------

Query:  --SQSKKSSEVE------------SKASRTKTKGKE------------IVDEPEEVLESEEVLEAQCMNLG-----------------------VGCPTI
           QSK  +E +            S  SR KTKGK+            +V E EE LE + + E + ++ G                       V CPTI
Subjt:  --SQSKKSSEVE------------SKASRTKTKGKE------------IVDEPEEVLESEEVLEAQCMNLG-----------------------VGCPTI

Query:  HGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        HG+PLGA N++V VD+   EDV +PIP+ G+IETL+QA G+FVAWPR+LVI+  +KKA S    +     TQSS++TD H
Subjt:  HGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X19.2e-3230.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X49.2e-3230.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

A0A6J1C398 uncharacterized protein LOC111007859 isoform X39.2e-3230.18Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR
        L++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+
Subjt:  LANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASR

Query:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP
         K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWP
Subjt:  TKTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWP

Query:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
        R LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  RELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X27.0e-3230.25Show/hide
Query:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------
        MS SSS++ D+ +V IH E +    RG TTM EL  ++N  +R  +EYN+QGQ +G NA KMQSFIG                                 
Subjt:  MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIG---------------------------------

Query:  --------------------------------------------------------------------------------------------------EL
                                                                                                          +L
Subjt:  --------------------------------------------------------------------------------------------------EL

Query:  ANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASRT
        ++DP++RAILWKEARKGKNNEYFDD TRE A RI                       G++ G                       Q  KS+   S  S+ 
Subjt:  ANDPTDRAILWKEARKGKNNEYFDDDTRERANRI-----------------------GKIEG----------------------SQSKKSSEVESKASRT

Query:  KTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPR
        K+KGKEIV+  EE+ +  E+ +E +  +L                  V CPT+HGVPLG +NV+V+VD++  E   IPIPV GEIETL+Q  G FVAWPR
Subjt:  KTKGKEIVDEPEEV-LESEEVLEAQCMNL-----------------GVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPR

Query:  ELVILNNQKKASSPAKPKMNVPITQSSEHTDTH
         LVIL+ +K  SS    +     TQ S+HTD H
Subjt:  ELVILNNQKKASSPAKPKMNVPITQSSEHTDTH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGTTCAAGCAGCAACAATGATGATAAAGTAAACGTTGCTATTCATATGGAGGCCAGACCGAATGTTGGACGAGGTCTCACTACTATGCGTGAGTTGGCAGGCGT
ACAAAATTTTAGACAACGCTTGGTTGTTGAATACAACAATCAAGGTCAGGCTGTTGGTACGAATGCAAACAAAATGCAGAGTTTCATCGGAGAATTGGCAAATGATCCTA
CCGATCGAGCTATTTTATGGAAAGAAGCGCGAAAAGGAAAAAATAATGAATATTTTGATGACGACACTAGAGAACGTGCTAATCGAATTGGTAAAATTGAAGGCTCACAA
TCAAAGAAGTCATCAGAGGTAGAAAGTAAGGCTTCGAGGACAAAAACAAAAGGAAAGGAGATTGTTGATGAGCCAGAAGAAGTGTTAGAGTCAGAAGAAGTGTTAGAGGC
ACAATGTATGAATCTAGGTGTCGGATGTCCAACGATTCATGGAGTACCACTAGGAGCCAATAATGTTCAAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCATAC
CAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCAGAAAAAGGCATCTTCTCCC
GCAAAACCTAAAATGAATGTGCCCATTACACAATCTTCTGAACATACGGATACCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGTTCAAGCAGCAACAATGATGATAAAGTAAACGTTGCTATTCATATGGAGGCCAGACCGAATGTTGGACGAGGTCTCACTACTATGCGTGAGTTGGCAGGCGT
ACAAAATTTTAGACAACGCTTGGTTGTTGAATACAACAATCAAGGTCAGGCTGTTGGTACGAATGCAAACAAAATGCAGAGTTTCATCGGAGAATTGGCAAATGATCCTA
CCGATCGAGCTATTTTATGGAAAGAAGCGCGAAAAGGAAAAAATAATGAATATTTTGATGACGACACTAGAGAACGTGCTAATCGAATTGGTAAAATTGAAGGCTCACAA
TCAAAGAAGTCATCAGAGGTAGAAAGTAAGGCTTCGAGGACAAAAACAAAAGGAAAGGAGATTGTTGATGAGCCAGAAGAAGTGTTAGAGTCAGAAGAAGTGTTAGAGGC
ACAATGTATGAATCTAGGTGTCGGATGTCCAACGATTCATGGAGTACCACTAGGAGCCAATAATGTTCAAGTGGTGGTGGATATGATCACAGGCGAAGATGTTCTCATAC
CAATTCCTGTGGTTGGAGAAATAGAGACGCTTAGTCAAGCAAAGGGTAGCTTTGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCAGAAAAAGGCATCTTCTCCC
GCAAAACCTAAAATGAATGTGCCCATTACACAATCTTCTGAACATACGGATACCCACTGA
Protein sequenceShow/hide protein sequence
MSGSSSNNDDKVNVAIHMEARPNVGRGLTTMRELAGVQNFRQRLVVEYNNQGQAVGTNANKMQSFIGELANDPTDRAILWKEARKGKNNEYFDDDTRERANRIGKIEGSQ
SKKSSEVESKASRTKTKGKEIVDEPEEVLESEEVLEAQCMNLGVGCPTIHGVPLGANNVQVVVDMITGEDVLIPIPVVGEIETLSQAKGSFVAWPRELVILNNQKKASSP
AKPKMNVPITQSSEHTDTH