; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g27460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g27460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr8:19859358..19869158
RNA-Seq ExpressionMoc08g27460
SyntenyMoc08g27460
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151685.1 uncharacterized protein LOC111019601 [Momordica charantia]1.3e-12291.45Show/hide
Query:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL
        GNNIKSRFKNA FYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNS  SVN  LQEARELPITKIVEFI ELL
Subjt:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL

Query:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV
        QRWFHERRT WSTQNTSHSDYAEERLAVQFEKSRRYTVKP+DWCMFHV+D GLDGTVDLNART TCM+FQYMGIPCSHAIAA RHKNIN HTLIDPCYSV
Subjt:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV

Query:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        D+L+GAYAEPIL VGHMSEWKRP DY PIP+QPP
Subjt:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

XP_022154925.1 uncharacterized protein LOC111022071 [Momordica charantia]1.9e-22566.92Show/hide
Query:  MEIGKSLFIRFGGTWEDNTNSYVDGELKEIIVPLTVTYKELKNRLYRLMKVDQNGYDLVIRIPYHLACDSPPMFVTDDDDLRFALVQEQVSKVLLFVSTV
        MEIGKSLFIRFGGTWEDNTNSYV GELK IIVPLTVTYKELKNRL+RLMKVDQNGYDLVIR+PYHLACDSPPMFVTDDDDLRFALVQEQVSKV LFVST+
Subjt:  MEIGKSLFIRFGGTWEDNTNSYVDGELKEIIVPLTVTYKELKNRLYRLMKVDQNGYDLVIRIPYHLACDSPPMFVTDDDDLRFALVQEQVSKVLLFVSTV

Query:  PCESIDIQSSRLNQDGEQSIPNGGSVPEEGFTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPATRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL
        P ESIDIQSSRLNQDGEQSIPNGGSVPEEG T VDEWGLNDVYMQDMYSSD                                                 
Subjt:  PCESIDIQSSRLNQDGEQSIPNGGSVPEEGFTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPATRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL

Query:  NNTSQVQHASPLTEHVFPSVEPSSSDPGNVGDHQIPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAV
             ++HASPLT      VEPSSSDP NV   QIPVPQSPSTVTIEQA RYNLLPGGSELHVGKIFVSKQDLRMVLSN AMRSNREYKVSRSTKSKF V
Subjt:  NNTSQVQHASPLTEHVFPSVEPSSSDPGNVGDHQIPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAV

Query:  RCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRG
        RCIDNTCNWRVAAH VG+                              VVANLIKD VAGTGRIYKIKHIKEDVRKE+GVNISYDK HRARELAY IVRG
Subjt:  RCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRG

Query:  RSEDSYMHLHAYGEAIKIENP---------------------------------------GNNIKSRFK-----------NAAFYKL-------------
        R +DSYMHLHAYGEAIKIENP                                       G ++K +FK           N   Y L             
Subjt:  RSEDSYMHLHAYGEAIKIENP---------------------------------------GNNIKSRFK-----------NAAFYKL-------------

Query:  ----------------------YQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVE
                                DAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYEN TTNS ESVN LL+EARELPITKIVE
Subjt:  ----------------------YQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVE

Query:  FIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED
        FIRELLQRWFHERR HWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED
Subjt:  FIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED

XP_022154997.1 uncharacterized protein LOC111022140 [Momordica charantia]7.4e-12157.87Show/hide
Query:  GGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKD
        GGSELHVGKIFVSKQDL MV+SN AMRSNREYKVSRSTKSKF VRCI+NTCN RVA H VG+                              +VANLIKD
Subjt:  GGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKD

Query:  RVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENP-----------------------------------
        RVAG GRIY IKHIKEDVRKEFGVN SYDK HRARELAYAIVRGR EDSYMHLH YGEAIKIE P                                   
Subjt:  RVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENP-----------------------------------

Query:  ----GNNIKSRFKNA--------------------------AFYKLY--------------------QDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQE
            G ++K +F+                            A +K +                     DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQE
Subjt:  ----GNNIKSRFKNA--------------------------AFYKLY--------------------QDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQE

Query:  IGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED
        IG+ERWARCYQVGRRYENMTTNST+S                             ERRTHWSTQNTSHSDYA+ERLA+QFEKSRRYTVKPVDWCMFHVED
Subjt:  IGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED

Query:  SGLDGTVDLNARTCTCMKFQYMGIPCSHAIAA
        +GLDGTVDLNA TCTCM+FQYMGIPCSHAIAA
Subjt:  SGLDGTVDLNARTCTCMKFQYMGIPCSHAIAA

XP_022154997.1 uncharacterized protein LOC111022140 [Momordica charantia]1.6e-0380.65Show/hide
Query:  MDVVLGQVEYFIPSWVDVDVVYSRFCIKDHW
        MDVVLGQVE FIP+WVDVDVVYS   I+DHW
Subjt:  MDVVLGQVEYFIPSWVDVDVVYSRFCIKDHW

XP_022156308.1 uncharacterized protein LOC111023235 [Momordica charantia]3.7e-12893.16Show/hide
Query:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL
        GNNIK+RFKNAAFYKLYQDAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIGIERWARCYQVGRRYENMTTNS ESVN LL+EARELPITKIVEFI +LL
Subjt:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL

Query:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV
        QRWFHERRTHWSTQNTSHSDYAEERLA+QFEKSRRYTVKPVDWCMFHVED GLD TVDLNARTCTCM+FQYMGIPCSHAIAA RHKNINCHTLIDPCY+V
Subjt:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV

Query:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        D+L+GAYAEPILPVGHMSEWKRPADYQPIP+QPP
Subjt:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

XP_022159086.1 uncharacterized protein LOC111025530 [Momordica charantia]4.6e-16365.82Show/hide
Query:  IPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------
        IP+  SPS+    +  R       S+L++G+I   K +L   +       NREYK SRSTKSKF VRCIDNTCNWRVAAH VG+                
Subjt:  IPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------

Query:  ------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENPG------------
                      VVANLIKDRVA T RIYKIKHIKEDVR+EF VNISYDK HRARELAYAIVRGRSEDSYMHLHAYG+AIKIENPG            
Subjt:  ------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENPG------------

Query:  -----------------------------------NNIKSRF---KNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVG
                                            N+K          F     DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ERWARCYQVG
Subjt:  -----------------------------------NNIKSRF---KNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVG

Query:  RRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNART
        RRYENMTTNS ESVNVLL+EA ELPITKIVEFIR+LLQRWFH RRTHWSTQNTSHSDYAEE+LA+QFEKSRRYTVKPVDWCMFHVED GLDGTVDLNART
Subjt:  RRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNART

Query:  CTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        CTCM+FQYMGIPCSHAIAA RHKNINCHTLIDPCYSVD+L+ AYAEPILP+GHMSEWKRPA+YQ IP+QPP
Subjt:  CTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

TrEMBL top hitse value%identityAlignment
A0A6J1CWE6 protein FAR-RED ELONGATED HYPOCOTYL 3-like3.7e-11891.4Show/hide
Query:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL
        GNNIK+RFKNAAFYKLYQDA YAYRKSQFTYYWNQILS+GSG+LAKYLQEIG+ERWARCYQVGRRYENMTTNS ESVN LL++ARELPITKIVEFIR+LL
Subjt:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL

Query:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV
        QRWFHERRTHWSTQNTSHSDYAEERLA+QFEKSRRYTVKPVDWCMF VED GLDGTVDLNARTCTCM+FQYMGIPCSHAIA  RHKNINCHTLIDPCYSV
Subjt:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV

Query:  DALVGAYAEPILPVGHMSEWK
        D+L+GAYAE ILPVGHMSEWK
Subjt:  DALVGAYAEPILPVGHMSEWK

A0A6J1DBW0 uncharacterized protein LOC1110196016.5e-12391.45Show/hide
Query:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL
        GNNIKSRFKNA FYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNS  SVN  LQEARELPITKIVEFI ELL
Subjt:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL

Query:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV
        QRWFHERRT WSTQNTSHSDYAEERLAVQFEKSRRYTVKP+DWCMFHV+D GLDGTVDLNART TCM+FQYMGIPCSHAIAA RHKNIN HTLIDPCYSV
Subjt:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV

Query:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        D+L+GAYAEPIL VGHMSEWKRP DY PIP+QPP
Subjt:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

A0A6J1DL08 uncharacterized protein LOC1110220719.1e-22666.92Show/hide
Query:  MEIGKSLFIRFGGTWEDNTNSYVDGELKEIIVPLTVTYKELKNRLYRLMKVDQNGYDLVIRIPYHLACDSPPMFVTDDDDLRFALVQEQVSKVLLFVSTV
        MEIGKSLFIRFGGTWEDNTNSYV GELK IIVPLTVTYKELKNRL+RLMKVDQNGYDLVIR+PYHLACDSPPMFVTDDDDLRFALVQEQVSKV LFVST+
Subjt:  MEIGKSLFIRFGGTWEDNTNSYVDGELKEIIVPLTVTYKELKNRLYRLMKVDQNGYDLVIRIPYHLACDSPPMFVTDDDDLRFALVQEQVSKVLLFVSTV

Query:  PCESIDIQSSRLNQDGEQSIPNGGSVPEEGFTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPATRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL
        P ESIDIQSSRLNQDGEQSIPNGGSVPEEG T VDEWGLNDVYMQDMYSSD                                                 
Subjt:  PCESIDIQSSRLNQDGEQSIPNGGSVPEEGFTGVDEWGLNDVYMQDMYSSDCIYTQDSELVAPATRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDEL

Query:  NNTSQVQHASPLTEHVFPSVEPSSSDPGNVGDHQIPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAV
             ++HASPLT      VEPSSSDP NV   QIPVPQSPSTVTIEQA RYNLLPGGSELHVGKIFVSKQDLRMVLSN AMRSNREYKVSRSTKSKF V
Subjt:  NNTSQVQHASPLTEHVFPSVEPSSSDPGNVGDHQIPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAV

Query:  RCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRG
        RCIDNTCNWRVAAH VG+                              VVANLIKD VAGTGRIYKIKHIKEDVRKE+GVNISYDK HRARELAY IVRG
Subjt:  RCIDNTCNWRVAAHFVGE----------------------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRG

Query:  RSEDSYMHLHAYGEAIKIENP---------------------------------------GNNIKSRFK-----------NAAFYKL-------------
        R +DSYMHLHAYGEAIKIENP                                       G ++K +FK           N   Y L             
Subjt:  RSEDSYMHLHAYGEAIKIENP---------------------------------------GNNIKSRFK-----------NAAFYKL-------------

Query:  ----------------------YQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVE
                                DAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYEN TTNS ESVN LL+EARELPITKIVE
Subjt:  ----------------------YQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVE

Query:  FIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED
        FIRELLQRWFHERR HWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED
Subjt:  FIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVED

A0A6J1DQ99 uncharacterized protein LOC1110232351.8e-12893.16Show/hide
Query:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL
        GNNIK+RFKNAAFYKLYQDAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIGIERWARCYQVGRRYENMTTNS ESVN LL+EARELPITKIVEFI +LL
Subjt:  GNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELL

Query:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV
        QRWFHERRTHWSTQNTSHSDYAEERLA+QFEKSRRYTVKPVDWCMFHVED GLD TVDLNARTCTCM+FQYMGIPCSHAIAA RHKNINCHTLIDPCY+V
Subjt:  QRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSV

Query:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        D+L+GAYAEPILPVGHMSEWKRPADYQPIP+QPP
Subjt:  DALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

A0A6J1E2V3 uncharacterized protein LOC1110255302.2e-16365.82Show/hide
Query:  IPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------
        IP+  SPS+    +  R       S+L++G+I   K +L   +       NREYK SRSTKSKF VRCIDNTCNWRVAAH VG+                
Subjt:  IPVPQSPSTVTIEQATRYNLLPGGSELHVGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGE----------------

Query:  ------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENPG------------
                      VVANLIKDRVA T RIYKIKHIKEDVR+EF VNISYDK HRARELAYAIVRGRSEDSYMHLHAYG+AIKIENPG            
Subjt:  ------------QLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAYAIVRGRSEDSYMHLHAYGEAIKIENPG------------

Query:  -----------------------------------NNIKSRF---KNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVG
                                            N+K          F     DAAYAYRKSQFTYYWNQILS+GSGSLAKYLQEIG+ERWARCYQVG
Subjt:  -----------------------------------NNIKSRF---KNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVG

Query:  RRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNART
        RRYENMTTNS ESVNVLL+EA ELPITKIVEFIR+LLQRWFH RRTHWSTQNTSHSDYAEE+LA+QFEKSRRYTVKPVDWCMFHVED GLDGTVDLNART
Subjt:  RRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNART

Query:  CTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        CTCM+FQYMGIPCSHAIAA RHKNINCHTLIDPCYSVD+L+ AYAEPILP+GHMSEWKRPA+YQ IP+QPP
Subjt:  CTCMKFQYMGIPCSHAIAAERHKNINCHTLIDPCYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase9.6e-1025.21Show/hide
Query:  LYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQN
        L  +A  + +K +F  Y  +I    +    K+L +    +WA  +  GRRY  M  + TE++  + +  R++ +   V  +   L+  F E     S  +
Subjt:  LYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNVLLQEARELPITKIVEFIRELLQRWFHERRTHWSTQN

Query:  TSHSDYAEERLAVQFEKSRR------YTVKPVDWCMFHV-------------EDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDP
          H D   E +  + E+          T+ P++   + V              +    G V LN  TCTC +FQ    PC HA+A      IN    +D 
Subjt:  TSHSDYAEERLAVQFEKSRR------YTVKPVDWCMFHV-------------EDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAERHKNINCHTLIDP

Query:  CYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP
        CY+V+     Y+    PV  +S W  P  Y    + PP
Subjt:  CYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGGGGTAAAAATACCTGGGACACGTGGAGGACGTTCATTGGTGTTCAAGGAATAAAGTATCCCAGGTCCACCTCGAACCTGTCGAGGAGACTGGGTTCG
CCCATCGACAAGGCTAAGTTGTTCACTCTTGAGTTCAGGTCAAGTCGGGGACCGAGTTCGAGCTTGGTTCGTTTAACAGCGTTGTGCACAATGGACCATCAGTTG
AGAATTAAGGAGAATGACCGCTTTTCAGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAACTCACAGTGGACCAACTTGAT
ATGTTTCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGTAGTGGTGTAATTCATCACTTTCTGTCAAGGGAGGTTGCTGGGAGCAGT
GACGACAGCATGAGTGGTCCAGAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTAC
AAGCAGACTGATTTCGAGGACGACGAGGACGCCGTTAAAGTGACATTAATTTTGTACATGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGAC
ATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGATTGGGGTTCTGATGTCTTGAGTATGACAGTTAACGGTCTGAAGCGTGCGATGAAT
GGAAAGGTTGCGTTATACAAGAACAAAGTAAGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACCGAGATTTCCGCTTGCGTTTCAGGTGTGGATATATAAG
GTTGTCCCATCTCTCATCACTCCTGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGATATTCGTGCAATAGAGTCGTCGGTACAAAAGATCTG
GGGGAGGAGGTCCTTGGTTCAGCGGTGTTGGCCATATCTTATCCACTCGTGGAGACGGAACTGGATAAGGACTACCAAAGGTGTCCATTGGACGAAAGAGAGGTG
GTTGATTTAAATGCGCCTGGGTGTTCCACCTCCGATAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCGCCAAAGACGATCTCCCACTCGAC
GATGCGCATTCGTTGGAAACGAATGTACAGACAATGCCGGATGAGTCTTCGGATATGCCACGTACAGAGGCCGCATCTGAAGGTGGGCAACGGACACCGGTCGAA
GTACTTCGACCAAGTACTTCTATTCGGTCAAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACATCATGCGATGCTTGCCCTACACAACGGCACGACACC
CGTCGATCGAATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACAGATTTGGGGGAGGTCAAGTCCGACTTG
AGTGAAATGAAACTCATGCTTCAACGGTTGTGTCAGATCGATAGGCGAGAGGTGAATATTGGTGCCTCTCTGTCGGATACAGTCCACGTGTCACATCCATTGGTA
TCCAATGTTATCCTCGAGCATGATGGGGATGCTGATGACAATCAACCTGGAGGTTCCTATGCTGGAAAGGAGGACGATGTCGTTCCTGTAGAAGCGTCGTTGCAT
GAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAACCCTGCGTGTATTGTCGATTCGGTGGAGTTGGACGTT
GCAGTGGTGACACCCATTGTTTCGACAGTGATGGTGGAACTCGAAATGGCACCACCAATAGTACAAGATCCACAATCAGAGACGACGTCCGATCCAACCTTCGAG
CCTTCTGCCTCAACCAACATTGATGGTCCGTGTGGCATGATCCATGGGTCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCGGATACAACCCCCACT
ACTCAACCTATTCCCACACCTACACCAGCATATATGACTCCCACCCCTCAACCTATTCCCACCCTTACACCAGTTGAAAACACCACCAGAAGGACCGAGCGTAAG
TGGACAGAAACGAAACCATTCAGTCCGGAGGACACGCATCGGCAGAAGAAGAAGCAGAAGATGGTGGATGTGGATCTCGTACCTGCCAGCCAGGACCCTGGGAAT
GACCAAACAAACGCGGCCGTCTACAACTTGAATGTGCATAGTGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAGAAGAGGTCGAGACCGCTCGGTA
GTCGGCCGCGCCTGTGCGTTCATAAGTTTTCTATCCTGGACCACTACAAATGGCAAGTTATTGCCGCTGCAGGTGGGTCCCTACGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGACCAATGCTTGGGACGAGTATAAGGAATGCATGGATGTCGTGCTGGGTCAGGTGGAATATTTCATTCCATCCTGGGTGGACGTCGACGTAGTG
TACAGCCGGTTCTGTATCAAGGATCACTGGACATCGGCCGTTGAGTCATACAATCTCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGTCGATTGCAAGCT
GAAGAAGACTCCGTGGCGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGGATGGAGATTGGTAAAAGTTTATTCATAAGATTTGGTGGTACGTGGGAAGAC
AACACAAATTCGTATGTCGACGGTGAGCTGAAAGAAATAATTGTCCCACTTACAGTAACGTATAAAGAACTGAAAAATCGGTTGTACAGACTGATGAAAGTCGAC
CAGAATGGGTACGATCTGGTAATTAGGATACCGTACCACTTGGCATGCGATTCACCGCCAATGTTCGTAACGGATGACGATGACCTCCGATTTGCATTAGTACAA
GAACAGGTTTCTAAAGTCCTACTGTTTGTATCGACCGTCCCTTGCGAAAGCATTGACATACAGTCATCTAGATTAAACCAAGATGGAGAGCAATCCATTCCTAAT
GGAGGATCTGTACCGGAAGAAGGTTTCACGGGGGTAGATGAGTGGGGGTTGAACGACGTATACATGCAAGACATGTACAGTTCAGACTGTATATACACTCAAGAC
TCGGAATTGGTCGCCCCCGCGACACGCATGCCTCCCGTAATGCACGTCTCATCAGATGAGATAAATAATACAGCACAAGTACAACATGCAAGTCCTCGTACGGTA
CATGTATTCCCATTTGATGAATTAAATAATACATCACAAGTACAACATGCAAGTCCTTTGACAGAACATGTTTTCCCATCAGTAGAACCCTCATCCAGTGACCCT
GGTAATGTTGGTGATCACCAAATACCAGTACCTCAATCCCCAAGCACCGTGACTATAGAACAGGCCACAAGGTACAACCTTTTACCAGGCGGATCGGAACTGCAC
GTGGGGAAGATATTCGTTTCAAAGCAAGACCTGCGTATGGTGCTGTCAAATACAGCGATGCGATCTAATCGTGAATACAAGGTCAGTAGGTCAACGAAATCAAAG
TTTGCTGTTCGCTGCATCGACAATACATGCAATTGGAGGGTTGCGGCCCACTTCGTCGGCGAGCAGCTGGTCGTAGCGAATCTTATTAAGGATAGAGTTGCTGGA
ACCGGTCGTATTTACAAGATAAAACATATTAAGGAGGATGTCCGTAAAGAGTTCGGGGTGAACATAAGTTATGACAAGGTCCATCGAGCACGAGAGTTAGCATAC
GCTATTGTTAGGGGCAGATCGGAGGACTCCTACATGCATCTCCATGCGTACGGTGAAGCGATAAAAATAGAGAATCCAGGGAACAACATAAAGTCTCGATTCAAG
AATGCTGCATTTTACAAATTGTATCAGGATGCAGCGTATGCGTATCGGAAGTCACAGTTCACGTACTACTGGAACCAGATACTATCGATTGGATCGGGTTCACTT
GCCAAATACTTACAAGAAATTGGGATAGAACGGTGGGCCCGATGCTACCAAGTTGGTAGAAGATATGAAAACATGACGACAAACAGCACTGAGTCGGTAAATGTC
CTCCTTCAAGAGGCTAGAGAGTTACCTATCACTAAGATTGTCGAGTTCATCCGCGAATTGCTACAAAGATGGTTCCACGAAAGAAGGACTCACTGGTCCACCCAA
AACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCAGTACAGTTTGAGAAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCATGTTGAG
GACAGTGGCCTGGACGGGACGGTTGATTTGAATGCCCGTACATGTACATGCATGAAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGCAGAGAGA
CACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTATAGTGTGGACGCCCTAGTTGGTGCCTACGCCGAACCAATCTTACCGGTAGGCCACATGTCGGAA
TGGAAAAGGCCAGCTGATTACCAGCCTATTCCCATCCAACCACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGGGGGTAAAAATACCTGGGACACGTGGAGGACGTTCATTGGTGTTCAAGGAATAAAGTATCCCAGGTCCACCTCGAACCTGTCGAGGAGACTGGGTTCG
CCCATCGACAAGGCTAAGTTGTTCACTCTTGAGTTCAGGTCAAGTCGGGGACCGAGTTCGAGCTTGGTTCGTTTAACAGCGTTGTGCACAATGGACCATCAGTTG
AGAATTAAGGAGAATGACCGCTTTTCAGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAACTCACAGTGGACCAACTTGAT
ATGTTTCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGTAGTGGTGTAATTCATCACTTTCTGTCAAGGGAGGTTGCTGGGAGCAGT
GACGACAGCATGAGTGGTCCAGAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGGTTTAC
AAGCAGACTGATTTCGAGGACGACGAGGACGCCGTTAAAGTGACATTAATTTTGTACATGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGAC
ATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGATTGGGGTTCTGATGTCTTGAGTATGACAGTTAACGGTCTGAAGCGTGCGATGAAT
GGAAAGGTTGCGTTATACAAGAACAAAGTAAGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACCGAGATTTCCGCTTGCGTTTCAGGTGTGGATATATAAG
GTTGTCCCATCTCTCATCACTCCTGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGATATTCGTGCAATAGAGTCGTCGGTACAAAAGATCTG
GGGGAGGAGGTCCTTGGTTCAGCGGTGTTGGCCATATCTTATCCACTCGTGGAGACGGAACTGGATAAGGACTACCAAAGGTGTCCATTGGACGAAAGAGAGGTG
GTTGATTTAAATGCGCCTGGGTGTTCCACCTCCGATAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCGCCAAAGACGATCTCCCACTCGAC
GATGCGCATTCGTTGGAAACGAATGTACAGACAATGCCGGATGAGTCTTCGGATATGCCACGTACAGAGGCCGCATCTGAAGGTGGGCAACGGACACCGGTCGAA
GTACTTCGACCAAGTACTTCTATTCGGTCAAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACATCATGCGATGCTTGCCCTACACAACGGCACGACACC
CGTCGATCGAATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACAGATTTGGGGGAGGTCAAGTCCGACTTG
AGTGAAATGAAACTCATGCTTCAACGGTTGTGTCAGATCGATAGGCGAGAGGTGAATATTGGTGCCTCTCTGTCGGATACAGTCCACGTGTCACATCCATTGGTA
TCCAATGTTATCCTCGAGCATGATGGGGATGCTGATGACAATCAACCTGGAGGTTCCTATGCTGGAAAGGAGGACGATGTCGTTCCTGTAGAAGCGTCGTTGCAT
GAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAACCCTGCGTGTATTGTCGATTCGGTGGAGTTGGACGTT
GCAGTGGTGACACCCATTGTTTCGACAGTGATGGTGGAACTCGAAATGGCACCACCAATAGTACAAGATCCACAATCAGAGACGACGTCCGATCCAACCTTCGAG
CCTTCTGCCTCAACCAACATTGATGGTCCGTGTGGCATGATCCATGGGTCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCGGATACAACCCCCACT
ACTCAACCTATTCCCACACCTACACCAGCATATATGACTCCCACCCCTCAACCTATTCCCACCCTTACACCAGTTGAAAACACCACCAGAAGGACCGAGCGTAAG
TGGACAGAAACGAAACCATTCAGTCCGGAGGACACGCATCGGCAGAAGAAGAAGCAGAAGATGGTGGATGTGGATCTCGTACCTGCCAGCCAGGACCCTGGGAAT
GACCAAACAAACGCGGCCGTCTACAACTTGAATGTGCATAGTGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAGAAGAGGTCGAGACCGCTCGGTA
GTCGGCCGCGCCTGTGCGTTCATAAGTTTTCTATCCTGGACCACTACAAATGGCAAGTTATTGCCGCTGCAGGTGGGTCCCTACGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGACCAATGCTTGGGACGAGTATAAGGAATGCATGGATGTCGTGCTGGGTCAGGTGGAATATTTCATTCCATCCTGGGTGGACGTCGACGTAGTG
TACAGCCGGTTCTGTATCAAGGATCACTGGACATCGGCCGTTGAGTCATACAATCTCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGTCGATTGCAAGCT
GAAGAAGACTCCGTGGCGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGGATGGAGATTGGTAAAAGTTTATTCATAAGATTTGGTGGTACGTGGGAAGAC
AACACAAATTCGTATGTCGACGGTGAGCTGAAAGAAATAATTGTCCCACTTACAGTAACGTATAAAGAACTGAAAAATCGGTTGTACAGACTGATGAAAGTCGAC
CAGAATGGGTACGATCTGGTAATTAGGATACCGTACCACTTGGCATGCGATTCACCGCCAATGTTCGTAACGGATGACGATGACCTCCGATTTGCATTAGTACAA
GAACAGGTTTCTAAAGTCCTACTGTTTGTATCGACCGTCCCTTGCGAAAGCATTGACATACAGTCATCTAGATTAAACCAAGATGGAGAGCAATCCATTCCTAAT
GGAGGATCTGTACCGGAAGAAGGTTTCACGGGGGTAGATGAGTGGGGGTTGAACGACGTATACATGCAAGACATGTACAGTTCAGACTGTATATACACTCAAGAC
TCGGAATTGGTCGCCCCCGCGACACGCATGCCTCCCGTAATGCACGTCTCATCAGATGAGATAAATAATACAGCACAAGTACAACATGCAAGTCCTCGTACGGTA
CATGTATTCCCATTTGATGAATTAAATAATACATCACAAGTACAACATGCAAGTCCTTTGACAGAACATGTTTTCCCATCAGTAGAACCCTCATCCAGTGACCCT
GGTAATGTTGGTGATCACCAAATACCAGTACCTCAATCCCCAAGCACCGTGACTATAGAACAGGCCACAAGGTACAACCTTTTACCAGGCGGATCGGAACTGCAC
GTGGGGAAGATATTCGTTTCAAAGCAAGACCTGCGTATGGTGCTGTCAAATACAGCGATGCGATCTAATCGTGAATACAAGGTCAGTAGGTCAACGAAATCAAAG
TTTGCTGTTCGCTGCATCGACAATACATGCAATTGGAGGGTTGCGGCCCACTTCGTCGGCGAGCAGCTGGTCGTAGCGAATCTTATTAAGGATAGAGTTGCTGGA
ACCGGTCGTATTTACAAGATAAAACATATTAAGGAGGATGTCCGTAAAGAGTTCGGGGTGAACATAAGTTATGACAAGGTCCATCGAGCACGAGAGTTAGCATAC
GCTATTGTTAGGGGCAGATCGGAGGACTCCTACATGCATCTCCATGCGTACGGTGAAGCGATAAAAATAGAGAATCCAGGGAACAACATAAAGTCTCGATTCAAG
AATGCTGCATTTTACAAATTGTATCAGGATGCAGCGTATGCGTATCGGAAGTCACAGTTCACGTACTACTGGAACCAGATACTATCGATTGGATCGGGTTCACTT
GCCAAATACTTACAAGAAATTGGGATAGAACGGTGGGCCCGATGCTACCAAGTTGGTAGAAGATATGAAAACATGACGACAAACAGCACTGAGTCGGTAAATGTC
CTCCTTCAAGAGGCTAGAGAGTTACCTATCACTAAGATTGTCGAGTTCATCCGCGAATTGCTACAAAGATGGTTCCACGAAAGAAGGACTCACTGGTCCACCCAA
AACACCTCTCATTCAGACTATGCAGAAGAGCGACTTGCAGTACAGTTTGAGAAGTCTCGTCGCTACACAGTCAAACCAGTCGACTGGTGCATGTTTCATGTTGAG
GACAGTGGCCTGGACGGGACGGTTGATTTGAATGCCCGTACATGTACATGCATGAAGTTCCAGTACATGGGCATTCCATGTTCGCACGCAATTGCAGCAGAGAGA
CACAAGAATATAAATTGCCACACGTTGATCGATCCATGCTATAGTGTGGACGCCCTAGTTGGTGCCTACGCCGAACCAATCTTACCGGTAGGCCACATGTCGGAA
TGGAAAAGGCCAGCTGATTACCAGCCTATTCCCATCCAACCACCATGA
Protein sequenceShow/hide protein sequence
MFGGKNTWDTWRTFIGVQGIKYPRSTSNLSRRLGSPIDKAKLFTLEFRSSRGPSSSLVRLTALCTMDHQLRIKENDRFSAQATSMSHLSNVNRLIKDKLTVDQLD
MFRRRTIFGRFVDLEMMFCSGVIHHFLSREVAGSSDDSMSGPEKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYMELVMMGKSKSKSKVD
IDLYNQVDDLDYFNHLDWGSDVLSMTVNGLKRAMNGKVALYKNKVRTNKKYLVKYSLPRFPLAFQVWIYKVVPSLITPGVNRLSETAIPRIIRYSCNRVVGTKDL
GEEVLGSAVLAISYPLVETELDKDYQRCPLDEREVVDLNAPGCSTSDSDDGHNPSPITDNLGAKDDLPLDDAHSLETNVQTMPDESSDMPRTEAASEGGQRTPVE
VLRPSTSIRSNVGQSTRQSPRATSCDACPTQRHDTRRSNDRFGAMERRLDLLASDMAEVKTDLGEVKSDLSEMKLMLQRLCQIDRREVNIGASLSDTVHVSHPLV
SNVILEHDGDADDNQPGGSYAGKEDDVVPVEASLHEKATDGVEMTIPPSDLGDAELANPACIVDSVELDVAVVTPIVSTVMVELEMAPPIVQDPQSETTSDPTFE
PSASTNIDGPCGMIHGSRQAEHIELALTPADTTPTTQPIPTPTPAYMTPTPQPIPTLTPVENTTRRTERKWTETKPFSPEDTHRQKKKQKMVDVDLVPASQDPGN
DQTNAAVYNLNVHSGYSRRFFINILNPRRGRDRSVVGRACAFISFLSWTTTNGKLLPLQVGPYARIKGKVVQDTTNAWDEYKECMDVVLGQVEYFIPSWVDVDVV
YSRFCIKDHWTSAVESYNLIAFVRMWADGYGRLQAEEDSVACIPSYDRHEAERMEIGKSLFIRFGGTWEDNTNSYVDGELKEIIVPLTVTYKELKNRLYRLMKVD
QNGYDLVIRIPYHLACDSPPMFVTDDDDLRFALVQEQVSKVLLFVSTVPCESIDIQSSRLNQDGEQSIPNGGSVPEEGFTGVDEWGLNDVYMQDMYSSDCIYTQD
SELVAPATRMPPVMHVSSDEINNTAQVQHASPRTVHVFPFDELNNTSQVQHASPLTEHVFPSVEPSSSDPGNVGDHQIPVPQSPSTVTIEQATRYNLLPGGSELH
VGKIFVSKQDLRMVLSNTAMRSNREYKVSRSTKSKFAVRCIDNTCNWRVAAHFVGEQLVVANLIKDRVAGTGRIYKIKHIKEDVRKEFGVNISYDKVHRARELAY
AIVRGRSEDSYMHLHAYGEAIKIENPGNNIKSRFKNAAFYKLYQDAAYAYRKSQFTYYWNQILSIGSGSLAKYLQEIGIERWARCYQVGRRYENMTTNSTESVNV
LLQEARELPITKIVEFIRELLQRWFHERRTHWSTQNTSHSDYAEERLAVQFEKSRRYTVKPVDWCMFHVEDSGLDGTVDLNARTCTCMKFQYMGIPCSHAIAAER
HKNINCHTLIDPCYSVDALVGAYAEPILPVGHMSEWKRPADYQPIPIQPP