; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g21460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g21460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr3:14663793..14666294
RNA-Seq ExpressionMoc03g21460
SyntenyMoc03g21460
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153251.1 uncharacterized protein LOC111020787 [Momordica charantia]1.5e-13252.34Show/hide
Query:  MKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDV--------------------
        MK+NFE++VKKSTK+L+TVGCTE GCKW LR++ I+G  +F+IS F +VH C REV+ HDHRQARS VVGQ++KS  EDV                    
Subjt:  MKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDV--------------------

Query:  ----------------SRCYRPKDIVN--DMRKNYGVNIRYEKAWRARERALELLMG----SPKKSYTL------LRKYGEALKSVNPGTMNLNDKFKIR
                        S C RP  +++   ++  Y   I    +     +   L  G       +S+T       L   G+ +   +  T NL D+FK  
Subjt:  ----------------SRCYRPKDIVN--DMRKNYGVNIRYEKAWRARERALELLMG----SPKKSYTL------LRKYGEALKSVNPGTMNLNDKFKIR

Query:  SEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYA
         + ++ +++LAAKA +KS FRYY++QLAGF E+++YLE +GF+KW+RA+QPGLRY+QMT+NIAESMN VLVHAR LPVTALLEH RALLQRW+YERRTY 
Subjt:  SEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYA

Query:  STRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPI
        S+R +ILTDY E  +++A   +R H+I PID++E EV DG  ++RVNLN+++C CK+FDYFQ+PCSHA+A A +R V+ YTLCSP Y L+ L+NAYA+ +
Subjt:  STRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPI

Query:  YPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTE
        YPLGDEEDW LPDDFV+  +E P++V R+GRRQTVRIPSAGE +Q+HKC +CG  GHNRKTCRQPL+T +
Subjt:  YPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTE

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]5.9e-21449.7Show/hide
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDQFKILIHYVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD+ ITY +L+SA+F +TRI+PD F I++  +YKF  QY VP +YIFDD SL F+L GPPHPS+VPLYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDQFKILIHYVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI

Query:  HGSGSSSMNRNI-PEAEAFQSFPHQLGQTVPYYAPSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNFGDDETNYCGQWDD-NDENDVEYEYEA----EDD
          SGS+S +  + P+ E F SFP Q+ Q VP  AP     S++    P   V  MT LTDNV+PCN GDDE  + GQWDD  D+ D EY        +DD
Subjt:  HGSGSSSMNRNI-PEAEAFQSFPHQLGQTVPYYAPSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNFGDDETNYCGQWDD-NDENDVEYEYEA----EDD

Query:  DNQDTEFEDDVDFENEEEVNPVDVAGPS-SDPSTEVHVVSTNALCAT-DQASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRV
        D Q+ E EDD      E   PV    PS   P  EV  VS NA CAT +    S E + T        +DIA+GS FRSK++L+F L+V+A++ NFE++V
Subjt:  DNQDTEFEDDVDFENEEEVNPVDVAGPS-SDPSTEVHVVSTNALCAT-DQASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRV

Query:  KKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRA
        KKST+SL +V C E+GC+W+LR+RKIKGS TFLISTF E H   RE ++HDH+QA S VVGQ+IK+  ED+SR YRP+DI+ DMR+NYGVN RYEK WRA
Subjt:  KKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRA

Query:  RERALELLMGSPKKSYTLLRKYGEALKSVNPGTM--------------------------------------NLNDKF----------------------
        RE AL LLMGSPK+SYT L KYG ALK+ N GT+                                      +L  K+                      
Subjt:  RERALELLMGSPKKSYTLLRKYGEALKSVNPGTM--------------------------------------NLNDKF----------------------

Query:  ---------------------------------------------------------------KIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELR
                                                                       K R+ G+  ++  AAKAFK S FRYYW QLAGF  + 
Subjt:  ---------------------------------------------------------------KIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELR

Query:  QYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYE
        +YLE++G DKW+R YQPG+RYNQMT+N+AESMN VLVHAR LP+TAL E+CR+LLQ+W+Y+RRT  S+R + LT+YAE I+K   EQAR H +RPID +E
Subjt:  QYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYE

Query:  YEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQT
        +EVHDG +K+RVN+NSK+CTCKQF Y++IPCSHA+A A+ RN+S++TLCS +Y+++ L+ AY EP+YPLGDEEDW LP D+V  TI+PP+FV RVGR QT
Subjt:  YEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQT

Query:  VRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES
         RIPS GE +Q+HKC +CG  GHN KTCRQPL+TTE+
Subjt:  VRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES

XP_022154930.1 uncharacterized protein LOC111022077 [Momordica charantia]1.5e-14845.49Show/hide
Query:  VIPCNFGDDETNYCGQWDDNDENDVEY-----EYE----------AEDDDNQDTEFEDD--VDFENEEEVNP-----VDVAGPSSDPSTEVHVVSTNALC
        ++PCN  DD+  Y   +D+  EN+VEY     EY+           ++DD  ++EFE +   D  N++E+N       D  G   +P  EV  VS NA  
Subjt:  VIPCNFGDDETNYCGQWDDNDENDVEY-----EYE----------AEDDDNQDTEFEDD--VDFENEEEVNP-----VDVAGPSSDPSTEVHVVSTNALC

Query:  ATDQ--ASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCT
         T Q    CS + + T  +      DI VG  FRSK++L+FKL V AMK+NFE+RVKKSTK+LY VGC E GCKW L + +I+G+ +F IS + +VH+CT
Subjt:  ATDQ--ASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCT

Query:  REVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT-----------
        +EV+ HDHRQARS VVGQ++K+  EDVSR YRPKDI+ DMRK YGVNIRYEKAWRA+E AL +L+GSPK+SY  LR+Y EALK VN GT           
Subjt:  REVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYL
                      MNL DKFK  ++ ++ +++LAAKAF+KS FRYY++QLAGF ++++YLE +GF+KW+RA+QP LRY+QMT+N AES+N VL HAR L
Subjt:  --------------MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYL

Query:  PVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRN
        PVTALLE   AL+QRW+YERRTYAS+R +ILTDY E  +++A   +R ++I PID +E EVHDG  + RVNLN+++C CK+FD++++PCSHA+AA   +N
Subjt:  PVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRN

Query:  VSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCG
        V+ Y+LCSP Y L+ L+NAYAE +YPLGDEEDW LPD+FV   +EPPK V R+GRRQTVRIPSAGE +Q+ KC +CG
Subjt:  VSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCG

XP_022155156.1 uncharacterized protein LOC111022299 [Momordica charantia]3.2e-15193.59Show/hide
Query:  MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQ
        MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGF ELRQYLEELGFDKWSRAYQP LRYNQ TTNIAESMN VLVHARYLPVT LLEHC ALLQ
Subjt:  MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQ

Query:  RWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLE
        RWYYE+RTYASTRASILTDY EGIVKSAVEQARQHTIRPIDNYEYEVHDGNSK+RVNLNSKSCTCKQFDY+QIPCSHAV A MHRNVSIYTLCSPKYKLE
Subjt:  RWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLE

Query:  MLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES
         LLN YAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHK S+CGMQ HNRKTCRQPL+TTES
Subjt:  MLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES

XP_022157237.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]1.9e-15152.68Show/hide
Query:  DIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFE
        DIAVGS FRSK++L+FKL+V+A+  NFEY+VKKST  L +V CTE+GCKW+LR R+IKGS TFLISTF E HSC R  + HDHRQA S VVGQ+IKS FE
Subjt:  DIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFE

Query:  DVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT------------------------------------
        +VSR YRPKDIVNDM+KNYGVN+RYEKA RA+E AL LLMGSP++SY+ L KYGEALK+VN GT                                    
Subjt:  DVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT------------------------------------

Query:  -----------------------------------------------------------------------------------------MNLNDKFKIRS
                                                                                                 M LN+KF  R+
Subjt:  -----------------------------------------------------------------------------------------MNLNDKFKIRS

Query:  EGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYAS
        EG++ ++  AAKAFK S FRYYW QLAGF  +++YLE++GFDKW+RAYQPG+RYNQMT+N+AESMN VLVHAR LP+TA+ E+CRALLQ+W+YERRT A 
Subjt:  EGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYAS

Query:  TRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIY
        +  ++LT+YAE I+K   E+AR H +RPID +E+EVHDG SK+ VNLNSK+CTCKQFDYF+I CSHA+A A+ RN+S+++LCS +Y++E L+  YAEP+Y
Subjt:  TRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIY

Query:  PLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGE
        PLGDEEDW LPDD+V  TI+PPKFV RVGR QT RIPSAGE
Subjt:  PLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGE

TrEMBL top hitse value%identityAlignment
A0A6J1DK28 uncharacterized protein LOC1110207877.2e-13352.34Show/hide
Query:  MKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDV--------------------
        MK+NFE++VKKSTK+L+TVGCTE GCKW LR++ I+G  +F+IS F +VH C REV+ HDHRQARS VVGQ++KS  EDV                    
Subjt:  MKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDV--------------------

Query:  ----------------SRCYRPKDIVN--DMRKNYGVNIRYEKAWRARERALELLMG----SPKKSYTL------LRKYGEALKSVNPGTMNLNDKFKIR
                        S C RP  +++   ++  Y   I    +     +   L  G       +S+T       L   G+ +   +  T NL D+FK  
Subjt:  ----------------SRCYRPKDIVN--DMRKNYGVNIRYEKAWRARERALELLMG----SPKKSYTL------LRKYGEALKSVNPGTMNLNDKFKIR

Query:  SEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYA
         + ++ +++LAAKA +KS FRYY++QLAGF E+++YLE +GF+KW+RA+QPGLRY+QMT+NIAESMN VLVHAR LPVTALLEH RALLQRW+YERRTY 
Subjt:  SEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYA

Query:  STRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPI
        S+R +ILTDY E  +++A   +R H+I PID++E EV DG  ++RVNLN+++C CK+FDYFQ+PCSHA+A A +R V+ YTLCSP Y L+ L+NAYA+ +
Subjt:  STRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPI

Query:  YPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTE
        YPLGDEEDW LPDDFV+  +E P++V R+GRRQTVRIPSAGE +Q+HKC +CG  GHNRKTCRQPL+T +
Subjt:  YPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTE

A0A6J1DL12 uncharacterized protein LOC1110220777.2e-14945.49Show/hide
Query:  VIPCNFGDDETNYCGQWDDNDENDVEY-----EYE----------AEDDDNQDTEFEDD--VDFENEEEVNP-----VDVAGPSSDPSTEVHVVSTNALC
        ++PCN  DD+  Y   +D+  EN+VEY     EY+           ++DD  ++EFE +   D  N++E+N       D  G   +P  EV  VS NA  
Subjt:  VIPCNFGDDETNYCGQWDDNDENDVEY-----EYE----------AEDDDNQDTEFEDD--VDFENEEEVNP-----VDVAGPSSDPSTEVHVVSTNALC

Query:  ATDQ--ASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCT
         T Q    CS + + T  +      DI VG  FRSK++L+FKL V AMK+NFE+RVKKSTK+LY VGC E GCKW L + +I+G+ +F IS + +VH+CT
Subjt:  ATDQ--ASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCT

Query:  REVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT-----------
        +EV+ HDHRQARS VVGQ++K+  EDVSR YRPKDI+ DMRK YGVNIRYEKAWRA+E AL +L+GSPK+SY  LR+Y EALK VN GT           
Subjt:  REVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYL
                      MNL DKFK  ++ ++ +++LAAKAF+KS FRYY++QLAGF ++++YLE +GF+KW+RA+QP LRY+QMT+N AES+N VL HAR L
Subjt:  --------------MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYL

Query:  PVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRN
        PVTALLE   AL+QRW+YERRTYAS+R +ILTDY E  +++A   +R ++I PID +E EVHDG  + RVNLN+++C CK+FD++++PCSHA+AA   +N
Subjt:  PVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRN

Query:  VSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCG
        V+ Y+LCSP Y L+ L+NAYAE +YPLGDEEDW LPD+FV   +EPPK V R+GRRQTVRIPSAGE +Q+ KC +CG
Subjt:  VSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCG

A0A6J1DLB0 uncharacterized protein LOC1110219692.9e-21449.7Show/hide
Query:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDQFKILIHYVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI
        F    G WNE+  +YEGG+MGGL+VD+ ITY +L+SA+F +TRI+PD F I++  +YKF  QY VP +YIFDD SL F+L GPPHPS+VPLYVSVVPKE 
Subjt:  FVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDQFKILIHYVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEI

Query:  HGSGSSSMNRNI-PEAEAFQSFPHQLGQTVPYYAPSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNFGDDETNYCGQWDD-NDENDVEYEYEA----EDD
          SGS+S +  + P+ E F SFP Q+ Q VP  AP     S++    P   V  MT LTDNV+PCN GDDE  + GQWDD  D+ D EY        +DD
Subjt:  HGSGSSSMNRNI-PEAEAFQSFPHQLGQTVPYYAPSFPFDSTL--PGPSCFVPSMTSLTDNVIPCNFGDDETNYCGQWDD-NDENDVEYEYEA----EDD

Query:  DNQDTEFEDDVDFENEEEVNPVDVAGPS-SDPSTEVHVVSTNALCAT-DQASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRV
        D Q+ E EDD      E   PV    PS   P  EV  VS NA CAT +    S E + T        +DIA+GS FRSK++L+F L+V+A++ NFE++V
Subjt:  DNQDTEFEDDVDFENEEEVNPVDVAGPS-SDPSTEVHVVSTNALCAT-DQASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRV

Query:  KKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRA
        KKST+SL +V C E+GC+W+LR+RKIKGS TFLISTF E H   RE ++HDH+QA S VVGQ+IK+  ED+SR YRP+DI+ DMR+NYGVN RYEK WRA
Subjt:  KKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRA

Query:  RERALELLMGSPKKSYTLLRKYGEALKSVNPGTM--------------------------------------NLNDKF----------------------
        RE AL LLMGSPK+SYT L KYG ALK+ N GT+                                      +L  K+                      
Subjt:  RERALELLMGSPKKSYTLLRKYGEALKSVNPGTM--------------------------------------NLNDKF----------------------

Query:  ---------------------------------------------------------------KIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELR
                                                                       K R+ G+  ++  AAKAFK S FRYYW QLAGF  + 
Subjt:  ---------------------------------------------------------------KIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELR

Query:  QYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYE
        +YLE++G DKW+R YQPG+RYNQMT+N+AESMN VLVHAR LP+TAL E+CR+LLQ+W+Y+RRT  S+R + LT+YAE I+K   EQAR H +RPID +E
Subjt:  QYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYE

Query:  YEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQT
        +EVHDG +K+RVN+NSK+CTCKQF Y++IPCSHA+A A+ RN+S++TLCS +Y+++ L+ AY EP+YPLGDEEDW LP D+V  TI+PP+FV RVGR QT
Subjt:  YEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQT

Query:  VRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES
         RIPS GE +Q+HKC +CG  GHN KTCRQPL+TTE+
Subjt:  VRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES

A0A6J1DQV1 uncharacterized protein LOC1110222991.5e-15193.59Show/hide
Query:  MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQ
        MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGF ELRQYLEELGFDKWSRAYQP LRYNQ TTNIAESMN VLVHARYLPVT LLEHC ALLQ
Subjt:  MNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQ

Query:  RWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLE
        RWYYE+RTYASTRASILTDY EGIVKSAVEQARQHTIRPIDNYEYEVHDGNSK+RVNLNSKSCTCKQFDY+QIPCSHAV A MHRNVSIYTLCSPKYKLE
Subjt:  RWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLE

Query:  MLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES
         LLN YAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHK S+CGMQ HNRKTCRQPL+TTES
Subjt:  MLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES

A0A6J1DU12 protein FAR-RED ELONGATED HYPOCOTYL 3-like9.0e-15252.68Show/hide
Query:  DIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFE
        DIAVGS FRSK++L+FKL+V+A+  NFEY+VKKST  L +V CTE+GCKW+LR R+IKGS TFLISTF E HSC R  + HDHRQA S VVGQ+IKS FE
Subjt:  DIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSLRSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFE

Query:  DVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT------------------------------------
        +VSR YRPKDIVNDM+KNYGVN+RYEKA RA+E AL LLMGSP++SY+ L KYGEALK+VN GT                                    
Subjt:  DVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEALKSVNPGT------------------------------------

Query:  -----------------------------------------------------------------------------------------MNLNDKFKIRS
                                                                                                 M LN+KF  R+
Subjt:  -----------------------------------------------------------------------------------------MNLNDKFKIRS

Query:  EGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYAS
        EG++ ++  AAKAFK S FRYYW QLAGF  +++YLE++GFDKW+RAYQPG+RYNQMT+N+AESMN VLVHAR LP+TA+ E+CRALLQ+W+YERRT A 
Subjt:  EGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRTYAS

Query:  TRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIY
        +  ++LT+YAE I+K   E+AR H +RPID +E+EVHDG SK+ VNLNSK+CTCKQFDYF+I CSHA+A A+ RN+S+++LCS +Y++E L+  YAEP+Y
Subjt:  TRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNAYAEPIY

Query:  PLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGE
        PLGDEEDW LPDD+V  TI+PPKFV RVGR QT RIPSAGE
Subjt:  PLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64255.1 MuDR family transposase6.7e-0623.72Show/hide
Query:  AAKAFKKSTFRYYWNQL-AGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLV-----HARYLPVTALLEHCRALLQRWYYERRTYASTRA
        A    +K  F  Y N +     E R++L++   ++W+ A+  G RY  M  N      V        H     V  L +  R+         ++++ +R+
Subjt:  AAKAFKKSTFRYYWNQL-AGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLV-----HARYLPVTALLEHCRALLQRWYYERRTYASTRA

Query:  SI-LTDYAEGIVKSAVEQAR------QHTIRPIDNYEYEVHDGNSKMR--VNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNA
        S+   D     V   +E+ R       + + P+DN  ++V     K    V L+  SCTC  F  ++ PC HA+A       +        Y LE L   
Subjt:  SI-LTDYAEGIVKSAVEQAR------QHTIRPIDNYEYEVHDGNSKMR--VNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLLNA

Query:  YAEPIYPLGDEEDWP
        YA     + +   WP
Subjt:  YAEPIYPLGDEEDWP

AT1G64260.1 MuDR family transposase3.5e-0722.46Show/hide
Query:  RSEGVEWLYLLAAKAFKKSTFRYYWNQL-AGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRT
        R   +E L   A    +K  F  Y N +     E  ++L+++   KW+ A+  GLRY  +  +  E++  V     Y  V         ++  +   R +
Subjt:  RSEGVEWLYLLAAKAFKKSTFRYYWNQL-AGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRALLQRWYYERRT

Query:  YASTRASILTDYAEGIV---------KSAVEQARQHTIRPIDNYEYEVHDGNSKMR--VNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKY
        +  + +SI +    G+V         +  +  +  + I  ++   ++V + + K    V LN  +CTC++F  ++ PC HA+A      ++        Y
Subjt:  YASTRASILTDYAEGIV---------KSAVEQARQHTIRPIDNYEYEVHDGNSKMR--VNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKY

Query:  KLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPP
         +E     YA    P+ D   W  P+D    T+ PP
Subjt:  KLEMLLNAYAEPIYPLGDEEDWPLPDDFVEYTIEPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCCTATTTGTTAGTTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATGTGGACGACTCTATAACTTAT
GAGGAGCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCCGGATCAGTTCAAAATCTTGATACACTATGTATATAAGTTCAATCTGCAGTACCAGGTTCCG
AAGTATTACATCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCTGAAGTCCCATTGTATGTATCTGTCGTACCGAAGGAAATACAT
GGCAGTGGAAGCAGTTCAATGAATCGTAACATTCCAGAAGCAGAAGCATTCCAATCATTTCCCCACCAGTTAGGGCAGACTGTTCCGTATTATGCTCCATCGTTT
CCTTTTGATTCCACGCTCCCAGGCCCATCATGTTTTGTCCCATCAATGACGTCGCTGACGGACAATGTAATCCCATGTAACTTCGGTGACGATGAAACAAACTAT
TGCGGTCAATGGGACGATAACGATGAGAACGACGTGGAGTACGAGTACGAGGCCGAGGATGATGACAACCAGGATACTGAATTCGAGGATGATGTTGATTTTGAG
AACGAGGAGGAAGTAAACCCAGTCGATGTAGCCGGTCCATCATCGGACCCCTCGACCGAAGTGCACGTGGTCAGTACGAATGCACTGTGCGCAACCGATCAAGCT
TCTTGCTCAAGGGAAATTGTTAGGACAGGTGATGAAGTTTGTTCGTCAACGGAGGACATTGCGGTAGGGAGTACTTTTCGATCGAAAGAAGATTTGCAGTTCAAA
CTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCGTGAAGAAGTCGACAAAAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGCCTA
CGTTCAAGGAAAATTAAAGGTTCATATACTTTTCTTATCTCTACATTCTATGAGGTTCACAGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGA
AGTCGTGTGGTGGGTCAGATTATAAAGTCCACATTTGAGGATGTAAGTCGATGTTATAGACCGAAGGATATTGTTAATGACATGCGGAAAAATTACGGTGTTAAC
ATTCGATATGAAAAGGCGTGGCGTGCGAGAGAGAGGGCTTTGGAACTACTAATGGGATCGCCGAAGAAGTCGTACACTCTTTTGCGTAAATACGGTGAGGCGTTG
AAATCGGTGAACCCGGGCACGATGAACTTGAATGATAAGTTCAAGATTCGGAGCGAAGGCGTGGAATGGCTATACCTCTTAGCAGCTAAGGCATTCAAGAAGTCT
ACCTTCAGGTATTATTGGAATCAGCTTGCGGGGTTCGCAGAACTGCGACAGTACTTGGAGGAACTCGGGTTCGATAAATGGTCACGCGCATATCAACCTGGATTG
AGGTACAATCAGATGACAACTAACATTGCAGAGTCCATGAATGTAGTTCTAGTTCACGCACGATATTTGCCAGTCACTGCACTATTAGAACATTGTAGGGCTCTT
CTGCAACGATGGTATTACGAACGACGGACGTACGCATCGACCAGAGCATCCATTCTAACTGATTATGCTGAGGGGATTGTTAAGAGTGCAGTGGAGCAGGCCCGA
CAACATACGATTAGACCGATTGATAATTACGAGTACGAGGTACACGATGGTAACAGCAAGATGCGTGTCAACCTAAACAGTAAGAGTTGTACGTGTAAGCAGTTT
GACTACTTCCAGATCCCGTGCTCCCATGCTGTCGCTGCCGCCATGCATCGTAATGTTAGTATATACACGTTGTGTTCGCCCAAGTATAAATTAGAAATGCTTCTT
AACGCGTACGCCGAACCAATCTACCCATTGGGTGATGAGGAGGACTGGCCTTTACCAGACGACTTTGTTGAGTACACCATCGAGCCTCCGAAGTTCGTTGCAAGA
GTTGGTAGACGACAAACAGTACGAATACCATCAGCAGGCGAGCCTCAACAGATACATAAATGCAGCAAGTGTGGAATGCAAGGTCACAATAGGAAAACTTGCCGC
CAACCATTACAAACAACTGAGTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGCCTATTTGTTAGTTATGGTGGTAGTTGGAATGAGTCACAATTTCTATATGAAGGTGGAATTATGGGAGGTTTGGATGTGGACGACTCTATAACTTAT
GAGGAGCTCCTTAGTGCTATGTTCAGCCTTACCCGAATAGATCCGGATCAGTTCAAAATCTTGATACACTATGTATATAAGTTCAATCTGCAGTACCAGGTTCCG
AAGTATTACATCTTTGATGACCATAGCCTTAGATTTTTTTTAAGAGGCCCTCCACATCCCTCTGAAGTCCCATTGTATGTATCTGTCGTACCGAAGGAAATACAT
GGCAGTGGAAGCAGTTCAATGAATCGTAACATTCCAGAAGCAGAAGCATTCCAATCATTTCCCCACCAGTTAGGGCAGACTGTTCCGTATTATGCTCCATCGTTT
CCTTTTGATTCCACGCTCCCAGGCCCATCATGTTTTGTCCCATCAATGACGTCGCTGACGGACAATGTAATCCCATGTAACTTCGGTGACGATGAAACAAACTAT
TGCGGTCAATGGGACGATAACGATGAGAACGACGTGGAGTACGAGTACGAGGCCGAGGATGATGACAACCAGGATACTGAATTCGAGGATGATGTTGATTTTGAG
AACGAGGAGGAAGTAAACCCAGTCGATGTAGCCGGTCCATCATCGGACCCCTCGACCGAAGTGCACGTGGTCAGTACGAATGCACTGTGCGCAACCGATCAAGCT
TCTTGCTCAAGGGAAATTGTTAGGACAGGTGATGAAGTTTGTTCGTCAACGGAGGACATTGCGGTAGGGAGTACTTTTCGATCGAAAGAAGATTTGCAGTTCAAA
CTCTCGGTGTACGCAATGAAGATGAATTTTGAATATCGCGTGAAGAAGTCGACAAAAAGTTTGTACACTGTCGGATGCACCGAGGATGGGTGCAAATGGAGCCTA
CGTTCAAGGAAAATTAAAGGTTCATATACTTTTCTTATCTCTACATTCTATGAGGTTCACAGTTGCACTCGTGAGGTAATGAAACATGACCACCGGCAAGCTCGA
AGTCGTGTGGTGGGTCAGATTATAAAGTCCACATTTGAGGATGTAAGTCGATGTTATAGACCGAAGGATATTGTTAATGACATGCGGAAAAATTACGGTGTTAAC
ATTCGATATGAAAAGGCGTGGCGTGCGAGAGAGAGGGCTTTGGAACTACTAATGGGATCGCCGAAGAAGTCGTACACTCTTTTGCGTAAATACGGTGAGGCGTTG
AAATCGGTGAACCCGGGCACGATGAACTTGAATGATAAGTTCAAGATTCGGAGCGAAGGCGTGGAATGGCTATACCTCTTAGCAGCTAAGGCATTCAAGAAGTCT
ACCTTCAGGTATTATTGGAATCAGCTTGCGGGGTTCGCAGAACTGCGACAGTACTTGGAGGAACTCGGGTTCGATAAATGGTCACGCGCATATCAACCTGGATTG
AGGTACAATCAGATGACAACTAACATTGCAGAGTCCATGAATGTAGTTCTAGTTCACGCACGATATTTGCCAGTCACTGCACTATTAGAACATTGTAGGGCTCTT
CTGCAACGATGGTATTACGAACGACGGACGTACGCATCGACCAGAGCATCCATTCTAACTGATTATGCTGAGGGGATTGTTAAGAGTGCAGTGGAGCAGGCCCGA
CAACATACGATTAGACCGATTGATAATTACGAGTACGAGGTACACGATGGTAACAGCAAGATGCGTGTCAACCTAAACAGTAAGAGTTGTACGTGTAAGCAGTTT
GACTACTTCCAGATCCCGTGCTCCCATGCTGTCGCTGCCGCCATGCATCGTAATGTTAGTATATACACGTTGTGTTCGCCCAAGTATAAATTAGAAATGCTTCTT
AACGCGTACGCCGAACCAATCTACCCATTGGGTGATGAGGAGGACTGGCCTTTACCAGACGACTTTGTTGAGTACACCATCGAGCCTCCGAAGTTCGTTGCAAGA
GTTGGTAGACGACAAACAGTACGAATACCATCAGCAGGCGAGCCTCAACAGATACATAAATGCAGCAAGTGTGGAATGCAAGGTCACAATAGGAAAACTTGCCGC
CAACCATTACAAACAACTGAGTCTTAG
Protein sequenceShow/hide protein sequence
MSRLFVSYGGSWNESQFLYEGGIMGGLDVDDSITYEELLSAMFSLTRIDPDQFKILIHYVYKFNLQYQVPKYYIFDDHSLRFFLRGPPHPSEVPLYVSVVPKEIH
GSGSSSMNRNIPEAEAFQSFPHQLGQTVPYYAPSFPFDSTLPGPSCFVPSMTSLTDNVIPCNFGDDETNYCGQWDDNDENDVEYEYEAEDDDNQDTEFEDDVDFE
NEEEVNPVDVAGPSSDPSTEVHVVSTNALCATDQASCSREIVRTGDEVCSSTEDIAVGSTFRSKEDLQFKLSVYAMKMNFEYRVKKSTKSLYTVGCTEDGCKWSL
RSRKIKGSYTFLISTFYEVHSCTREVMKHDHRQARSRVVGQIIKSTFEDVSRCYRPKDIVNDMRKNYGVNIRYEKAWRARERALELLMGSPKKSYTLLRKYGEAL
KSVNPGTMNLNDKFKIRSEGVEWLYLLAAKAFKKSTFRYYWNQLAGFAELRQYLEELGFDKWSRAYQPGLRYNQMTTNIAESMNVVLVHARYLPVTALLEHCRAL
LQRWYYERRTYASTRASILTDYAEGIVKSAVEQARQHTIRPIDNYEYEVHDGNSKMRVNLNSKSCTCKQFDYFQIPCSHAVAAAMHRNVSIYTLCSPKYKLEMLL
NAYAEPIYPLGDEEDWPLPDDFVEYTIEPPKFVARVGRRQTVRIPSAGEPQQIHKCSKCGMQGHNRKTCRQPLQTTES