CuGenDBv2

Moc10g12380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g12380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: 17.4 kDa class I heat shock protein
Genome location: chr10:9396960..9416921
RNA-Seq Expression: Moc10g12380
Synteny: Moc10g12380
Gene Ontology terms:
  GO:0006313 - transposition, DNA-mediated (biological process)
  GO:0003677 - DNA binding (molecular function)
  GO:0004803 - transposase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002068 - Alpha crystallin/Hsp20 domain
  IPR004332 - Transposase, MuDR, plant
  IPR006564 - Zinc finger, PMZ-type
  IPR007527 - Zinc finger, SWIM-type
  IPR008978 - HSP20-like chaperone


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] | e-value: 8.5e-185 | % identity: 64.39
Query:  EGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE VTIHNTMAEYPVD VHE +SNR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN----------
        STPK+YL+RC+DPTCTWRLR T+IRDCNLFKIKKYIAVHS CNGA+MKQDH QAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVN          
Subjt:  STPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVRE
                                                                 MNLLAKFKTS LE LFFKAAKA RESYFNENWVQLCAHPGVRE
Subjt:  ---------------------------------------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVRE

Query:  YVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
        Y+EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  YVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAA SRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNS

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia] | e-value: 4.6e-138 | % identity: 59.45
Query:  MAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPKIYLLRCVDPTCTWRLRATEIRDCNLFK
        MAEYPVDAVHE ++NR+TGQSE DRLQAMVQSAGT+DVKEGDVFDSKKELVMKMH  ALRKNFQFRVKKSTP++YLLRC+DPTCTWRLRAT+IRDCNLFK
Subjt:  MAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPKIYLLRCVDPTCTWRLRATEIRDCNLFK

Query:  IKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN-----------------------------------------
        IKKYIAVHS CNGA+MKQDH QAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVN                                         
Subjt:  IKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  MNLLAKFKT ELE LFFKAAKA RESYFNENWVQLCAHP VREY+EAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        NALFRHARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAEEMI+EASDNARRHIVMNIDQFNFE+
Subjt:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | e-value: 1.3e-148 | % identity: 50
Query:  EEGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      AV +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN---------
        KSTP++Y+L CVD +CTWRLRAT++RDCNLFKIKKY ++H+ CNG ++KQDH QAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN         
Subjt:  KSTPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------MNLLAKFK--TSELEELFFKAAKACRESYFNENWVQLCAHPG
                                                                  MNLLAKFK     LEELF KAAKA RESYFN  W QL A+PG
Subjt:  ----------------------------------------------------------MNLLAKFK--TSELEELFFKAAKACRESYFNENWVQLCAHPG

Query:  VREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
        VREY++ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  VREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e-value: 1.1e-152 | % identity: 82.77
Query:  SSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYGL
        SSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGER DNPPEGWVTLYFKMFEYGL
Subjt:  SSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYGL

Query:  RLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRAGDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA---------------------
        RLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                     
Subjt:  RLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRAGDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA---------------------

Query:  ----------------------CVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKS
                               VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP VRPIESSRPNSELAMVCGFAS VKRKS
Subjt:  ----------------------CVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKS

Query:  KGRAHALEAAQSSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        KGRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  KGRAHALEAAQSSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e-value: 1.1e-144 | % identity: 61.27
Query:  FYMCARKGACVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP+VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +
Subjt:  FYMCARKGACVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSS

Query:  KPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRKKKKKTTSPLEVGARGVLPASFANRVDDPEARIG
        +P TPTV         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR E PL+RR+KKKKT+S  E GARG LP S A+ VDDPEAR+ 
Subjt:  KPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRKKKKKTTSPLEVGARGVLPASFANRVDDPEARIG

Query:  GT---------------------------LDRCLKRASKFVSDQGSVRQRTIDYAAEAFVASIQSALAVKAELDGREVLAAKEKEEFSAALEAASSTMKD
        GT                           LDR L+RASKFVSD GSV QRTID  AEAF+ASI  A+ VKAELDGRE LAAKE+E   AALEAA +T+K 
Subjt:  GT---------------------------LDRCLKRASKFVSDQGSVRQRTIDYAAEAFVASIQSALAVKAELDGREVLAAKEKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILMAEVETKAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELKHATAELETAKERLSNGILLEESFRQHPDFD
        ELLKA  EV+IL AEV+ K +LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFD
Subjt:  ELLKAHSEVEILMAEVETKAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELKHATAELETAKERLSNGILLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWAPGPGGTPGPQALVDKYVRDLDSDNSDLKEDQGRAARSISLGSALHSI
        GFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WA GP GTP PQ+LVDKYVR+LDSD SD++E+   +     +G+    +
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWAPGPGGTPGPQALVDKYVRDLDSDNSDLKEDQGRAARSISLGSALHSI

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like | e-value: 4.1e-185 | % identity: 64.39
Query:  EGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE VTIHNTMAEYPVD VHE +SNR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN----------
        STPK+YL+RC+DPTCTWRLR T+IRDCNLFKIKKYIAVHS CNGA+MKQDH QAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVN          
Subjt:  STPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVRE
                                                                 MNLLAKFKTS LE LFFKAAKA RESYFNENWVQLCAHPGVRE
Subjt:  ---------------------------------------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVRE

Query:  YVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
        Y+EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  YVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAA SRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNS

A0A6J1CNJ2 uncharacterized protein LOC111012733 | e-value: 2.2e-138 | % identity: 59.45
Query:  MAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPKIYLLRCVDPTCTWRLRATEIRDCNLFK
        MAEYPVDAVHE ++NR+TGQSE DRLQAMVQSAGT+DVKEGDVFDSKKELVMKMH  ALRKNFQFRVKKSTP++YLLRC+DPTCTWRLRAT+IRDCNLFK
Subjt:  MAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPKIYLLRCVDPTCTWRLRATEIRDCNLFK

Query:  IKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN-----------------------------------------
        IKKYIAVHS CNGA+MKQDH QAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVN                                         
Subjt:  IKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESV
                                  MNLLAKFKT ELE LFFKAAKA RESYFNENWVQLCAHP VREY+EAIGKERWARCFQTKLRYSQMTTNIAESV
Subjt:  --------------------------MNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        NALFRHARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAEEMI+EASDNARRHIVMNIDQFNFE+
Subjt:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

A0A6J1DJT1 uncharacterized protein LOC111020715 | e-value: 6.2e-149 | % identity: 50
Query:  EEGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      AV +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN---------
        KSTP++Y+L CVD +CTWRLRAT++RDCNLFKIKKY ++H+ CNG ++KQDH QAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN         
Subjt:  KSTPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVN---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------MNLLAKFK--TSELEELFFKAAKACRESYFNENWVQLCAHPG
                                                                  MNLLAKFK     LEELF KAAKA RESYFN  W QL A+PG
Subjt:  ----------------------------------------------------------MNLLAKFK--TSELEELFFKAAKACRESYFNENWVQLCAHPG

Query:  VREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
        VREY++ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  VREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

A0A6J1DXS5 uncharacterized protein LOC111025502 | e-value: 5.5e-153 | % identity: 82.77
Query:  SSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYGL
        SSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGER DNPPEGWVTLYFKMFEYGL
Subjt:  SSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFKMFEYGL

Query:  RLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRAGDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA---------------------
        RLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                     
Subjt:  RLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRAGDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA---------------------

Query:  ----------------------CVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKS
                               VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP VRPIESSRPNSELAMVCGFAS VKRKS
Subjt:  ----------------------CVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKS

Query:  KGRAHALEAAQSSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        KGRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  KGRAHALEAAQSSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e-value: 5.5e-145 | % identity: 61.27
Query:  FYMCARKGACVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP+VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +
Subjt:  FYMCARKGACVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSS

Query:  KPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRKKKKKTTSPLEVGARGVLPASFANRVDDPEARIG
        +P TPTV         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR E PL+RR+KKKKT+S  E GARG LP S A+ VDDPEAR+ 
Subjt:  KPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRKKKKKTTSPLEVGARGVLPASFANRVDDPEARIG

Query:  GT---------------------------LDRCLKRASKFVSDQGSVRQRTIDYAAEAFVASIQSALAVKAELDGREVLAAKEKEEFSAALEAASSTMKD
        GT                           LDR L+RASKFVSD GSV QRTID  AEAF+ASI  A+ VKAELDGRE LAAKE+E   AALEAA +T+K 
Subjt:  GT---------------------------LDRCLKRASKFVSDQGSVRQRTIDYAAEAFVASIQSALAVKAELDGREVLAAKEKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILMAEVETKAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELKHATAELETAKERLSNGILLEESFRQHPDFD
        ELLKA  EV+IL AEV+ K +LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFD
Subjt:  ELLKAHSEVEILMAEVETKAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELKHATAELETAKERLSNGILLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWAPGPGGTPGPQALVDKYVRDLDSDNSDLKEDQGRAARSISLGSALHSI
        GFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WA GP GTP PQ+LVDKYVR+LDSD SD++E+   +     +G+    +
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWAPGPGGTPGPQALVDKYVRDLDSDNSDLKEDQGRAARSISLGSALHSI

SwissProt top hits (columns: e-value, % identity, alignment)
P27396 17.8 kDa class I heat shock protein | e-value: 2.3e-07 | % identity: 29.66
Query:  PLLFG----KFFDPADAFPLWEFESD--LLLSNLRNSGKST-------IDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTII
        P  FG      FDP  +  +W+   D  L+ S+    GK T       IDW +  Q +V +A+LP   +  +++ +E GKVL+ISG+  ++++ ++    
Subjt:  PLLFG----KFFDPADAFPLWEFESD--LLLSNLRNSGKST-------IDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTII

Query:  DWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLE
         W  V      ++RR  LP++A    ++A + N  V+ V +PK+E
Subjt:  DWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLE

P27397 18.0 kDa class I heat shock protein | e-value: 8.7e-07 | % identity: 30.93
Query:  IDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLE
        IDW +  Q +V +A+LP   +  +++ +E GKVL+ISG+  ++++ ++     W  V +    ++RR  LP++A+   ++A ++N  V+ V +PK+E
Subjt:  IDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLE

Q05832 18.3 kDa class I heat shock protein | e-value: 8.7e-07 | % identity: 25.9
Query:  SLFSPLLFGKFFDPADAFP--LWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTIIDWRSVNW
        ++F P    + +DP    P  L         +       + IDW +  + +V +A+LP   +  +++ +E+G VL ISGQ  ++++ ++ T   W  V  
Subjt:  SLFSPLLFGKFFDPADAFP--LWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTIIDWRSVNW

Query:  WEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLEA
            ++R+  LP++A   +++A ++N  V+ V +PK EA
Subjt:  WEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLEA

Q6AUW3 22.3 kDa class VI heat shock protein | e-value: 3.6e-37 | % identity: 42.7
Query:  DHTPSKWSVSLGEEAFRRFL-------GQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAV
        D    +W +SL E  F  FL              + VFG+GSLFSP LFGKFFDPADAFPLWEFE ++LL+ LR   ++T+DW + D EY L+A++P   
Subjt:  DHTPSKWSVSLGEEAFRRFL-------GQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAV

Query:  RNTMQIFIENG-KVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKN-DMVIEVKIPKL-EANQSA
        +  +++  ++  +V+++SG  +           DWR+  WWEHG+VRR+ELP+DADWR++EA   + + ++E+K+PK  +A+Q+A
Subjt:  RNTMQIFIENG-KVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKN-DMVIEVKIPKL-EANQSA

Q9FIT9 21.7 kDa class VI heat shock protein | e-value: 3.4e-51 | % identity: 51.3
Query:  SKQLEVHREDHTPSKWSVSLGEEAFRRFLGQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPE
        S +LE+H +D TP KWSV LG++ FRRFL       + VFG+GSLFSP LFGK+FDP+DAFPLWEFE+++LL++LR+ G+  +DW Q DQ YVL++++P 
Subjt:  SKQLEVHREDHTPSKWSVSLGEEAFRRFLGQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPE

Query:  AVRNTMQIFIE-NGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKND---MVIEVKIPKLEANQSAESKNK
          +N +Q++++ NG+V+EISGQ      +++ T  DWRS  WWEHGYVRRLELP DAD +  EA + N+     +E++IPK+       SKNK
Subjt:  AVRNTMQIFIE-NGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKND---MVIEVKIPKLEANQSAESKNK

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G49920.1 MuDR family transposase | e-value: 6.2e-08 | % identity: 36.78
Query:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN-----IDVQPPK
        +G V L   TCTC EF   K PC HA+A      INP    D+ YTV  +   Y+    PV   S W  + G        I+  PPK
Subjt:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN-----IDVQPPK

AT1G59860.1 HSP20-like chaperones superfamily protein | e-value: 6.2e-08 | % identity: 25.95
Query:  VVQKVFGDGSLFSPLLFGKF-FDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTI
        ++   FG+    +  +F  F  D  D F   +F S    S+      + +DW +  + +V +A+LP   +  +++ IE+  VL+ISG+   +++ +  T 
Subjt:  VVQKVFGDGSLFSPLLFGKF-FDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQQQQRESKTI

Query:  IDWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLEAN-QSAESKNKD
          W  V     G+ R+  LP++    +++A ++N  V+ V +PK+E N + A+ K+ D
Subjt:  IDWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLEAN-QSAESKNKD

AT1G64255.1 MuDR family transposase | e-value: 3.6e-08 | % identity: 26.11
Query:  HPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQS-TLSDYAEEMIAEAS
        +P  R++++   + RWA       RY  M  N      ALF          H     V  L D +R    + F      + SR S    D   E + +  
Subjt:  HPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQS-TLSDYAEEMIAEAS

Query:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS
        +  R       +IV  +D   F+V      G+  V L   +CTC +F  +K PC HA+A       NP    D+ YT+      YA     V   S W  
Subjt:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS

Query:  SPG
        + G
Subjt:  SPG

AT1G64260.1 MuDR family transposase | e-value: 7.8e-11 | % identity: 26.43
Query:  LAKFKTSELEELFFKAAKACRESYFNENWVQLC-AHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFY
        L  F+   LE L  +A    ++  F+     +   +P   ++++ I + +WA    + LRY      I     ALF   R  P   +   + G +   F 
Subjt:  LAKFKTSELEELFFKAAKACRESYFNENWVQLC-AHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFY

Query:  ERRT----LASSRQSTLSD---YAE---EMIAEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTL
        E R+      SS  S+L+    Y E   + + E   ++  +++  +++ +F+V + +   +  V L   TCTCR+F  +K PC HA+A      INP   
Subjt:  ERRT----LASSRQSTLSD---YAE---EMIAEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTL

Query:  CDEAYTVNSWMLAYAEPIFPVGSSSTW
         DE YTV  +   YA    PV   + W
Subjt:  CDEAYTVNSWMLAYAEPIFPVGSSSTW

AT5G54660.1 HSP20-like chaperones superfamily protein | e-value: 2.4e-52 | % identity: 51.3
Query:  SKQLEVHREDHTPSKWSVSLGEEAFRRFLGQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPE
        S +LE+H +D TP KWSV LG++ FRRFL       + VFG+GSLFSP LFGK+FDP+DAFPLWEFE+++LL++LR+ G+  +DW Q DQ YVL++++P 
Subjt:  SKQLEVHREDHTPSKWSVSLGEEAFRRFLGQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPE

Query:  AVRNTMQIFIE-NGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKND---MVIEVKIPKLEANQSAESKNK
          +N +Q++++ NG+V+EISGQ      +++ T  DWRS  WWEHGYVRRLELP DAD +  EA + N+     +E++IPK+       SKNK
Subjt:  AVRNTMQIFIE-NGKVLEISGQLKQQQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKND---MVIEVKIPKLEANQSAESKNK


Sequences
CDS sequence
ATGCACAACAGTGTATTTCAGATTGCACCTCGAACTCGGCCTCCGGACCGACCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTAT
AGGGTATTCTCTTCCCCAAACATCGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGG
GAAAATATAACCGTTGCGGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTG
GTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCAGGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGA
GCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCG
GATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAA
ATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCTGGCTCAAGTGGCTCCCAATGGGTGGGGTGTCATTTT
CGCTTTGGCCATCCTTTTTTGGCTACGAGCTGGGGATAGTGAAGAGGCTGAGCTGTTGGACGTAGACCAACTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGC
CTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTT
CCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGTAGTTCGTCCCATTGAGTCCTCAAGGCCTAACTC
CGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTACTGTGG
TAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCC
TTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAAGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTTGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGC
AAATCGGGTGGACGATCCTGAGGCCAGGATAGGCGGGACTTTGGACCGCTGCCTAAAGAGGGCGTCCAAATTTGTGAGCGACCAAGGGTCCGTTCGGCAGAGGACCATCG
ACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCAGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAAGGAGAAAGAGGAGTTCTCTGCT
GCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGATGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGA
AGAAGACAGACGCAAGGCCCAGCTCCGAGCCGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGGACGACATGCTCCAAGCACTTGAAGCGAAGGATAAAGAGCTGA
AGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGTCTCAGCAATGGAATCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTC
TCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCTGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAATGGGCGCCTGG
GCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACAACTCCGATCTCAAAGAGGACCAGGGCAGAGCTGCAAGGTCTATAA
GCCTTGGCTCTGCTCTTCATTCAATAAAGAGGCTCCCATTTGTTTTTACTTTGTCGTCGGCCACATCTTTCCTTTGCTTTTTCTTTGAACTGCAGCCAATCTATTCTTCA
CATCGCTCTTCGTACCCTCAAGTTCGGAGGCTTGACCATTTTGAACTTTCCACATCGTCCCTTTACCTTGAAGGTTTGAATTCAAAGTTCATCAGTGATTTTAGCATCGC
ACCTCGTACCCTTAGATCCATTGAAAACCCTTCTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCATTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCC
AATACGTACGTCCCAGGCGTAGTTGGGCCATTGCTCTTCTTTCTTCCAACAAGTCGAGGTTGAGGAGCAGCTCTTCCTCATTTGCCGTAGGCTCGCCCTCTTCAGGGGTT
AGGCATCTCAATAAAGGCAGGGAAAAGCCGCGTCGGTACAACGCTCCATCTCGGACCACGAACCGAGCTACTCGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTGTGG
TGAGTTGCCCCTAATGAATTCCGCAGTCGGGTCCATCCATGAGGACTCTGGAGCGCCGATCTCCATCAAATCTGGCTCTGAGATCGAGGGATTATCTAAGATCTCGACGG
GGACCGACCTGGCCAGGAGCTGTCTTTGTCTTCCTCATAAAGAACTACTCACTTTGGCAATGGCCACTTCAAAACAACTAGAAGTTCACAGAGAAGACCATACTCCATCA
AAATGGAGTGTTTCTTTGGGAGAAGAAGCTTTCAGAAGATTTCTAGGGCAAGCTAATCCGGTTGTGCAAAAGGTATTCGGCGATGGCTCGTTGTTCAGTCCATTGCTGTT
CGGAAAGTTCTTCGATCCAGCTGATGCTTTTCCGCTGTGGGAGTTTGAGTCGGATTTGTTATTGTCCAACCTCCGCAACTCAGGAAAATCAACGATCGATTGGTTTCAGA
TTGATCAAGAATATGTTCTCCAGGCAGAATTACCAGAAGCTGTGAGAAACACAATGCAGATCTTCATAGAGAATGGGAAGGTCTTAGAGATAAGTGGTCAGTTGAAGCAG
CAGCAGCAGAGGGAAAGCAAGACGATCATCGACTGGCGAAGTGTCAACTGGTGGGAGCATGGATATGTTCGGCGGCTCGAGTTGCCCGACGATGCTGATTGGCGGAGAAT
GGAGGCGCGTGTCAAAAACGACATGGTTATCGAGGTAAAAATTCCCAAGTTGGAAGCCAATCAAAGTGCTGAATCAAAGAACAAAGATTCTGAAGATGATGTCTCGTGTT
TTTATATCATTCGGTGGAGAATGAAAGATATTGAAAAGGATTACGTGGGTGGTCGTACAAAAGGATTGACTGTGGATAGTAAAATCACTTATGCTGAATTTCTAGGACAT
GTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAAGTTTGTGTAATGGAAATAACTGACGACGATGACCT
GACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAACCGTACATGCCTTCTTTCCCATATTATT
TAGGCCAACACGTGTCCAATGTTCCTATTTCCTCAGCTTGTGCCCCCCATTTGCAAAACCTTATTTCCGAGACCCTCATTTTCAAGTTCGTCAGTTCCGTCCTCGTCGTC
GAACCCCTCTTTTTCCCGCCCACCACCCCCTACTTTGGTCATATTGATGTGTCTGGCTTGTGGAATGGAAGTGAAAATATGGATGAAGATAATGATGAATCATATCGTCT
AATGACCGACACGGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCCAGTGATCGACTTGATGTGCAACATGAGCATGAAGATGTAACAATTCATAATACAA
TGGCTGAATATCCTGTAGATGCCGTCCATGAAACGTCAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCAATGAT
GTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAATTTCGAGTGAAGAAGTCTACGCCGAAAAT
ATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTGAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTCCATTCTAATT
GCAATGGTGCCCTTATGAAACAGGATCATCATCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGAC
ATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAACTTGCTGGCCAAATTTAAAACGTCCGAGTTGGAGGAATTATTTTTTAAGGCTGCGAAGGCATGTCGCGA
GTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAGGAGTGAGGGAATATGTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAA
GATACTCACAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTTAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAG
AGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATTGT
TATGAACATCGACCAGTTTAATTTTGAGGTACGCGATGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTTAAAGTCC
CGTGCTCCCATGCTATTGCTGCAGCCTGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTATGCAGAACCAATA
TTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGATAG
mRNA sequence
ATGCACAACAGTGTATTTCAGATTGCACCTCGAACTCGGCCTCCGGACCGACCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTAT
AGGGTATTCTCTTCCCCAAACATCGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGG
GAAAATATAACCGTTGCGGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTG
GTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCAGGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGA
GCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCG
GATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAA
ATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCTGGCTCAAGTGGCTCCCAATGGGTGGGGTGTCATTTT
CGCTTTGGCCATCCTTTTTTGGCTACGAGCTGGGGATAGTGAAGAGGCTGAGCTGTTGGACGTAGACCAACTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGC
CTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTT
CCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGTAGTTCGTCCCATTGAGTCCTCAAGGCCTAACTC
CGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTACTGTGG
TAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCC
TTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAAGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTTGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGC
AAATCGGGTGGACGATCCTGAGGCCAGGATAGGCGGGACTTTGGACCGCTGCCTAAAGAGGGCGTCCAAATTTGTGAGCGACCAAGGGTCCGTTCGGCAGAGGACCATCG
ACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCAGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAAGGAGAAAGAGGAGTTCTCTGCT
GCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGATGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGA
AGAAGACAGACGCAAGGCCCAGCTCCGAGCCGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGGACGACATGCTCCAAGCACTTGAAGCGAAGGATAAAGAGCTGA
AGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGTCTCAGCAATGGAATCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTC
TCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCTGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAATGGGCGCCTGG
GCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACAACTCCGATCTCAAAGAGGACCAGGGCAGAGCTGCAAGGTCTATAA
GCCTTGGCTCTGCTCTTCATTCAATAAAGAGGCTCCCATTTGTTTTTACTTTGTCGTCGGCCACATCTTTCCTTTGCTTTTTCTTTGAACTGCAGCCAATCTATTCTTCA
CATCGCTCTTCGTACCCTCAAGTTCGGAGGCTTGACCATTTTGAACTTTCCACATCGTCCCTTTACCTTGAAGGTTTGAATTCAAAGTTCATCAGTGATTTTAGCATCGC
ACCTCGTACCCTTAGATCCATTGAAAACCCTTCTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCATTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCC
AATACGTACGTCCCAGGCGTAGTTGGGCCATTGCTCTTCTTTCTTCCAACAAGTCGAGGTTGAGGAGCAGCTCTTCCTCATTTGCCGTAGGCTCGCCCTCTTCAGGGGTT
AGGCATCTCAATAAAGGCAGGGAAAAGCCGCGTCGGTACAACGCTCCATCTCGGACCACGAACCGAGCTACTCGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTGTGG
TGAGTTGCCCCTAATGAATTCCGCAGTCGGGTCCATCCATGAGGACTCTGGAGCGCCGATCTCCATCAAATCTGGCTCTGAGATCGAGGGATTATCTAAGATCTCGACGG
GGACCGACCTGGCCAGGAGCTGTCTTTGTCTTCCTCATAAAGAACTACTCACTTTGGCAATGGCCACTTCAAAACAACTAGAAGTTCACAGAGAAGACCATACTCCATCA
AAATGGAGTGTTTCTTTGGGAGAAGAAGCTTTCAGAAGATTTCTAGGGCAAGCTAATCCGGTTGTGCAAAAGGTATTCGGCGATGGCTCGTTGTTCAGTCCATTGCTGTT
CGGAAAGTTCTTCGATCCAGCTGATGCTTTTCCGCTGTGGGAGTTTGAGTCGGATTTGTTATTGTCCAACCTCCGCAACTCAGGAAAATCAACGATCGATTGGTTTCAGA
TTGATCAAGAATATGTTCTCCAGGCAGAATTACCAGAAGCTGTGAGAAACACAATGCAGATCTTCATAGAGAATGGGAAGGTCTTAGAGATAAGTGGTCAGTTGAAGCAG
CAGCAGCAGAGGGAAAGCAAGACGATCATCGACTGGCGAAGTGTCAACTGGTGGGAGCATGGATATGTTCGGCGGCTCGAGTTGCCCGACGATGCTGATTGGCGGAGAAT
GGAGGCGCGTGTCAAAAACGACATGGTTATCGAGGTAAAAATTCCCAAGTTGGAAGCCAATCAAAGTGCTGAATCAAAGAACAAAGATTCTGAAGATGATGTCTCGTGTT
TTTATATCATTCGGTGGAGAATGAAAGATATTGAAAAGGATTACGTGGGTGGTCGTACAAAAGGATTGACTGTGGATAGTAAAATCACTTATGCTGAATTTCTAGGACAT
GTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAAGTTTGTGTAATGGAAATAACTGACGACGATGACCT
GACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAACCGTACATGCCTTCTTTCCCATATTATT
TAGGCCAACACGTGTCCAATGTTCCTATTTCCTCAGCTTGTGCCCCCCATTTGCAAAACCTTATTTCCGAGACCCTCATTTTCAAGTTCGTCAGTTCCGTCCTCGTCGTC
GAACCCCTCTTTTTCCCGCCCACCACCCCCTACTTTGGTCATATTGATGTGTCTGGCTTGTGGAATGGAAGTGAAAATATGGATGAAGATAATGATGAATCATATCGTCT
AATGACCGACACGGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCCAGTGATCGACTTGATGTGCAACATGAGCATGAAGATGTAACAATTCATAATACAA
TGGCTGAATATCCTGTAGATGCCGTCCATGAAACGTCAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCAATGAT
GTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAATTTCGAGTGAAGAAGTCTACGCCGAAAAT
ATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTGAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTCCATTCTAATT
GCAATGGTGCCCTTATGAAACAGGATCATCATCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGAC
ATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAACTTGCTGGCCAAATTTAAAACGTCCGAGTTGGAGGAATTATTTTTTAAGGCTGCGAAGGCATGTCGCGA
GTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAGGAGTGAGGGAATATGTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAA
GATACTCACAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTTAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAG
AGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATTGT
TATGAACATCGACCAGTTTAATTTTGAGGTACGCGATGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTTAAAGTCC
CGTGCTCCCATGCTATTGCTGCAGCCTGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTATGCAGAACCAATA
TTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGATAG
Protein sequence
MHNSVFQIAPRTRPPDRPEYLGGPAQKGEHSDDQVSIGYSLPQTSAPSLSGPISTWQRSSFDLLWTRGDFLFVGKYNRCGRFIVGIFKYSDASDLREDPSRSLITRLEPL
VGRSLPSLSLSNVVARSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERVDNPPEGWVTLYFK
MFEYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRAGDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACVSIRPVPELTQASFDTLKYYKERF
PRGRKVGTLVTDELLLESGLLDYNPVVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALP
LGEEVREEVPLKRRKKKKKTTSPLEVGARGVLPASFANRVDDPEARIGGTLDRCLKRASKFVSDQGSVRQRTIDYAAEAFVASIQSALAVKAELDGREVLAAKEKEEFSA
ALEAASSTMKDELLKAHSEVEILMAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDKELKHATAELETAKERLSNGILLEESFRQHPDFDGFAKDF
SDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWAPGPGGTPGPQALVDKYVRDLDSDNSDLKEDQGRAARSISLGSALHSIKRLPFVFTLSSATSFLCFFFELQPIYSS
HRSSYPQVRRLDHFELSTSSLYLEGLNSKFISDFSIAPRTLRSIENPSGISRIITLQVLRIPRVREDVSFQIGQYVRPRRSWAIALLSSNKSRLRSSSSSFAVGSPSSGV
RHLNKGREKPRRYNAPSRTTNRATRLANFLRSLGSCGELPLMNSAVGSIHEDSGAPISIKSGSEIEGLSKISTGTDLARSCLCLPHKELLTLAMATSKQLEVHREDHTPS
KWSVSLGEEAFRRFLGQANPVVQKVFGDGSLFSPLLFGKFFDPADAFPLWEFESDLLLSNLRNSGKSTIDWFQIDQEYVLQAELPEAVRNTMQIFIENGKVLEISGQLKQ
QQQRESKTIIDWRSVNWWEHGYVRRLELPDDADWRRMEARVKNDMVIEVKIPKLEANQSAESKNKDSEDDVSCFYIIRWRMKDIEKDYVGGRTKGLTVDSKITYAEFLGH
VCRLSSINPLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKVHQNEPYMPSFPYYLGQHVSNVPISSACAPHLQNLISETLIFKFVSSVLVV
EPLFFPPTTPYFGHIDVSGLWNGSENMDEDNDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEDVTIHNTMAEYPVDAVHETSSNRLTGQSEADRLQAMVQSAGTND
VKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPKIYLLRCVDPTCTWRLRATEIRDCNLFKIKKYIAVHSNCNGALMKQDHHQAKSWVVGHLVQSKFTDVSRTYRPKD
IIQDIREEYGVNMNLLAKFKTSELEELFFKAAKACRESYFNENWVQLCAHPGVREYVEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQ
RWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAACSRSINPYTLCDEAYTVNSWMLAYAEPI
FPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR