; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g014810 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g014810
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionBED-type domain-containing protein
Genome locationChr05:18254098..18256085
RNA-Seq ExpressionLcy05g014810
SyntenyLcy05g014810
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5522171.1 hypothetical protein RHGRI_034377 [Rhododendron griersonianum]5.9e-18453.65Show/hide
Query:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL
        WQCNFC   KK SYTRVR HLLKL   G+GPC+ VT + +A M+KL++EAK R+++N PK+VPLPPS      S  F   G  +     +  R  ++S +
Subjt:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL

Query:  EKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKG---------------
        EK+F+    DQ+HAEIARMFYS G+PF+LARNP+YV+++  AANN LSGY+PPGYN+LRTTLLQ+EK NVERLL  IK TW +KG               
Subjt:  EKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKG---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------DVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKI
                    DV F+KN+IMNHSMRL +FN+FVPLKLLSVA  RFAS ++MLKRFKL+K+ LQTMVIS +W  Y+EDD GKA+ VKE +LD+IWWD+I
Subjt:  ------------DVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKI

Query:  DYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVP
        DYILSFT+P+YDM+R  DTDKP LHLVYDMWDTMIEKVKVAIYRHEGK  ++ S+FYEV++ IL+DRWNKNNTPLHCLAHSLN RYYS++WL E  +RVP
Subjt:  DYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVP

Query:  PHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSM
        PHKD+EV+RERMKC+KRYF ++ +R+K N+EFANFS+ A +F D DSI DRY MDPKSWW  + A  P LQ++A+KLLVQPSSSSC ERNWSTYSFV+S 
Subjt:  PHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSM

Query:  RRNKMTPQRAEDLVFIHTNLRLLSRRTPEY
        +RNKMTP+RAEDLVFIH+NLRLLSRRT +Y
Subjt:  RRNKMTPQRAEDLVFIHTNLRLLSRRTPEY

KAG5532188.1 hypothetical protein RHGRI_026721 [Rhododendron griersonianum]7.2e-18253.81Show/hide
Query:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL
        WQCNFC   KK SYTRV+ HLLKL   G+GPC KVT + +A M+KLEDEAK +++ NAPK+VPLPPS  +      F   G  +     +  R  ++S L
Subjt:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL

Query:  EKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKG---------------
        EK+++    DQ+HAEIARMFYS G+PF+LARNP+YV ++  AANN LSGY+PPGYN+LRTTLLQ+EK NVERLL  IK TW +KG               
Subjt:  EKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKG---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------DVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKI
                    DV F+KNFIMNHSMRL +FN+FVPLKLLSVA  RFAS ++MLKRFKL+K+ LQTMVIS +W  Y+EDD GKA+ VKE +LD+IWWD I
Subjt:  ------------DVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKI

Query:  DYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVP
        DYILSFT+P+YDM+R  DTDKP LHLVYDMWDTMIEKVKVAIYRHEGK  ++ S+FY+V+H IL+DRWNKN+TPLHCLAHSLN RYYS++WL E  +RVP
Subjt:  DYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVP

Query:  PHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSM
        PHKD+EV+RERMKC+K+YF ++ +R+KVN+EFANFS+ A +FAD DSI DRY MDPKSWW  + A  P LQ++A+KLLVQPSSSS C+RNWSTYSFV+S 
Subjt:  PHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSM

Query:  RRNKMTPQRAEDLVFIHTNLRLLSRRTPEY
        +RNKMTP+RAEDLVFIH+NLRLLSRRT +Y
Subjt:  RRNKMTPQRAEDLVFIHTNLRLLSRRTPEY

KAG5540579.1 hypothetical protein RHGRI_020709 [Rhododendron griersonianum]1.5e-17959.09Show/hide
Query:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL
        W+CN+C   K GSYTRV+AHLLK+ G GV  C  VT  +VA+  KL +E K + +++ PK+VPLPPS  +   +    + G        +KKR+   S  
Subjt:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL

Query:  E----KSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------
        +    K+FN+   D +H+EIARMFYS GLPF+LARNPHYV+++  AAN+++ GY+PP YN+LRT LLQRE+AN+ERLL  IK TW +K            
Subjt:  E----KSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------

Query:  ---------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDY
                 GD   +KNFIMNHSMRL +FNEFV LKLLSVAE RFAS+I+ML+RFKLIK GLQ+MVIS +W  ++E+    A  V+E +L+  WWDK+DY
Subjt:  ---------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDY

Query:  ILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPH
        +LSFT PIYDM+R  DTDKP+LHL+YDMWD MIEKVK  IY+HE KH+++ S FY+V+H+IL+DRWNKNNTPLHCLAH+LN RYYS+ WL+ED  RVPPH
Subjt:  ILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPH

Query:  KDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRR
        KD EV+ ER KC+KRY + + ERTK N+E+A FST    F+D DSI DR  MDP SWW IH A  P LQ LA+K+L QPSSSSCCERNWSTYSF++S+RR
Subjt:  KDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRR

Query:  NKMTPQRAEDLVFIHTNLRLLSRRTPEY
        NKMTPQRAEDLV+IH+NLRLLSRR+P+Y
Subjt:  NKMTPQRAEDLVFIHTNLRLLSRRTPEY

KAG5540580.1 hypothetical protein RHGRI_020709 [Rhododendron griersonianum]1.5e-17959.09Show/hide
Query:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL
        W+CN+C   K GSYTRV+AHLLK+ G GV  C  VT  +VA+  KL +E K + +++ PK+VPLPPS  +   +    + G        +KKR+   S  
Subjt:  WQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSL

Query:  E----KSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------
        +    K+FN+   D +H+EIARMFYS GLPF+LARNPHYV+++  AAN+++ GY+PP YN+LRT LLQRE+AN+ERLL  IK TW +K            
Subjt:  E----KSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------

Query:  ---------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDY
                 GD   +KNFIMNHSMRL +FNEFV LKLLSVAE RFAS+I+ML+RFKLIK GLQ+MVIS +W  ++E+    A  V+E +L+  WWDK+DY
Subjt:  ---------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDY

Query:  ILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPH
        +LSFT PIYDM+R  DTDKP+LHL+YDMWD MIEKVK  IY+HE KH+++ S FY+V+H+IL+DRWNKNNTPLHCLAH+LN RYYS+ WL+ED  RVPPH
Subjt:  ILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPH

Query:  KDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRR
        KD EV+ ER KC+KRY + + ERTK N+E+A FST    F+D DSI DR  MDP SWW IH A  P LQ LA+K+L QPSSSSCCERNWSTYSF++S+RR
Subjt:  KDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRR

Query:  NKMTPQRAEDLVFIHTNLRLLSRRTPEY
        NKMTPQRAEDLV+IH+NLRLLSRR+P+Y
Subjt:  NKMTPQRAEDLVFIHTNLRLLSRRTPEY

XP_038721052.1 uncharacterized protein LOC120013346 isoform X1 [Tripterygium wilfordii]2.3e-18053.69Show/hide
Query:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVS
        +SW CNFC  +K  SYTRVRAHLLK+   G+  C+ VT KD+AEMQ+LE+EAK R   +APK+VPLPPS        S  S G Y+ +  E+KKRK   S
Subjt:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVS

Query:  S-----LEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK---------
        S     LEKSFN+   +Q+HA IAR FY+SGLPFHLAR+P+YV+ F  A ++ L+GYLPPGYN+LRTTLLQ+EKANVERLL  IKSTW +K         
Subjt:  S-----LEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILD
                          GDV  +KNFIMNHSMRL +FNEFVPLKLLS+A  RFAS ++MLKRF LIK  L +MVIS++W  Y+EDD GKA+ VKE +LD
Subjt:  ------------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILD

Query:  NIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQ
        ++WWD IDYIL FTAPIYDM+RA DTDKP LHLVYDMWD+MIEKV++AIYR EGK V+E S FY+V+H IL+ RWNKNNTPLHCLAHSLN RYYSE+WL 
Subjt:  NIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQ

Query:  EDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWST
        ED  RVPPHKD+EV+RERMKC+K+YFS++EER  V +EF NFS+M+  FAD DSI  R  MDPKSWW    A  P LQ LA+K+LVQPSSSSC ERNWST
Subjt:  EDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWST

Query:  YSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY
        YSFV+S+RRN+M P+RAEDLVFIH+NLRLLSRR P+Y
Subjt:  YSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY

TrEMBL top hitse value%identityAlignment
A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein1.2e-17455.61Show/hide
Query:  DEAKIRIQKNAPKQVPLP-PSRHTQPESQSFGSMGSYSYLTMETKKRK----GSVSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALA
        +E K+R++ NAPK+VPLP PS      S    SM S  +   ++KKRK    G+ + +EK+FN+   DQ+HAEIARMFYS+GLPFHLARNPH+VNAF  A
Subjt:  DEAKIRIQKNAPKQVPLP-PSRHTQPESQSFGSMGSYSYLTMETKKRK----GSVSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALA

Query:  ANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK----------------------------------------------------------
        AN+ L+GY+PPGYNMLRT+LLQREKAN+ERLL  IK TW +K                                                          
Subjt:  ANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK----------------------------------------------------------

Query:  ---------------------------------------------------------------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSV
                                                                             GDVM +K+FIMNHSMRL MFNEFV LKLLSV
Subjt:  ---------------------------------------------------------------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSV

Query:  AEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAI
        A+ RFAS+I+MLKRFKLIK GLQ MVISDKW+CY+E DVG AR VKE +LD+IWWD IDYILSFT+PIYDM+R  DTDKP LHLVYDMWDTMIEKVK  I
Subjt:  AEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAI

Query:  YRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKF
        +RHEGK  DE S FY+V+HQIL+D WNKNNTPLHCLAHSLN RYYS++WLQED +RVPP+KD+EVSRER KC+ +YF ++EERT VN+EFANFS    +F
Subjt:  YRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKF

Query:  ADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY
         ++DS+ DRY MDP SWWAIH A  P LQ+LA KLL+QPSSSSCCERNWSTYSFV+S+RRNKMTP+RAEDLVFIH+NLRLLSR+TP+Y
Subjt:  ADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY

A0A5B7AFB0 Uncharacterized protein2.7e-19055.03Show/hide
Query:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRK----
        +SWQCNFC + KK SYTRVRAHLL+L G G+  C KVT KD+ EMQKLEDE K+R++ NA K+VPLP S  +   S SF   G       ++KKRK    
Subjt:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRK----

Query:  GSVSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK----------
        GS + LEK+FNM   +Q+HAEIARMFYSSGLPFHLARNP+YV++F  AANN + GYLPPGYN+LRTTLLQ EK N+ERLL  IK TW +K          
Subjt:  GSVSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDN
                         GDVM +K+FIMNHS+RLVMFNEFV LKLLSVA+ RFAS I+M +RFKLIK GLQ MVISDKW+ Y+EDDVG+ R VKE +L++
Subjt:  -----------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDN

Query:  IWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQE
        IWWD IDYILSFT PIY+M++A DTDKP LHLVYDMWD+M+EKVK+AIYRHE K  +E S+FY+V+H IL+DRWNKNNTPLHCLAHSLN +YYS +WL E
Subjt:  IWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQE

Query:  DKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTY
        + NRVPP+K+ E+S+ER+KC+KRYFS++E+RTKV +E+A FST +  F   DSI DRY MDPKSWW IH +  P LQ+LA+KLLVQPSSSSCCERNWSTY
Subjt:  DKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTY

Query:  SFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY
        SFV+S+RRNKMTPQ AEDLVF+H+NLRLLSRRTP+Y
Subjt:  SFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY

A0A6J1DT13 uncharacterized protein LOC111023231 isoform X13.4e-17759.89Show/hide
Query:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMG--SYSYLTMETKKRKGS
        +SWQCNFCQE K+ SYTRVRAHL+KL G G+G C+KVT KDVAEMQ+LEDEAKIR +KNAPK+V LPP  H Q  +QS GSM   S+S+ T + KKRK S
Subjt:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMG--SYSYLTMETKKRKGS

Query:  VSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------
         S+LEKSFNM   DQ+H+EIA+MFYSSGLPF LARNPH+V AF  AANN LSGY+PPGYNMLRTTLLQREK N+ERLL  IKSTW  K            
Subjt:  VSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIW
                       GDVM VK+FIMNH MRL MF EFV LKLLS+AE RFA TI MLKRFKLIKSGLQ M ISDKW+CY+EDDVGKA+H+K+L+L++IW
Subjt:  ---------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIW

Query:  WDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDK
        WDKIDYILSFT+PIYDMIRA DTDKP LHL+YDMWDTMIEKVK AIYR++GKH+D+ SSFY V+HQILIDRWNKNNTPLHCLAHSLN RYYSEQWLQEDK
Subjt:  WDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDK

Query:  NRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRY
        NRVPPH+DLEV+RERMK +KRYFSSNEERTKV +EFANFSTMAA+FAD+DSIR+RY
Subjt:  NRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRY

A0A6J1DUJ6 uncharacterized protein LOC111023231 isoform X23.4e-17759.89Show/hide
Query:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMG--SYSYLTMETKKRKGS
        +SWQCNFCQE K+ SYTRVRAHL+KL G G+G C+KVT KDVAEMQ+LEDEAKIR +KNAPK+V LPP  H Q  +QS GSM   S+S+ T + KKRK S
Subjt:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMG--SYSYLTMETKKRKGS

Query:  VSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------
         S+LEKSFNM   DQ+H+EIA+MFYSSGLPF LARNPH+V AF  AANN LSGY+PPGYNMLRTTLLQREK N+ERLL  IKSTW  K            
Subjt:  VSSLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQK------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIW
                       GDVM VK+FIMNH MRL MF EFV LKLLS+AE RFA TI MLKRFKLIKSGLQ M ISDKW+CY+EDDVGKA+H+K+L+L++IW
Subjt:  ---------------GDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIW

Query:  WDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDK
        WDKIDYILSFT+PIYDMIRA DTDKP LHL+YDMWDTMIEKVK AIYR++GKH+D+ SSFY V+HQILIDRWNKNNTPLHCLAHSLN RYYSEQWLQEDK
Subjt:  WDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDK

Query:  NRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRY
        NRVPPH+DLEV+RERMK +KRYFSSNEERTKV +EFANFSTMAA+FAD+DSIR+RY
Subjt:  NRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRY

A0A7J0G1I1 Uncharacterized protein1.1e-16757.82Show/hide
Query:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVS
        + W+CNFC   K GSYTRVRAHLLK+ G G+GPC  VT  ++AE  +L +E ++  ++N  K+VPLPP   T+ + +   S  S               +
Subjt:  MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVS

Query:  SLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKGDVMFVKNFIMNHS
         + K+FN+   DQ+H+EIARMFYS GLPF+LARNP+YV+++  AANN++ GY+PPGYN+LRT LLQRE+AN+ERLL  IK TW +KG  +    +  +  
Subjt:  SLEKSFNMAICDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKGDVMFVKNFIMNHS

Query:  MRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLH
          L+ F     + LLSVAE RFA+  ++L+RFKLIK  LQ+MVIS +W  ++E +VG A  V+E +LD  WWDK+DYILSFTAPIYDM+R  DTD+P+LH
Subjt:  MRLVMFNEFVPLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLH

Query:  LVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEER
        L+YDMWD+MI KVK AIY+HEGK +D+   FY V+H+IL+DRWNKNNTPLHCLAH+LN RYYS++WL+ED NRVPPH+D EV+ ER KC+KRY     ER
Subjt:  LVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEER

Query:  TKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSR
        TK N+E+A FST    F+D DSI DR+ MDP SWW IH     TLQ +A+K+L QPSSSSCCERNWSTYSF++S++RNKMTPQRAEDLVF+H+NLRLLSR
Subjt:  TKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSR

Query:  RTPEY
        R+P+Y
Subjt:  RTPEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily5.6e-1523.93Show/hide
Query:  LILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKN-NTPLHCLAHSLNLRYYS
        ++ DN +W  ++  ++ + PI  ++R   T KP++  +Y++     E ++      E KH        +V   I+   W ++ ++PLH  A  LN    S
Subjt:  LILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKN-NTPLHCLAHSLNLRYYS

Query:  EQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCE
         Q+  E K      +D        K +++   +++ R  +  +   F+     F  + ++  R  + P  WW       P LQ +A+++L Q  S    E
Subjt:  EQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCE

Query:  RNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRL
        R WST+  ++  RRNK+  +    L +++ NL+L
Subjt:  RNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRL

AT3G17450.1 hAT dimerisation domain-containing protein4.0e-2127.07Show/hide
Query:  VKNFIMNHSMRL-VMFNEFVP-LKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVG-KARHVKELILDNIWWDKIDYILSFTAPIYDM
        +  FI N +  L +M NEF   L LL  A  R AS    L+     K+ L+ +  SD W   +      + R V++++L  ++W K+ Y+L    P+  +
Subjt:  VKNFIMNHSMRL-VMFNEFVP-LKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVG-KARHVKELILDNIWWDKIDYILSFTAPIYDM

Query:  IRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNK-NNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM
        I   +     L + Y          K+AI         ++  F+ V+      RWN   + PL+  A+  N  Y         K R       EV R   
Subjt:  IRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNK-NNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM

Query:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAED
        +CI R    N  R    ++  +++   A F    +I  R  +DP +WW  H      LQ +A+++L    SS  CE  WS Y  VNS  +++   +  +D
Subjt:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAED

Query:  LVFIHTNLRLLSRR
        L ++H NLRL  ++
Subjt:  LVFIHTNLRLLSRR

AT3G22220.1 hAT transposon superfamily2.2e-1124.44Show/hide
Query:  VKNFIMNHSMRLVMFNEFV-PLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKW--ACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDM
        V   I NHS  L +  +F     ++       A+    + R   +K  LQ MV S +W    Y ++  G A  + E I D  +W  +      TAPI  +
Subjt:  VKNFIMNHSMRLVMFNEFV-PLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKW--ACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDM

Query:  IRASDTD-KPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM
        +R   ++ KP++  VY       E +K  +      H +E+  ++++     IDRW     PL+     LN +++           +      E+    +
Subjt:  IRASDTD-KPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM

Query:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSC-CERNWSTYSFVNSMRRNKMTPQRAE
         CI++       +  V  +  ++      F  + +IR R  M P  WW+ +      L   A+++L Q  SSS    RN ++ S +    +N +  QR  
Subjt:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSC-CERNWSTYSFVNSMRRNKMTPQRAE

Query:  DLVFIHTNLRL
        DLVF+  N+RL
Subjt:  DLVFIHTNLRL

AT3G22220.2 hAT transposon superfamily2.2e-1124.44Show/hide
Query:  VKNFIMNHSMRLVMFNEFV-PLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKW--ACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDM
        V   I NHS  L +  +F     ++       A+    + R   +K  LQ MV S +W    Y ++  G A  + E I D  +W  +      TAPI  +
Subjt:  VKNFIMNHSMRLVMFNEFV-PLKLLSVAEARFASTIIMLKRFKLIKSGLQTMVISDKW--ACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDM

Query:  IRASDTD-KPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM
        +R   ++ KP++  VY       E +K  +      H +E+  ++++     IDRW     PL+     LN +++           +      E+    +
Subjt:  IRASDTD-KPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSSFYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERM

Query:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSC-CERNWSTYSFVNSMRRNKMTPQRAE
         CI++       +  V  +  ++      F  + +IR R  M P  WW+ +      L   A+++L Q  SSS    RN ++ S +    +N +  QR  
Subjt:  KCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSC-CERNWSTYSFVNSMRRNKMTPQRAE

Query:  DLVFIHTNLRL
        DLVF+  N+RL
Subjt:  DLVFIHTNLRL

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.0e-2427.87Show/hide
Query:  AEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTD-KPSLHLVYDMWDTMIEKV-KV
        A  R A++ I L +F  +K  L+ MV SD+W   K         +K       +W  + + L    P+  ++R  D + KP +  +Y   D   E + K 
Subjt:  AEARFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTD-KPSLHLVYDMWDTMIEKV-KV

Query:  AIYRHEGKHVDEFSSFYEVLHQILIDRWN-KNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMA
          Y+ E          Y++  +I+  RW+ + + PLH   + LN  ++  Q   +D          EV    + C+ R     E + K+  E   F    
Subjt:  AIYRHEGKHVDEFSSFYEVLHQILIDRWN-KNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMA

Query:  AKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRR
          F    +IR R  M P  WW+ + + TP LQ  A+K+L    S++ CERNW  +  +++ RRN++T  R  D++F+  N R L RR
Subjt:  AKFADHDSIRDRYCMDPKSWWAIHSAFTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGGCAGTGTAATTTTTGTCAAGAAAAAAAGAAAGGTTCTTATACAAGAGTTAGGGCTCACTTATTGAAATTAGGTGGTTGTGGAGTTGGACCATGTCGAAAGGT
CACCCCTAAAGATGTTGCTGAAATGCAAAAGTTGGAAGATGAGGCAAAAATTCGAATCCAAAAGAATGCTCCTAAACAAGTTCCTTTACCACCTTCACGTCATACCCAAC
CTGAATCTCAATCTTTTGGAAGTATGGGTAGCTATTCTTATTTAACCATGGAAACAAAGAAGAGAAAGGGAAGTGTAAGTTCACTTGAAAAGTCATTTAACATGGCAATC
TGTGATCAAGTGCATGCAGAAATTGCTAGAATGTTTTATTCTTCAGGTTTGCCTTTTCATTTGGCTAGAAATCCACATTATGTGAATGCATTTGCTTTGGCAGCAAACAA
TGCATTGTCGGGTTATTTACCTCCGGGATATAATATGTTGAGAACAACTCTTCTTCAAAGAGAAAAAGCAAATGTGGAAAGATTATTGCATCTTATTAAAAGCACATGGT
GTCAGAAGGGTGATGTTATGTTTGTCAAGAACTTCATTATGAATCATTCCATGAGGCTTGTTATGTTCAACGAGTTTGTGCCTTTGAAGTTACTTTCAGTTGCAGAAGCA
CGCTTTGCATCTACGATCATTATGCTTAAAAGATTTAAACTCATTAAGAGTGGTTTGCAAACTATGGTTATTAGCGATAAATGGGCGTGCTACAAAGAAGATGATGTGGG
GAAAGCAAGACATGTCAAGGAGTTGATACTTGATAATATTTGGTGGGATAAGATTGATTATATTCTTTCTTTTACTGCGCCTATATATGACATGATCAGAGCTAGTGATA
CAGATAAACCTTCCCTTCATTTGGTATATGATATGTGGGACACCATGATTGAAAAAGTGAAGGTTGCAATCTATAGACATGAAGGAAAGCATGTAGATGAATTCTCGTCT
TTCTATGAGGTGTTGCATCAAATTCTAATTGATCGTTGGAACAAAAATAACACTCCACTCCATTGTTTGGCACATTCTCTAAACCTAAGGTATTATAGTGAACAATGGCT
TCAAGAAGACAAGAATCGAGTCCCACCACATAAAGATTTAGAAGTATCTCGAGAGAGGATGAAGTGTATCAAAAGATATTTTAGTTCAAATGAGGAGCGCACTAAAGTGA
ACATAGAATTTGCCAATTTCTCTACAATGGCTGCAAAGTTTGCTGACCATGATTCTATACGTGATAGATATTGTATGGATCCTAAAAGTTGGTGGGCCATCCATAGTGCC
TTTACTCCAACTCTACAGGCACTAGCTATGAAACTACTTGTGCAACCTTCATCTTCCTCATGTTGTGAGAGAAATTGGAGTACATATTCATTTGTGAACTCCATGAGAAG
AAACAAGATGACACCACAACGTGCAGAAGACTTGGTGTTTATCCATACCAATCTTCGTCTTTTATCGAGAAGAACTCCTGAATATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTGGCAGTGTAATTTTTGTCAAGAAAAAAAGAAAGGTTCTTATACAAGAGTTAGGGCTCACTTATTGAAATTAGGTGGTTGTGGAGTTGGACCATGTCGAAAGGT
CACCCCTAAAGATGTTGCTGAAATGCAAAAGTTGGAAGATGAGGCAAAAATTCGAATCCAAAAGAATGCTCCTAAACAAGTTCCTTTACCACCTTCACGTCATACCCAAC
CTGAATCTCAATCTTTTGGAAGTATGGGTAGCTATTCTTATTTAACCATGGAAACAAAGAAGAGAAAGGGAAGTGTAAGTTCACTTGAAAAGTCATTTAACATGGCAATC
TGTGATCAAGTGCATGCAGAAATTGCTAGAATGTTTTATTCTTCAGGTTTGCCTTTTCATTTGGCTAGAAATCCACATTATGTGAATGCATTTGCTTTGGCAGCAAACAA
TGCATTGTCGGGTTATTTACCTCCGGGATATAATATGTTGAGAACAACTCTTCTTCAAAGAGAAAAAGCAAATGTGGAAAGATTATTGCATCTTATTAAAAGCACATGGT
GTCAGAAGGGTGATGTTATGTTTGTCAAGAACTTCATTATGAATCATTCCATGAGGCTTGTTATGTTCAACGAGTTTGTGCCTTTGAAGTTACTTTCAGTTGCAGAAGCA
CGCTTTGCATCTACGATCATTATGCTTAAAAGATTTAAACTCATTAAGAGTGGTTTGCAAACTATGGTTATTAGCGATAAATGGGCGTGCTACAAAGAAGATGATGTGGG
GAAAGCAAGACATGTCAAGGAGTTGATACTTGATAATATTTGGTGGGATAAGATTGATTATATTCTTTCTTTTACTGCGCCTATATATGACATGATCAGAGCTAGTGATA
CAGATAAACCTTCCCTTCATTTGGTATATGATATGTGGGACACCATGATTGAAAAAGTGAAGGTTGCAATCTATAGACATGAAGGAAAGCATGTAGATGAATTCTCGTCT
TTCTATGAGGTGTTGCATCAAATTCTAATTGATCGTTGGAACAAAAATAACACTCCACTCCATTGTTTGGCACATTCTCTAAACCTAAGGTATTATAGTGAACAATGGCT
TCAAGAAGACAAGAATCGAGTCCCACCACATAAAGATTTAGAAGTATCTCGAGAGAGGATGAAGTGTATCAAAAGATATTTTAGTTCAAATGAGGAGCGCACTAAAGTGA
ACATAGAATTTGCCAATTTCTCTACAATGGCTGCAAAGTTTGCTGACCATGATTCTATACGTGATAGATATTGTATGGATCCTAAAAGTTGGTGGGCCATCCATAGTGCC
TTTACTCCAACTCTACAGGCACTAGCTATGAAACTACTTGTGCAACCTTCATCTTCCTCATGTTGTGAGAGAAATTGGAGTACATATTCATTTGTGAACTCCATGAGAAG
AAACAAGATGACACCACAACGTGCAGAAGACTTGGTGTTTATCCATACCAATCTTCGTCTTTTATCGAGAAGAACTCCTGAATATTAA
Protein sequenceShow/hide protein sequence
MSWQCNFCQEKKKGSYTRVRAHLLKLGGCGVGPCRKVTPKDVAEMQKLEDEAKIRIQKNAPKQVPLPPSRHTQPESQSFGSMGSYSYLTMETKKRKGSVSSLEKSFNMAI
CDQVHAEIARMFYSSGLPFHLARNPHYVNAFALAANNALSGYLPPGYNMLRTTLLQREKANVERLLHLIKSTWCQKGDVMFVKNFIMNHSMRLVMFNEFVPLKLLSVAEA
RFASTIIMLKRFKLIKSGLQTMVISDKWACYKEDDVGKARHVKELILDNIWWDKIDYILSFTAPIYDMIRASDTDKPSLHLVYDMWDTMIEKVKVAIYRHEGKHVDEFSS
FYEVLHQILIDRWNKNNTPLHCLAHSLNLRYYSEQWLQEDKNRVPPHKDLEVSRERMKCIKRYFSSNEERTKVNIEFANFSTMAAKFADHDSIRDRYCMDPKSWWAIHSA
FTPTLQALAMKLLVQPSSSSCCERNWSTYSFVNSMRRNKMTPQRAEDLVFIHTNLRLLSRRTPEY