; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G009580 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G009580
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRibonuclease H
Genome locationchr11:14686420..14694526
RNA-Seq ExpressionLsi11G009580
SyntenyLsi11G009580
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]6.5e-16532.37Show/hide
Query:  DWDFLYKSDVTNAQAHEVIKAWKFAKRMKSLRHCEGTMKQ----------------------------------------ERNHVLAAENEGLREEVRRW
        DW+F Y+S++ N + HE ++AWKF K+MKSLRHCEGT +Q                                        E N VLA ENE LREEV++W
Subjt:  DWDFLYKSDVTNAQAHEVIKAWKFAKRMKSLRHCEGTMKQ----------------------------------------ERNHVLAAENEGLREEVRRW

Query:  VQQATSAQRQLEEAKRHIQKQSKLEKDMNSLNTEAVQMRKKNKRLLRNIVALHNKADTQKICIKELKQELERMNKVATKFQNALNEQVISSQKIAAEFEA
        VQQA +  R L+EAKR                          KRL                                               + A   E 
Subjt:  VQQATSAQRQLEEAKRHIQKQSKLEKDMNSLNTEAVQMRKKNKRLLRNIVALHNKADTQKICIKELKQELERMNKVATKFQNALNEQVISSQKIAAEFEA

Query:  LQHSTNEYKSQLIEVECRNTFLQKAVASFEGQLLICRKALEVATDDYTQLKEEYQDMSTDFAIWRDEYDIVRHKYDDARNQIEQGAEKLRQMVKMADQFS
        LQHS  EYKSQLIE E +N FLQKAV S E QLLICRKA EV TDDY QL E+YQ+MS DF +W+DEY+ +R KYDDA  ++E+GAEKLRQM ++ADQFS
Subjt:  LQHSTNEYKSQLIEVECRNTFLQKAVASFEGQLLICRKALEVATDDYTQLKEEYQDMSTDFAIWRDEYDIVRHKYDDARNQIEQGAEKLRQMVKMADQFS

Query:  AQAMTLQQGIIPTQHNSKELS-------------------------------------------------------------------------------
         QA TL+QG+IPT+   KELS                                                                               
Subjt:  AQAMTLQQGIIPTQHNSKELS-------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------QCCKD--------
                                                                                                C +D        
Subjt:  ---------------------------------------------------------------------------------------QCCKD--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------NKVHDLT
                                                                                                     NKV D+ 
Subjt:  ---------------------------------------------------------------------------------------------NKVHDLT

Query:  TPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEP
        T  + LFQILH AGYLSP  +++ R KIGC   + CL HPE +DHS EDC +FK E+QKLMD+KIL+I Q   ++IEV+MI+NASS+EK S E + + +P
Subjt:  TPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEP

Query:  LVIHYEEKFSVTSCIEIPKTMIVEIP-GPFAYKDNRAVPWNM------------NGKAVEITSVDDLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLK
        LVIHYEEK S+ S I+ PK M VEIP G          P N+             GKA+E+   DDL++LS+V  +K TLV ++TD++ VS+EEA     
Subjt:  LVIHYEEKFSVTSCIEIPKTMIVEIP-GPFAYKDNRAVPWNM------------NGKAVEITSVDDLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLK

Query:  LIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATNCISFTDEEISPEGIGHTKALHISVKCKDYHVARVL
                                                                               EEI PEG GHTKALHISVKCKD+HVARVL
Subjt:  LIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATNCISFTDEEISPEGIGHTKALHISVKCKDYHVARVL

Query:  VDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGG
        VDNGSSLNIMSRSTLMKL                   A REVIGDI+IPLKIGP+TFN+ FQVMD+NS YSCLLGRPWIHS  AVPSSLHQ++KF+VEGG
Subjt:  VDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGG

Query:  QAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGY
        QAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE ANATIFPTEGL ++RY+S+TSLMIAK MI++GFQ++ GL K+NQG+ EVI LPKAKE+F LGY
Subjt:  QAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGY

Query:  RPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------DAR------DENINL---------------------------
        +P   E EKVRA+KKE R+A L G E ++ R++IPHLSE FK                D+        EN +L                           
Subjt:  RPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------DAR------DENINL---------------------------

Query:  --------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE------GTTISEENDTSI
                      +VDF+V ICNLEQN+ E E D+SPELL++IEQEEKKT+ YQE L VIN GT EE      GT  SE++ + +
Subjt:  --------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE------GTTISEENDTSI

KAA0043020.1 uncharacterized protein E6C27_scaffold75G00990 [Cucumis melo var. makuwa]4.4e-15353.58Show/hide
Query:  MDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPWNMNGKAVEITSVDDLDNLSEV
        MD KIL+I Q   +EIEV+MI+NASS+EK S E +S+ + LVIHYE+K S+ S I+ PK M VEIPGPFAYKDN  VPW    + +    V     + E+
Subjt:  MDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPWNMNGKAVEITSVDDLDNLSEV

Query:  SAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDIS-VNALSKIVENITATNCISFTDE
        +            + F  +E  R   K I+ +    +  L                    K+LLDILNRAHVG DIS V+ALS+IVENITAT+CISFTDE
Subjt:  SAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDIS-VNALSKIVENITATNCISFTDE

Query:  EISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSC
        EI P+G GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVI DI+IPLKIGP+TFN+  QVMDVNSSYSC
Subjt:  EISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSC

Query:  LLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQI
        LLGRPWIHS  AVPS LHQ++KF+VEGGQAIVYGE+++FVTK S LPYVEA EEA ECSYRSFE+ANATIFP EGL ++RY+S+TSLMIAK MI++ FQ+
Subjt:  LLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQI

Query:  NGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFKDAR----------------------DENIN
        + GL K+NQG+ +VI LPKAKE+F LGY+P A EWEKVRA+KKE R+  L G E ++ R++IPHL E FK                          EN +
Subjt:  NGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFKDAR----------------------DENIN

Query:  LS-----------------------------------------VDFKVSICNLEQNME-DECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE----
        LS                                         VDFKV ICNLEQN+E  E D+SPELL++IEQEEKKT+ YQE L+VINLGTL+E    
Subjt:  LS-----------------------------------------VDFKVSICNLEQNME-DECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE----

Query:  --GTTISEENDTSI
          GT  SE++ + +
Subjt:  --GTTISEENDTSI

KAA0046608.1 uncharacterized protein E6C27_scaffold114G001540 [Cucumis melo var. makuwa]1.1e-17765.74Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH +GYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLM++KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +PLVIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRT ARIS+LSLF++SEPHRK+LLDILNRAHVG DISVNALS+IVENITATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVI DI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHSV AVPSSLHQ++KF+VEGGQAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE+ANATIFPTE L ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGF
        I++GF
Subjt:  IRNGF

KAA0061241.1 uncharacterized protein E6C27_scaffold455G00760 [Cucumis melo var. makuwa]3.5e-20365.51Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH AGYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLMD+KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +P VIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRTPARIS+LSLF++SEPHRK+LLDILNRAHVG DISVNALS+IVENI ATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVIGDI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHS  AVPSSLHQ++KF+VE GQAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE+ANATIFPTEGL ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK
        I++GFQ++ GL K+NQG+ EVI LPKAKE F LGY+P   EWEKVRA+KKE R+A L G E ++ R++IPHLSE FK
Subjt:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]8.7e-19454.62Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH AGYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLMD+KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +PLVIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRTPARIS+LSLF +SEPHRK+LLDILNRAHVG DISVNALS+IVENITATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVIGDI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHS  AVPSSLHQ                               +   +  CSYRSFE+ANATIFPTEGL ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------------D
        I++GFQ++ GL K+NQG+ EVI LPKAKE+F LGY+P   EWEKVRA+ KE R+A L G E ++ R++IPHLSE FK                      +
Subjt:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------------D

Query:  ARDENINL---------------------------------------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVI
           EN +L                                             +VDF+V ICNLEQN+ E E D+SPELL++IEQEEKKT+PYQE L+VI
Subjt:  ARDENINL---------------------------------------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVI

Query:  NLGTLEE------GTTISEENDTSI
        NLGT EE      GT  SE++ + +
Subjt:  NLGTLEE------GTTISEENDTSI

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase3.2e-16532.37Show/hide
Query:  DWDFLYKSDVTNAQAHEVIKAWKFAKRMKSLRHCEGTMKQ----------------------------------------ERNHVLAAENEGLREEVRRW
        DW+F Y+S++ N + HE ++AWKF K+MKSLRHCEGT +Q                                        E N VLA ENE LREEV++W
Subjt:  DWDFLYKSDVTNAQAHEVIKAWKFAKRMKSLRHCEGTMKQ----------------------------------------ERNHVLAAENEGLREEVRRW

Query:  VQQATSAQRQLEEAKRHIQKQSKLEKDMNSLNTEAVQMRKKNKRLLRNIVALHNKADTQKICIKELKQELERMNKVATKFQNALNEQVISSQKIAAEFEA
        VQQA +  R L+EAKR                          KRL                                               + A   E 
Subjt:  VQQATSAQRQLEEAKRHIQKQSKLEKDMNSLNTEAVQMRKKNKRLLRNIVALHNKADTQKICIKELKQELERMNKVATKFQNALNEQVISSQKIAAEFEA

Query:  LQHSTNEYKSQLIEVECRNTFLQKAVASFEGQLLICRKALEVATDDYTQLKEEYQDMSTDFAIWRDEYDIVRHKYDDARNQIEQGAEKLRQMVKMADQFS
        LQHS  EYKSQLIE E +N FLQKAV S E QLLICRKA EV TDDY QL E+YQ+MS DF +W+DEY+ +R KYDDA  ++E+GAEKLRQM ++ADQFS
Subjt:  LQHSTNEYKSQLIEVECRNTFLQKAVASFEGQLLICRKALEVATDDYTQLKEEYQDMSTDFAIWRDEYDIVRHKYDDARNQIEQGAEKLRQMVKMADQFS

Query:  AQAMTLQQGIIPTQHNSKELS-------------------------------------------------------------------------------
         QA TL+QG+IPT+   KELS                                                                               
Subjt:  AQAMTLQQGIIPTQHNSKELS-------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------QCCKD--------
                                                                                                C +D        
Subjt:  ---------------------------------------------------------------------------------------QCCKD--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------NKVHDLT
                                                                                                     NKV D+ 
Subjt:  ---------------------------------------------------------------------------------------------NKVHDLT

Query:  TPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEP
        T  + LFQILH AGYLSP  +++ R KIGC   + CL HPE +DHS EDC +FK E+QKLMD+KIL+I Q   ++IEV+MI+NASS+EK S E + + +P
Subjt:  TPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEP

Query:  LVIHYEEKFSVTSCIEIPKTMIVEIP-GPFAYKDNRAVPWNM------------NGKAVEITSVDDLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLK
        LVIHYEEK S+ S I+ PK M VEIP G          P N+             GKA+E+   DDL++LS+V  +K TLV ++TD++ VS+EEA     
Subjt:  LVIHYEEKFSVTSCIEIPKTMIVEIP-GPFAYKDNRAVPWNM------------NGKAVEITSVDDLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLK

Query:  LIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATNCISFTDEEISPEGIGHTKALHISVKCKDYHVARVL
                                                                               EEI PEG GHTKALHISVKCKD+HVARVL
Subjt:  LIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATNCISFTDEEISPEGIGHTKALHISVKCKDYHVARVL

Query:  VDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGG
        VDNGSSLNIMSRSTLMKL                   A REVIGDI+IPLKIGP+TFN+ FQVMD+NS YSCLLGRPWIHS  AVPSSLHQ++KF+VEGG
Subjt:  VDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGG

Query:  QAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGY
        QAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE ANATIFPTEGL ++RY+S+TSLMIAK MI++GFQ++ GL K+NQG+ EVI LPKAKE+F LGY
Subjt:  QAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGY

Query:  RPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------DAR------DENINL---------------------------
        +P   E EKVRA+KKE R+A L G E ++ R++IPHLSE FK                D+        EN +L                           
Subjt:  RPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------DAR------DENINL---------------------------

Query:  --------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE------GTTISEENDTSI
                      +VDF+V ICNLEQN+ E E D+SPELL++IEQEEKKT+ YQE L VIN GT EE      GT  SE++ + +
Subjt:  --------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE------GTTISEENDTSI

A0A5A7TI68 Uncharacterized protein2.1e-15353.58Show/hide
Query:  MDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPWNMNGKAVEITSVDDLDNLSEV
        MD KIL+I Q   +EIEV+MI+NASS+EK S E +S+ + LVIHYE+K S+ S I+ PK M VEIPGPFAYKDN  VPW    + +    V     + E+
Subjt:  MDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPWNMNGKAVEITSVDDLDNLSEV

Query:  SAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDIS-VNALSKIVENITATNCISFTDE
        +            + F  +E  R   K I+ +    +  L                    K+LLDILNRAHVG DIS V+ALS+IVENITAT+CISFTDE
Subjt:  SAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDIS-VNALSKIVENITATNCISFTDE

Query:  EISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSC
        EI P+G GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVI DI+IPLKIGP+TFN+  QVMDVNSSYSC
Subjt:  EISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSC

Query:  LLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQI
        LLGRPWIHS  AVPS LHQ++KF+VEGGQAIVYGE+++FVTK S LPYVEA EEA ECSYRSFE+ANATIFP EGL ++RY+S+TSLMIAK MI++ FQ+
Subjt:  LLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQI

Query:  NGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFKDAR----------------------DENIN
        + GL K+NQG+ +VI LPKAKE+F LGY+P A EWEKVRA+KKE R+  L G E ++ R++IPHL E FK                          EN +
Subjt:  NGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFKDAR----------------------DENIN

Query:  LS-----------------------------------------VDFKVSICNLEQNME-DECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE----
        LS                                         VDFKV ICNLEQN+E  E D+SPELL++IEQEEKKT+ YQE L+VINLGTL+E    
Subjt:  LS-----------------------------------------VDFKVSICNLEQNME-DECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEE----

Query:  --GTTISEENDTSI
          GT  SE++ + +
Subjt:  --GTTISEENDTSI

A0A5A7TUT4 Uncharacterized protein5.5e-17865.74Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH +GYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLM++KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +PLVIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRT ARIS+LSLF++SEPHRK+LLDILNRAHVG DISVNALS+IVENITATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVI DI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHSV AVPSSLHQ++KF+VEGGQAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE+ANATIFPTE L ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGF
        I++GF
Subjt:  IRNGF

A0A5A7V681 Retrotrans_gag domain-containing protein1.7e-20365.51Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH AGYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLMD+KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +P VIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRTPARIS+LSLF++SEPHRK+LLDILNRAHVG DISVNALS+IVENI ATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVIGDI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHS  AVPSSLHQ++KF+VE GQAIVYGEED+FVTKTS LPYVEA EEA ECSYRSFE+ANATIFPTEGL ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK
        I++GFQ++ GL K+NQG+ EVI LPKAKE F LGY+P   EWEKVRA+KKE R+A L G E ++ R++IPHLSE FK
Subjt:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK

A0A5A7VAU5 Uncharacterized protein4.2e-19454.62Show/hide
Query:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE
        NKV D+ T  + LFQILH AGYLSP  +++ R KIGC   + CLFHPE +DHS EDC +FK E+QKLMD+KIL+I Q   +EIEV+MI+NASS+EK S E
Subjt:  NKVHDLTTPTRVLFQILHEAGYLSP-IDSNGRNKIGCAIEKGCLFHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKE

Query:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD
         + + +PLVIHYEEK S+ S I+ PK M VEIPGPFAYKDN AVPW                                            GKA+E+   D
Subjt:  GSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPW-----------------------------------------NMNGKAVEITSVD

Query:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN
        DL++LS+V  +K TLV ++TD++ VS+EEA EFLKLIKQS+YKVIEQLHRTPARIS+LSLF +SEPHRK+LLDILNRAHVG DISVNALS+IVENITATN
Subjt:  DLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATN

Query:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD
        CISFTDEEI PEG GHTKALHISVKCKD+HVARVLVDNGSSLNIMSRSTLMKL                  GARREVIGDI+IPLKIGP+TFN+ FQVMD
Subjt:  CISFTDEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKL------------------GARREVIGDIEIPLKIGPTTFNLPFQVMD

Query:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM
        +NSSYSCLLGRPWIHS  AVPSSLHQ                               +   +  CSYRSFE+ANATIFPTEGL ++RY+S+TSLMIAK M
Subjt:  VNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVEGGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMM

Query:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------------D
        I++GFQ++ GL K+NQG+ EVI LPKAKE+F LGY+P   EWEKVRA+ KE R+A L G E ++ R++IPHLSE FK                      +
Subjt:  IRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWEKVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFK----------------------D

Query:  ARDENINL---------------------------------------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVI
           EN +L                                             +VDF+V ICNLEQN+ E E D+SPELL++IEQEEKKT+PYQE L+VI
Subjt:  ARDENINL---------------------------------------------SVDFKVSICNLEQNM-EDECDVSPELLKIIEQEEKKTVPYQEPLEVI

Query:  NLGTLEE------GTTISEENDTSI
        NLGT EE      GT  SE++ + +
Subjt:  NLGTLEE------GTTISEENDTSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGGATTTCTTGTACAAGTCTGACGTGACGAATGCCCAAGCACATGAAGTGATAAAAGCATGGAAATTTGCCAAGAGAATGAAGAGCTTGCGACATTGTGAAGG
TACAATGAAACAAGAAAGGAATCACGTCCTAGCAGCAGAAAATGAAGGATTGAGAGAAGAGGTTAGACGTTGGGTCCAACAAGCTACAAGTGCTCAGAGACAACTGGAAG
AAGCGAAGCGTCATATACAAAAACAATCAAAGTTAGAGAAAGACATGAATTCTTTGAATACTGAAGCAGTTCAGATGAGAAAGAAAAATAAGAGATTACTAAGAAACATT
GTTGCCTTACACAACAAGGCAGATACTCAGAAAATTTGCATCAAAGAACTAAAACAGGAATTGGAAAGGATGAATAAAGTTGCTACAAAATTCCAAAATGCCTTGAATGA
GCAAGTCATATCCTCTCAAAAGATCGCTGCAGAATTTGAAGCTCTACAACACTCAACTAATGAATATAAATCTCAGCTTATAGAAGTAGAATGTCGCAATACATTTCTCC
AGAAGGCTGTTGCAAGTTTTGAAGGCCAGTTGCTGATATGTCGTAAGGCGCTAGAGGTCGCAACAGATGACTATACTCAGTTGAAGGAAGAGTATCAAGACATGTCAACA
GATTTTGCGATATGGAGAGATGAGTATGACATAGTGAGACACAAATATGATGACGCTAGAAATCAGATTGAGCAAGGAGCTGAGAAATTAAGACAGATGGTGAAAATGGC
AGATCAATTCTCAGCACAAGCAATGACTCTCCAGCAAGGCATCATACCTACACAGCATAATAGCAAAGAGTTGTCTCAATGTTGCAAAGATAATAAAGTTCATGACTTGA
CTACTCCAACGAGAGTTCTTTTTCAAATTCTTCATGAGGCTGGGTACCTATCACCAATTGACAGTAATGGCAGAAATAAGATAGGATGCGCCATTGAAAAAGGGTGTTTA
TTCCACCCTGAAATGGATGACCATTCCAAAGAAGATTGTATTGATTTTAAGAGGGAAATACAGAAATTAATGGATGCAAAGATTTTAATGATAAGTCAGACAGACCCGCG
GGAAATCGAGGTTGATATGATTTCCAATGCTTCATCTCACGAAAAAATCTCAAAAGAGGGATCATCTATACAAGAACCATTAGTTATTCATTATGAAGAAAAATTCAGTG
TCACCTCTTGTATCGAGATACCAAAGACAATGATCGTAGAGATACCAGGTCCTTTTGCTTATAAGGATAATCGAGCTGTACCATGGAATATGAATGGTAAAGCTGTGGAA
ATAACGAGCGTGGATGATCTAGACAATTTAAGCGAAGTCTCTGCTCAAAAGGCCACTCTAGTAAGAAGGAAGACAGACAATAAGTTTGTTTCAGAGGAAGAGGCTCGCGA
ATTTTTGAAGTTGATCAAGCAGAGCAAATATAAAGTAATTGAGCAATTACATCGTACCCCAGCCCGTATATCGATATTGTCATTATTCTTGCATTCTGAGCCACATCGCA
AGATTTTGCTAGACATCTTAAATCGGGCACACGTGGGTCAGGATATTTCGGTAAACGCCCTTAGTAAAATTGTGGAGAATATAACTGCAACAAACTGCATCTCATTTACA
GATGAAGAAATTTCTCCTGAAGGTATTGGGCATACAAAGGCATTACACATATCTGTGAAATGTAAAGATTACCATGTGGCTAGAGTTCTAGTTGATAATGGGTCGTCTCT
AAATATAATGTCGAGATCTACTTTGATGAAACTCGGTGCTCGTAGGGAGGTAATTGGAGATATAGAGATTCCACTGAAAATTGGACCCACTACCTTTAATTTACCATTTC
AAGTAATGGATGTAAACTCTTCGTATAGTTGCTTGCTTGGACGACCTTGGATCCATTCGGTAAGGGCAGTCCCGTCTTCACTACACCAAAAGGTAAAGTTTAATGTGGAA
GGTGGTCAGGCCATTGTATATGGAGAAGAGGACATATTTGTAACCAAAACATCAGCACTTCCTTATGTTGAAGCAGCCGAGGAAGCCTTTGAGTGCTCGTACAGATCATT
TGAGGTTGCCAATGCTACTATCTTTCCCACTGAGGGTTTAGATCTAAATCGCTATATGTCAAGAACTTCTCTAATGATTGCAAAGATGATGATAAGAAATGGTTTTCAAA
TAAACGGAGGATTGGAAAAGAATAATCAAGGAAATCCAGAAGTGATCTTTCTTCCTAAAGCTAAAGAGAGGTTTGAATTGGGGTATAGGCCGAAGGCTTTTGAGTGGGAG
AAGGTTCGGGCTGAGAAGAAGGAAATGAGAAACGCACGTCTCGAAGGACATGAAGCTGAGCAAAGAAGACTGAATATACCTCATCTATCTGAAATTTTCAAGGATGCTAG
AGACGAAAACATCAATTTAAGTGTCGATTTTAAAGTTTCGATATGTAATCTCGAACAAAATATGGAAGACGAATGTGATGTATCGCCCGAGTTACTCAAGATAATAGAGC
AAGAAGAAAAAAAGACGGTGCCATATCAAGAACCTTTAGAAGTTATTAATTTGGGGACACTGGAAGAGGGGACAACTATATCAGAAGAAAATGACACGAGCATAGGACAG
GAAAGTTCAACGTCGTTGTTTCCAAGAAGGAGATGTGGTGTTGAAAAGAATCTTACCATTTCAAAAAGATCACAGAGGGAAGTGGACTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTGGGATTTCTTGTACAAGTCTGACGTGACGAATGCCCAAGCACATGAAGTGATAAAAGCATGGAAATTTGCCAAGAGAATGAAGAGCTTGCGACATTGTGAAGG
TACAATGAAACAAGAAAGGAATCACGTCCTAGCAGCAGAAAATGAAGGATTGAGAGAAGAGGTTAGACGTTGGGTCCAACAAGCTACAAGTGCTCAGAGACAACTGGAAG
AAGCGAAGCGTCATATACAAAAACAATCAAAGTTAGAGAAAGACATGAATTCTTTGAATACTGAAGCAGTTCAGATGAGAAAGAAAAATAAGAGATTACTAAGAAACATT
GTTGCCTTACACAACAAGGCAGATACTCAGAAAATTTGCATCAAAGAACTAAAACAGGAATTGGAAAGGATGAATAAAGTTGCTACAAAATTCCAAAATGCCTTGAATGA
GCAAGTCATATCCTCTCAAAAGATCGCTGCAGAATTTGAAGCTCTACAACACTCAACTAATGAATATAAATCTCAGCTTATAGAAGTAGAATGTCGCAATACATTTCTCC
AGAAGGCTGTTGCAAGTTTTGAAGGCCAGTTGCTGATATGTCGTAAGGCGCTAGAGGTCGCAACAGATGACTATACTCAGTTGAAGGAAGAGTATCAAGACATGTCAACA
GATTTTGCGATATGGAGAGATGAGTATGACATAGTGAGACACAAATATGATGACGCTAGAAATCAGATTGAGCAAGGAGCTGAGAAATTAAGACAGATGGTGAAAATGGC
AGATCAATTCTCAGCACAAGCAATGACTCTCCAGCAAGGCATCATACCTACACAGCATAATAGCAAAGAGTTGTCTCAATGTTGCAAAGATAATAAAGTTCATGACTTGA
CTACTCCAACGAGAGTTCTTTTTCAAATTCTTCATGAGGCTGGGTACCTATCACCAATTGACAGTAATGGCAGAAATAAGATAGGATGCGCCATTGAAAAAGGGTGTTTA
TTCCACCCTGAAATGGATGACCATTCCAAAGAAGATTGTATTGATTTTAAGAGGGAAATACAGAAATTAATGGATGCAAAGATTTTAATGATAAGTCAGACAGACCCGCG
GGAAATCGAGGTTGATATGATTTCCAATGCTTCATCTCACGAAAAAATCTCAAAAGAGGGATCATCTATACAAGAACCATTAGTTATTCATTATGAAGAAAAATTCAGTG
TCACCTCTTGTATCGAGATACCAAAGACAATGATCGTAGAGATACCAGGTCCTTTTGCTTATAAGGATAATCGAGCTGTACCATGGAATATGAATGGTAAAGCTGTGGAA
ATAACGAGCGTGGATGATCTAGACAATTTAAGCGAAGTCTCTGCTCAAAAGGCCACTCTAGTAAGAAGGAAGACAGACAATAAGTTTGTTTCAGAGGAAGAGGCTCGCGA
ATTTTTGAAGTTGATCAAGCAGAGCAAATATAAAGTAATTGAGCAATTACATCGTACCCCAGCCCGTATATCGATATTGTCATTATTCTTGCATTCTGAGCCACATCGCA
AGATTTTGCTAGACATCTTAAATCGGGCACACGTGGGTCAGGATATTTCGGTAAACGCCCTTAGTAAAATTGTGGAGAATATAACTGCAACAAACTGCATCTCATTTACA
GATGAAGAAATTTCTCCTGAAGGTATTGGGCATACAAAGGCATTACACATATCTGTGAAATGTAAAGATTACCATGTGGCTAGAGTTCTAGTTGATAATGGGTCGTCTCT
AAATATAATGTCGAGATCTACTTTGATGAAACTCGGTGCTCGTAGGGAGGTAATTGGAGATATAGAGATTCCACTGAAAATTGGACCCACTACCTTTAATTTACCATTTC
AAGTAATGGATGTAAACTCTTCGTATAGTTGCTTGCTTGGACGACCTTGGATCCATTCGGTAAGGGCAGTCCCGTCTTCACTACACCAAAAGGTAAAGTTTAATGTGGAA
GGTGGTCAGGCCATTGTATATGGAGAAGAGGACATATTTGTAACCAAAACATCAGCACTTCCTTATGTTGAAGCAGCCGAGGAAGCCTTTGAGTGCTCGTACAGATCATT
TGAGGTTGCCAATGCTACTATCTTTCCCACTGAGGGTTTAGATCTAAATCGCTATATGTCAAGAACTTCTCTAATGATTGCAAAGATGATGATAAGAAATGGTTTTCAAA
TAAACGGAGGATTGGAAAAGAATAATCAAGGAAATCCAGAAGTGATCTTTCTTCCTAAAGCTAAAGAGAGGTTTGAATTGGGGTATAGGCCGAAGGCTTTTGAGTGGGAG
AAGGTTCGGGCTGAGAAGAAGGAAATGAGAAACGCACGTCTCGAAGGACATGAAGCTGAGCAAAGAAGACTGAATATACCTCATCTATCTGAAATTTTCAAGGATGCTAG
AGACGAAAACATCAATTTAAGTGTCGATTTTAAAGTTTCGATATGTAATCTCGAACAAAATATGGAAGACGAATGTGATGTATCGCCCGAGTTACTCAAGATAATAGAGC
AAGAAGAAAAAAAGACGGTGCCATATCAAGAACCTTTAGAAGTTATTAATTTGGGGACACTGGAAGAGGGGACAACTATATCAGAAGAAAATGACACGAGCATAGGACAG
GAAAGTTCAACGTCGTTGTTTCCAAGAAGGAGATGTGGTGTTGAAAAGAATCTTACCATTTCAAAAAGATCACAGAGGGAAGTGGACTCCTAA
Protein sequenceShow/hide protein sequence
MDWDFLYKSDVTNAQAHEVIKAWKFAKRMKSLRHCEGTMKQERNHVLAAENEGLREEVRRWVQQATSAQRQLEEAKRHIQKQSKLEKDMNSLNTEAVQMRKKNKRLLRNI
VALHNKADTQKICIKELKQELERMNKVATKFQNALNEQVISSQKIAAEFEALQHSTNEYKSQLIEVECRNTFLQKAVASFEGQLLICRKALEVATDDYTQLKEEYQDMST
DFAIWRDEYDIVRHKYDDARNQIEQGAEKLRQMVKMADQFSAQAMTLQQGIIPTQHNSKELSQCCKDNKVHDLTTPTRVLFQILHEAGYLSPIDSNGRNKIGCAIEKGCL
FHPEMDDHSKEDCIDFKREIQKLMDAKILMISQTDPREIEVDMISNASSHEKISKEGSSIQEPLVIHYEEKFSVTSCIEIPKTMIVEIPGPFAYKDNRAVPWNMNGKAVE
ITSVDDLDNLSEVSAQKATLVRRKTDNKFVSEEEAREFLKLIKQSKYKVIEQLHRTPARISILSLFLHSEPHRKILLDILNRAHVGQDISVNALSKIVENITATNCISFT
DEEISPEGIGHTKALHISVKCKDYHVARVLVDNGSSLNIMSRSTLMKLGARREVIGDIEIPLKIGPTTFNLPFQVMDVNSSYSCLLGRPWIHSVRAVPSSLHQKVKFNVE
GGQAIVYGEEDIFVTKTSALPYVEAAEEAFECSYRSFEVANATIFPTEGLDLNRYMSRTSLMIAKMMIRNGFQINGGLEKNNQGNPEVIFLPKAKERFELGYRPKAFEWE
KVRAEKKEMRNARLEGHEAEQRRLNIPHLSEIFKDARDENINLSVDFKVSICNLEQNMEDECDVSPELLKIIEQEEKKTVPYQEPLEVINLGTLEEGTTISEENDTSIGQ
ESSTSLFPRRRCGVEKNLTISKRSQREVDS