; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007542 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007542
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:873699..881777
RNA-Seq ExpressionLag0007542
SyntenyLag0007542
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]2.6e-11539.72Show/hide
Query:  NEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLF
        N+   N   NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QM   +GQF GL  EDP  HLK F+ V+++F + G+  D LRLTLF
Subjt:  NEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLF

Query:  PFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNG
        PFSL   A AWLN+F   +I    ++ + FL KYFPP RNA +R EI+ FRQ E+E  N AWERFK+L+R CP+ G+P C+Q+E F+        + LNG
Subjt:  PFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNG

Query:  ATQDVRGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVV-----------
        A    + T+K    ++E+                               D +++++  +  I   LKN+      +   P    P+ V            
Subjt:  ATQDVRGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVV-----------

Query:  ---------------------------------NQVIRGITLILTF--------IIQVLPQQNKQALPQQNSGSSLEAMMKEF-----------MARTDA
                                         NQ  +       F          Q   QQ     P Q + S++E +MKEF           M RTDA
Subjt:  ---------------------------------NQVIRGITLILTF--------IIQVLPQQNKQALPQQNSGSSLEAMMKEF-----------MARTDA

Query:  AIQS----------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASGSVPD--VEPPY---VPP--PSYVPPLPSPQRQKPKN
         I+           N  ++R LE+Q+GQLANE++ RPQG LPS TE PRR  +   +++      VPD  VEP     V P   +   P P PQR   KN
Subjt:  AIQS----------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASGSVPD--VEPPY---VPP--PSYVPPLPSPQRQKPKN

Query:  QD-------------------------------------------GDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLM
        QD                                           G++ETV+LTE    + K+  PPK KD GSFTI   IGGK++GR LCDLGA INLM
Subjt:  QD-------------------------------------------GDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLM

Query:  PLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKA
        PLS+++KL IG+A PTTVTL LADRSIT PEGKI+DVLVKVDKFIFP DFIILD EADKDV IILGRPFLAT   LIDV+KGELT RV +++V FN+  A
Subjt:  PLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKA

Query:  MKYPDEMEDCSFIRI
        MKYPD+ E+C  I I
Subjt:  MKYPDEMEDCSFIRI

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]6.5e-11436.23Show/hide
Query:  NQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLR
        N   NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QM   +GQF GL  EDP  HLK F+ V+++F + G+  D LRLTLFPFS+ 
Subjt:  NQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLR

Query:  DGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNGATQDV
          A AWLN+F   +I  W ++ + FL KYFPP RNA +R EI+ FRQ E+E  N AWERFK+L+  CP+ G+P C+Q+E F+        + LNGA    
Subjt:  DGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNGATQDV

Query:  RGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVVNQVIRGITLI------
        + T+K    ++E+                               D +++++  +  I   LKN+      +        P+ V  Q+             
Subjt:  RGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVVNQVIRGITLI------

Query:  ----------LTFIIQV------------------------------------------------------------LPQQNKQALPQQNSGSSLEAMMK
                  + ++ Q+                                                              QQ     P Q + S++E +MK
Subjt:  ----------LTFIIQV------------------------------------------------------------LPQQNKQALPQQNSGSSLEAMMK

Query:  EFMARTDAAIQS---------------------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASG-----------------
        E + + DA ++                      N  ++R LE+Q+GQL NE++ RPQG LPS TE PRR G E  N  A  SG                 
Subjt:  EFMARTDAAIQS---------------------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASG-----------------

Query:  ------SVPD--VEP----PYVPPPSYV-PPLPSPQRQKPKNQD-------------------------------------------GDFETVSLTEECC
              +VPD  VEP    P  P  S   PP P PQR   KNQD                                           G++ETV+LTE   
Subjt:  ------SVPD--VEP----PYVPPPSYV-PPLPSPQRQKPKNQD-------------------------------------------GDFETVSLTEECC

Query:  AILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEAD
         + K+ +PPK KD GSFTIP  IGGK++GR LCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT PEGKI+DVLVKVDKFIFP DFIILD EAD
Subjt:  AILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------MENTIVETTIQD--------STNLDQKKSPPI
        KDV IILGRPFLAT   LIDV+KGELT RV +++V FN+  AMKY D+ME+C+ I I           + N  +E  +++        +T L ++K   I
Subjt:  KDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------MENTIVETTIQD--------STNLDQKKSPPI

Query:  KPSLIE
        +P  IE
Subjt:  KPSLIE

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.0e-13943.65Show/hide
Query:  QNPPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGV-------SDSFV
        +NP +  N  Q     NPI +A+DR RAIRAYA P  +ELNP I RP+IQ   FE+KP+MFQM QT+GQFHGL  EDPHLHLK FLGV       SDSF 
Subjt:  QNPPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGV-------SDSFV

Query:  IQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFP+ LRDGA +WLN+ APG+I  W+ LAENFL KYFPP RNA+ ++EIV F+Q EDET +EA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YIELNGATQ----------------------------------DVRGT-NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEP---AAVVN
        Y  LN  T+                                  DVR    +K + VLEVD +S+I   L  + N L+N+ +   Q  ++  P   AA +N
Subjt:  YIELNGATQ----------------------------------DVRGT-NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEP---AAVVN

Query:  QVI--------------------------------------------------------RGITLI-------------------LTFIIQVLPQQNK-QA
        Q                                                          +G +L                    L +  Q +  Q K   
Subjt:  QVI--------------------------------------------------------RGITLI-------------------LTFIIQVLPQQNK-QA

Query:  LPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGA--EGSNKDAGASGSVPDVEPPYVPPP---SYVPP
          Q  S +S+E+++KE+MA+ DA IQS QAS+R LE+Q+G   N  +     +  +DT+    + A  +  +KD       P ++          +Y P 
Subjt:  LPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGA--EGSNKDAGASGSVPDVEPPYVPPP---SYVPP

Query:  LPSPQRQKPKNQDGDFE-------------------------------------------TVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRE
         P PQR K K ++  FE                                            VSL EEC AILKN +P K KD GSFTIPVSIGGKELGR 
Subjt:  LPSPQRQKPKNQDGDFE-------------------------------------------TVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRE

Query:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVC
        LCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGKI+D+L++VDKFIF  DFIILDYE D DV IILGRPFL   R L+DV KG +T R+ 
Subjt:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVC

Query:  NEEVKFNVFKAMKYPDEMEDCSFI
         ++V+FN+  +MKYP  +E+CS +
Subjt:  NEEVKFNVFKAMKYPDEMEDCSFI

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]5.8e-13140.83Show/hide
Query:  QAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLRD
        Q  +PI++ +DR RAIR YA PMF+ELNPGI RP+IQA  FE+KP+MFQM QTVGQF  + +EDPHLHL+ FL +SDSF IQGV  +  RL LFPFSLRD
Subjt:  QAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLRD

Query:  GANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDVRGT------
         A +WLN+ +P S+  W++ AE FL KYFPP RNAK RSEI+ F QLEDE+ ++AWERFKELLRKCPHHG+PHCIQMETFY  LN  +Q V         
Subjt:  GANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDVRGT------

Query:  -----------------------------NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEPAAVV------------------------
                                     ++KV  VLEVD ++ +   +  + N LKN+++ + +    ++PAA +                        
Subjt:  -----------------------------NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEPAAVV------------------------

Query:  ---------------------NQVIRGITLILTFIIQVLP--QQNKQALP-------------QQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQV
                             NQ  +    +       L    Q +QA P             Q +  SSLE++M+++MA+ DA IQS  A +R LELQ+
Subjt:  ---------------------NQVIRGITLILTFIIQVLP--QQNKQALP-------------QQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQV

Query:  GQLANELKVRPQGKLPSDTEHPRRKGAE---------------------GSNKDAG-----------------------ASGSVPDVEPPYVPPPSYVPP
        G LANELK RPQG LPSDTE+PRR G E                     GS +                          A+G   + +       S  PP
Subjt:  GQLANELKVRPQGKLPSDTEHPRRKGAE---------------------GSNKDAG-----------------------ASGSVPDVEPPYVPPPSYVPP

Query:  LPSPQRQKPKNQDGDFETV------------------------------------------SLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGREL
        LP PQR + + QDG F+                                             LTE   A+LKN +PPK KD GSFTIP+SIGG       
Subjt:  LPSPQRQKPKNQDGDFETV------------------------------------------SLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGREL

Query:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCN
                       R LGIGEARPTTVTLQLADRS+ +P+GKI+DVLV+VDKFIFP DFIILDYE D++V IIL RPFLAT R LIDV+KGELT R  +
Subjt:  CDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCN

Query:  EEVKFNVFKAMKYPDEMEDCSFIRIMENTIVETTIQDSTNLDQKKSP
        E+  F VF+ ++ PD + +C  I  M+  +VE +    +   +KK P
Subjt:  EEVKFNVFKAMKYPDEMEDCSFIRIMENTIVETTIQDSTNLDQKKSP

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]5.5e-11340.68Show/hide
Query:  ENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLRDGA
        ENP       TRA+R +A P    +   I +P+I+A NFE+KP + QM QT  QF+GL S+DP+ H+  FL + D+F   GV  D +RL LFPFSLRD A
Subjt:  ENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLRDGA

Query:  NAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDV-----------
          WLNS    S+  W++LA+ FL+K+FPP + AK+R EI  F QLE E   E WER+K+LLR+CPHHGLP  +Q++ FY  LN +T+ +           
Subjt:  NAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDV-----------

Query:  -----------------------RGTNKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVV---------EPAAVVNQVIRG--ITLILTFIIQ
                               R   KK   V E+D ++ +   +  +   LK   + ++               E   V N   +      +  +  Q
Subjt:  -----------------------RGTNKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVV---------EPAAVVNQVIRG--ITLILTFIIQ

Query:  VLPQQ---------------NKQALPQQNSGSSLEAMMKE-----------------FMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTE
          P                 N Q + +  +  S    MKE                 FM  T+  +Q+  AS+R LE+QVGQLAN L  R QG LPS TE
Subjt:  VLPQQ---------------NKQALPQQNSGSSLEAMMKE-----------------FMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTE

Query:  -HPR--------RKGAE----GSNKDAGASGSVPDVEP-------PYVPPP--SYVPPLPSPQRQKPKNQDGDF--------------------------
         +P+        R G E      NK AG       VEP         +P P    +  +P PQR K    D  F                          
Subjt:  -HPR--------RKGAE----GSNKDAGASGSVPDVEP-------PYVPPP--SYVPPLPSPQRQKPKNQDGDF--------------------------

Query:  --ETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIF
          ETV+LTEEC AIL+  LPPK KD GSFTIP SIG     + LCDLGASINLMPLS+++KLG+GEARPTTVTLQLADRS+ +P G I+DVLVKV KFIF
Subjt:  --ETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIF

Query:  PVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRIMENTIVE
        P DFIILD E D ++ I+LGRPFLAT  ALIDV+KGEL  RV  EEV FNVFKA+K PD  E C  I+++++ I E
Subjt:  PVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRIMENTIVE

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase1.5e-10336.75Show/hide
Query:  ERSAWGKQNPPLEQNEQQNNQAENPILVAND--RTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDS
        ER+   ++    E  EQ+    EN I++  D      +R  A P   E    +  P++  A  +++  M +M Q   QF GLS E+P+ H+  FL + D+
Subjt:  ERSAWGKQNPPLEQNEQQNNQAENPILVAND--RTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDS

Query:  FVIQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQME
           +GV +D LRL LF FSL   A  W  S    SI  W +L E F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ 
Subjt:  FVIQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQME

Query:  TFYIELNGATQD----------------------------------VRGTNKKVKSVLEVDGVSTIRVDLVMIANALKN--VTVISH-------------
        TFY  L    +D                                   R T  K   V+EVD V+ +   +  +  ++KN  V  + H             
Subjt:  TFYIELNGATQD----------------------------------VRGTNKKVKSVLEVDGVSTIRVDLVMIANALKN--VTVISH-------------

Query:  --QQPLVVEPAAVVNQVIR--------------------------GITLILTFIIQVLPQQNKQALPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRAL
          Q P  VE    V+   +                          G  L L F  Q   QQ  Q  P Q    SLE  + +FMA       S  A+ + +
Subjt:  --QQPLVVEPAAVVNQVIR--------------------------GITLILTFIIQVLPQQNKQALPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRAL

Query:  ELQVGQLANELKVRPQGKLPSDTE-HPR------------RKGAE----------GSNKDAGASGSVPDVEPP-----YVPPPSYVPPLPSPQRQKPKNQ
        E Q+GQLAN +  RP+  LPS+TE +PR            R G E             K+  +     ++E P         PSYV  +     +  K +
Subjt:  ELQVGQLANELKVRPQGKLPSDTE-HPR------------RKGAE----------GSNKDAGASGSVPDVEPP-----YVPPPSYVPPLPSPQRQKPKNQ

Query:  DGDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKF
         GD+ETV+LTEEC AI++N LPPK KD GSFTIP +IG    GR LCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP G I+D+LVKVDKF
Subjt:  DGDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKF

Query:  IFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRIMENTIVETTIQDS----------------
        IFP DF++LD E D +V IILGRPFLAT R LIDVQKGELT RV ++++ FNVFKAMK+P+E ++C  + + +      +I +                 
Subjt:  IFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRIMENTIVETTIQDS----------------

Query:  --------TNLDQKK---------------SPPIKPSLIEARTLYLKPLPDHLKL-----SKTRPRILSS
                  LD  K               S  +KPS+ E  TL LKPLP HL       S T P I+SS
Subjt:  --------TNLDQKK---------------SPPIKPSLIEARTLYLKPLPDHLKL-----SKTRPRILSS

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.3e-11237.69Show/hide
Query:  EQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFP
        E  NN   N I +  +  RA+R Y  P+   L+  I RP I A NFE+KP   QM Q+  QF GL S+DP+ HL  FL + D+F   GV  D +RL LFP
Subjt:  EQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFP

Query:  FSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDV----
        FSLRD A +WLNS   GSI  W++LA+ FL+K+FPP + AK+R++I  F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFY  L G+ + +    
Subjt:  FSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFYIELNGATQDV----

Query:  ------------------------------RGTNKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVV-----------------EPAAVVNQV
                                      R  ++K     E+D + T+   +  ++  L  + V + Q  LVV                 E    V   
Subjt:  ------------------------------RGTNKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVV-----------------EPAAVVNQV

Query:  IR--------------------------GITLILTFIIQVLPQQNKQALPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQG
         R                          G +     +     QQ +  +P++   S LE ++ +++++TDA IQS  AS+R LE QVGQLAN +  RPQG
Subjt:  IR--------------------------GITLILTFIIQVLPQQNKQALPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQG

Query:  KLPSDTE-HPR------------RKGAEGSNKDAGAS----------------------------GSVPDVEPPYVPP----------------------
         LPSDT+ +P+             K  EG N+ A  S                            G+   + PP  PP                      
Subjt:  KLPSDTE-HPR------------RKGAEGSNKDAGAS----------------------------GSVPDVEPPYVPP----------------------

Query:  ---------------PSYVPPLPSPQRQKPKNQDGDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIG
                       PSYV  L     +K K   G+FETV LTEEC AIL+N LPPK KD GSFTIP +IG     + L DLGASINLMP S++ KLG+G
Subjt:  ---------------PSYVPPLPSPQRQKPKNQDGDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIG

Query:  EARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCS
        E +PT+VTLQLADRS  YP G I+DVLVKVDKFIFPVDF+ILD E D+ + IILGRPFLAT+ A+IDV++G+++ +V  E V+FN+F A K+P     C 
Subjt:  EARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCS

Query:  FIRIMENTIVE-----TTIQDSTNLDQKKSPPIKP
         + +++    E     T+ Q ST   +   PP++P
Subjt:  FIRIMENTIVE-----TTIQDSTNLDQKKSPPIKP

A0A6J1CPJ3 uncharacterized protein LOC1110129479.8e-11639.72Show/hide
Query:  NEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLF
        N+   N   NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QM   +GQF GL  EDP  HLK F+ V+++F + G+  D LRLTLF
Subjt:  NEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLF

Query:  PFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNG
        PFSL   A AWLN+F   +I    ++ + FL KYFPP RNA +R EI+ FRQ E+E  N AWERFK+L+R CP+ G+P C+Q+E F+        + LNG
Subjt:  PFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNG

Query:  ATQDVRGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVV-----------
        A    + T+K    ++E+                               D +++++  +  I   LKN+      +   P    P+ V            
Subjt:  ATQDVRGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVV-----------

Query:  ---------------------------------NQVIRGITLILTF--------IIQVLPQQNKQALPQQNSGSSLEAMMKEF-----------MARTDA
                                         NQ  +       F          Q   QQ     P Q + S++E +MKEF           M RTDA
Subjt:  ---------------------------------NQVIRGITLILTF--------IIQVLPQQNKQALPQQNSGSSLEAMMKEF-----------MARTDA

Query:  AIQS----------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASGSVPD--VEPPY---VPP--PSYVPPLPSPQRQKPKN
         I+           N  ++R LE+Q+GQLANE++ RPQG LPS TE PRR  +   +++      VPD  VEP     V P   +   P P PQR   KN
Subjt:  AIQS----------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASGSVPD--VEPPY---VPP--PSYVPPLPSPQRQKPKN

Query:  QD-------------------------------------------GDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLM
        QD                                           G++ETV+LTE    + K+  PPK KD GSFTI   IGGK++GR LCDLGA INLM
Subjt:  QD-------------------------------------------GDFETVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLM

Query:  PLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKA
        PLS+++KL IG+A PTTVTL LADRSIT PEGKI+DVLVKVDKFIFP DFIILD EADKDV IILGRPFLAT   LIDV+KGELT RV +++V FN+  A
Subjt:  PLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKA

Query:  MKYPDEMEDCSFIRI
        MKYPD+ E+C  I I
Subjt:  MKYPDEMEDCSFIRI

A0A6J1DY39 uncharacterized protein LOC1110256533.1e-11436.23Show/hide
Query:  NQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLR
        N   NPI VA+ R RA+R YA  + ++LN  +       A FE KP+M QM   +GQF GL  EDP  HLK F+ V+++F + G+  D LRLTLFPFS+ 
Subjt:  NQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLR

Query:  DGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNGATQDV
          A AWLN+F   +I  W ++ + FL KYFPP RNA +R EI+ FRQ E+E  N AWERFK+L+  CP+ G+P C+Q+E F+        + LNGA    
Subjt:  DGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETFY--------IELNGATQDV

Query:  RGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVVNQVIRGITLI------
        + T+K    ++E+                               D +++++  +  I   LKN+      +        P+ V  Q+             
Subjt:  RGTNKKVKSVLEV-------------------------------DGVSTIRVDLVMIANALKNV---TVISHQQPLVVEPAAVVNQVIRGITLI------

Query:  ----------LTFIIQV------------------------------------------------------------LPQQNKQALPQQNSGSSLEAMMK
                  + ++ Q+                                                              QQ     P Q + S++E +MK
Subjt:  ----------LTFIIQV------------------------------------------------------------LPQQNKQALPQQNSGSSLEAMMK

Query:  EFMARTDAAIQS---------------------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASG-----------------
        E + + DA ++                      N  ++R LE+Q+GQL NE++ RPQG LPS TE PRR G E  N  A  SG                 
Subjt:  EFMARTDAAIQS---------------------NQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASG-----------------

Query:  ------SVPD--VEP----PYVPPPSYV-PPLPSPQRQKPKNQD-------------------------------------------GDFETVSLTEECC
              +VPD  VEP    P  P  S   PP P PQR   KNQD                                           G++ETV+LTE   
Subjt:  ------SVPD--VEP----PYVPPPSYV-PPLPSPQRQKPKNQD-------------------------------------------GDFETVSLTEECC

Query:  AILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEAD
         + K+ +PPK KD GSFTIP  IGGK++GR LCDLGASINLMPLS+++K  IG+A PTTVTLQLADRSIT PEGKI+DVLVKVDKFIFP DFIILD EAD
Subjt:  AILKNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------MENTIVETTIQD--------STNLDQKKSPPI
        KDV IILGRPFLAT   LIDV+KGELT RV +++V FN+  AMKY D+ME+C+ I I           + N  +E  +++        +T L ++K   I
Subjt:  KDVSIILGRPFLATSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------MENTIVETTIQD--------STNLDQKKSPPI

Query:  KPSLIE
        +P  IE
Subjt:  KPSLIE

A0A6J1EQ90 uncharacterized protein LOC1114364119.7e-14043.65Show/hide
Query:  QNPPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGV-------SDSFV
        +NP +  N  Q     NPI +A+DR RAIRAYA P  +ELNP I RP+IQ   FE+KP+MFQM QT+GQFHGL  EDPHLHLK FLGV       SDSF 
Subjt:  QNPPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGLSSEDPHLHLKPFLGV-------SDSFV

Query:  IQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETF
         QGV +D +RL+LFP+ LRDGA +WLN+ APG+I  W+ LAENFL KYFPP RNA+ ++EIV F+Q EDET +EA ERFKE+LRKCPHHGLPHCIQMETF
Subjt:  IQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHGLPHCIQMETF

Query:  YIELNGATQ----------------------------------DVRGT-NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEP---AAVVN
        Y  LN  T+                                  DVR    +K + VLEVD +S+I   L  + N L+N+ +   Q  ++  P   AA +N
Subjt:  YIELNGATQ----------------------------------DVRGT-NKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEP---AAVVN

Query:  QVI--------------------------------------------------------RGITLI-------------------LTFIIQVLPQQNK-QA
        Q                                                          +G +L                    L +  Q +  Q K   
Subjt:  QVI--------------------------------------------------------RGITLI-------------------LTFIIQVLPQQNK-QA

Query:  LPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGA--EGSNKDAGASGSVPDVEPPYVPPP---SYVPP
          Q  S +S+E+++KE+MA+ DA IQS QAS+R LE+Q+G   N  +     +  +DT+    + A  +  +KD       P ++          +Y P 
Subjt:  LPQQNSGSSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGA--EGSNKDAGASGSVPDVEPPYVPPP---SYVPP

Query:  LPSPQRQKPKNQDGDFE-------------------------------------------TVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRE
         P PQR K K ++  FE                                            VSL EEC AILKN +P K KD GSFTIPVSIGGKELGR 
Subjt:  LPSPQRQKPKNQDGDFE-------------------------------------------TVSLTEECCAILKNGLPPKAKDLGSFTIPVSIGGKELGRE

Query:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVC
        LCDLGA+INLMPLS+Y+KLGIGEARPTTVTLQLADRSITYPEGKI+D+L++VDKFIF  DFIILDYE D DV IILGRPFL   R L+DV KG +T R+ 
Subjt:  LCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLATSRALIDVQKGELTTRVC

Query:  NEEVKFNVFKAMKYPDEMEDCSFI
         ++V+FN+  +MKYP  +E+CS +
Subjt:  NEEVKFNVFKAMKYPDEMEDCSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTGCTGGGCAACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAAAACTGCCACGTCACAGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGGTAA
GCAAAATCCGCCACTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGT
TTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTCAAGCGGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTTCAAACCGTGGGTCAATTCCATGGTTTG
TCATCTGAAGACCCTCATTTACATCTTAAGCCTTTTCTAGGAGTTAGTGATTCTTTTGTTATTCAAGGAGTGCCTAGGGATGGCCTTAGATTAACTTTGTTCCCATTTTC
TCTTAGAGATGGAGCAAATGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGATATGGGATGAGTTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAA
ATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAATGAGGCTTGGGAAAGGTTTAAGGAGCTTTTACGAAAGTGCCCCCACCATGGT
TTACCTCATTGTATCCAAATGGAAACATTTTACATTGAGTTAAACGGAGCAACCCAAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGT
GTCCACCATTAGGGTTGATCTTGTAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACTAGTTGTGGAGCCTGCTGCAGTGGTGAACCAAG
TAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCAATGATG
AAAGAATTTATGGCTCGTACAGATGCCGCCATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGTCAGGCCTCA
AGGGAAACTTCCCTCAGATACTGAACACCCTCGAAGGAAGGGTGCTGAAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGC
CTCCCCCATCTTATGTACCACCTCTACCTTCTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTGATTTTGAAACTGTATCTCTTACTGAGGAATGTTGCGCTATTCTT
AAGAATGGGCTACCACCCAAGGCTAAGGATCTAGGATCATTTACCATACCTGTATCTATAGGTGGAAAAGAGTTAGGTAGAGAACTTTGTGATTTAGGTGCAAGCATTAA
CCTTATGCCTCTTTCGGTCTATCGGAAGCTAGGTATTGGTGAAGCTAGACCTACCACAGTCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTG
ATGATGTCTTAGTGAAGGTAGATAAATTTATATTTCCTGTTGATTTCATTATTTTAGACTATGAGGCTGATAAAGATGTCTCAATTATTCTAGGTCGTCCATTCTTGGCT
ACTAGTAGGGCGTTGATAGATGTTCAGAAAGGGGAATTAACAACGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAGATGGA
AGATTGCTCTTTCATTAGGATTATGGAGAACACAATTGTTGAGACAACAATACAGGATTCAACTAATTTAGATCAAAAGAAGTCTCCTCCTATTAAGCCATCCCTGATTG
AGGCACGTACTTTATATTTGAAGCCCTTGCCGGATCATTTAAAACTCTCAAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTTCCACAAGTCTTT
TGTCTCCCAGACTTAGAGACATACCGGTGTAACCTCTGTGGTTATTGTGTCATTCAAAGAAGAAAATTCCAGCGATCAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTCGT
TGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATACAAGACTCATCAATGTCCTGATTAAAGCCAAAAGATCCGATCACCGTATCAAATCTTATATTCCAA
GATCGAAATTCTTAGGTCAATCCATTAATGGATCACTTAAGCTTGCAAACCCTTTTGCTTTTGATCTTGTTGTGTTACCCATCAAGCAGATGTATAAGATCCCAGATGGA
ATTGAAATACAAAGATTTCATTTCCAGATTCCAGAGGTCGACCCCAACAATGCTACCGTAGGGAAAGTCAAAGTTGGGAATGGACCTAAGAAACCCTATCCATTTCTGGA
GTTTGCGTTGTTTCGAATCAATGATCAAGCCCTCCGTAGGGTGATCGCTTCCGGGGCGACTCGCAGGCAGATCATAGGAATCTCATGGTGCAACCTAATGGAGGAGACCG
TAGGATACGTTGACACGTGTCCTGCTCCCACTTACTACAAAATACTCCCTCATTCACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTGCTGGGCAACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAAAACTGCCACGTCACAGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGGTAA
GCAAAATCCGCCACTGGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGT
TTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTCAAGCGGCAAATTTTGAAATGAAACCGATAATGTTTCAGATGTTTCAAACCGTGGGTCAATTCCATGGTTTG
TCATCTGAAGACCCTCATTTACATCTTAAGCCTTTTCTAGGAGTTAGTGATTCTTTTGTTATTCAAGGAGTGCCTAGGGATGGCCTTAGATTAACTTTGTTCCCATTTTC
TCTTAGAGATGGAGCAAATGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGATATGGGATGAGTTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAA
ATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAATGAGGCTTGGGAAAGGTTTAAGGAGCTTTTACGAAAGTGCCCCCACCATGGT
TTACCTCATTGTATCCAAATGGAAACATTTTACATTGAGTTAAACGGAGCAACCCAAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGT
GTCCACCATTAGGGTTGATCTTGTAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACTAGTTGTGGAGCCTGCTGCAGTGGTGAACCAAG
TAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCAATGATG
AAAGAATTTATGGCTCGTACAGATGCCGCCATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGTCAGGCCTCA
AGGGAAACTTCCCTCAGATACTGAACACCCTCGAAGGAAGGGTGCTGAAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGC
CTCCCCCATCTTATGTACCACCTCTACCTTCTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTGATTTTGAAACTGTATCTCTTACTGAGGAATGTTGCGCTATTCTT
AAGAATGGGCTACCACCCAAGGCTAAGGATCTAGGATCATTTACCATACCTGTATCTATAGGTGGAAAAGAGTTAGGTAGAGAACTTTGTGATTTAGGTGCAAGCATTAA
CCTTATGCCTCTTTCGGTCTATCGGAAGCTAGGTATTGGTGAAGCTAGACCTACCACAGTCACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTG
ATGATGTCTTAGTGAAGGTAGATAAATTTATATTTCCTGTTGATTTCATTATTTTAGACTATGAGGCTGATAAAGATGTCTCAATTATTCTAGGTCGTCCATTCTTGGCT
ACTAGTAGGGCGTTGATAGATGTTCAGAAAGGGGAATTAACAACGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAGATGGA
AGATTGCTCTTTCATTAGGATTATGGAGAACACAATTGTTGAGACAACAATACAGGATTCAACTAATTTAGATCAAAAGAAGTCTCCTCCTATTAAGCCATCCCTGATTG
AGGCACGTACTTTATATTTGAAGCCCTTGCCGGATCATTTAAAACTCTCAAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTTCCACAAGTCTTT
TGTCTCCCAGACTTAGAGACATACCGGTGTAACCTCTGTGGTTATTGTGTCATTCAAAGAAGAAAATTCCAGCGATCAAGAGGCGAGGAGGCTGCTGCGTTTTCGTTCGT
TGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGATACAAGACTCATCAATGTCCTGATTAAAGCCAAAAGATCCGATCACCGTATCAAATCTTATATTCCAA
GATCGAAATTCTTAGGTCAATCCATTAATGGATCACTTAAGCTTGCAAACCCTTTTGCTTTTGATCTTGTTGTGTTACCCATCAAGCAGATGTATAAGATCCCAGATGGA
ATTGAAATACAAAGATTTCATTTCCAGATTCCAGAGGTCGACCCCAACAATGCTACCGTAGGGAAAGTCAAAGTTGGGAATGGACCTAAGAAACCCTATCCATTTCTGGA
GTTTGCGTTGTTTCGAATCAATGATCAAGCCCTCCGTAGGGTGATCGCTTCCGGGGCGACTCGCAGGCAGATCATAGGAATCTCATGGTGCAACCTAATGGAGGAGACCG
TAGGATACGTTGACACGTGTCCTGCTCCCACTTACTACAAAATACTCCCTCATTCACCTTGA
Protein sequenceShow/hide protein sequence
MCCWATGGSKFCAAAKLGAKLPRHSVSLVHERSAWGKQNPPLEQNEQQNNQAENPILVANDRTRAIRAYAFPMFDELNPGIARPQIQAANFEMKPIMFQMFQTVGQFHGL
SSEDPHLHLKPFLGVSDSFVIQGVPRDGLRLTLFPFSLRDGANAWLNSFAPGSIRIWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFNEAWERFKELLRKCPHHG
LPHCIQMETFYIELNGATQDVRGTNKKVKSVLEVDGVSTIRVDLVMIANALKNVTVISHQQPLVVEPAAVVNQVIRGITLILTFIIQVLPQQNKQALPQQNSGSSLEAMM
KEFMARTDAAIQSNQASMRALELQVGQLANELKVRPQGKLPSDTEHPRRKGAEGSNKDAGASGSVPDVEPPYVPPPSYVPPLPSPQRQKPKNQDGDFETVSLTEECCAIL
KNGLPPKAKDLGSFTIPVSIGGKELGRELCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIDDVLVKVDKFIFPVDFIILDYEADKDVSIILGRPFLA
TSRALIDVQKGELTTRVCNEEVKFNVFKAMKYPDEMEDCSFIRIMENTIVETTIQDSTNLDQKKSPPIKPSLIEARTLYLKPLPDHLKLSKTRPRILSSLPLRERLPQVF
CLPDLETYRCNLCGYCVIQRRKFQRSRGEEAAAFSFVGASLAKNGQVYNEDTRLINVLIKAKRSDHRIKSYIPRSKFLGQSINGSLKLANPFAFDLVVLPIKQMYKIPDG
IEIQRFHFQIPEVDPNNATVGKVKVGNGPKKPYPFLEFALFRINDQALRRVIASGATRRQIIGISWCNLMEETVGYVDTCPAPTYYKILPHSP