; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021950 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021950
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:14465919..14472212
RNA-Seq ExpressionLag0021950
SyntenyLag0021950
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]1.6e-10441.55Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R IR Y +P  +ELN  II P I+AT FE+K VMF MLQ I QF G+ +EDPHLHL+ F+ +SDSF  QGV  DALRL LFPYS+ D  + WL+S   G+
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        +++WN+L E FL+KYFPP  NAKLR+EI  F+Q ++E+  +AWERFKELLRKCP+H + HCIQ++ FYNGLN  T+ +VDASA GALL+K++N+AYEI E
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRG-SSKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQ----------------------VVYYG-------
         I+T + QWS  R  + KKV  +  VDS+++++A +A++ + LKN+++ +N  +   + S +   +                      V Y G       
Subjt:  RISTYSCQWSDVRG-SSKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQ----------------------VVYYG-------

Query:  --------WRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYM-------ARTDAAIQSNQVSMRALKLHMGQ
                WR HPNF+W  QG+N        K     GF  SQ  PQ N           SLE  +KEY+       ++T+A +QS   S+R L+  +GQ
Subjt:  --------WRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYM-------ARTDAAIQSNQVSMRALKLHMGQ

Query:  LSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQE-VEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPP---
        L+NEL+ RPHG +PSDTE P+  G +  +A+TL+SGK L      +K+ + VE   N      +E+   K    D      SS   +++P P  P     
Subjt:  LSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQE-VEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPP---

Query:  -----------------PYNSPL-----PFP-----QKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRAL
                           N PL       P      KDILTKK+RL E ETV LT+ECS  L++ LP+K++DP SFTIP +IG    G AL
Subjt:  -----------------PYNSPL-----PFP-----QKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRAL

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]1.2e-10737.32Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R +R YV P+V  L+  I  P I A NFE+K     M+Q   QF GL S+DP+ HL  FL + D+F   GV+ DA+RL LFP+SL D  K+WL+S   G+
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        I++W +L + FL K+FPP + AK+R++I  F Q + E+  EAWERFKELLR+CP+H +P  +Q++ FYNGL  + + ++DA+AGGAL++K   +AY + E
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIATLANTLKNINV--------------VSNVYQ--PPVVESAVVV-----------NQVVYYGWRN
         +++ + QW   R  S+K      +D++ T+   +A L+  L  + V               S+ Y   P   ES   V           +     GWRN
Subjt:  RISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIATLANTLKNINV--------------VSNVYQ--PPVVESAVVV-----------NQVVYYGWRN

Query:  HPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTE
        HPNF+W    SN   P +  K +   GF + QA PQ     +P++ S+  LE  + +Y+++TDA IQS   S+R L+  +GQL+N +  RP G++PSDT+
Subjt:  HPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTE

Query:  -HPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPL---------------
         +P+  G +Q QA+TLRSGK +E   + +   E+E   +K  + E E+E  + D    ++ G S     + PPP  P       L               
Subjt:  -HPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPL---------------

Query:  ---PFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPT
           PF +            KDIL+KK++L E ETV LTEECS IL+N LP K++DP SFTIP +IG     +AL D GASINLMP S + KLG+GE +PT
Subjt:  ---PFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPT

Query:  TVTLQLAYRSIIYPEGKIENVLVQVH-----------------------------------------------DQKVKFNVFDAMKYPNDIEDCSCIQVL
        +VTLQLA RS +YP G IE+VLV+V                                                ++ V+FN+F+A K+P+    C  ++++
Subjt:  TVTLQLAYRSIIYPEGKIENVLVQVH-----------------------------------------------DQKVKFNVFDAMKYPNDIEDCSCIQVL

Query:  DE
        DE
Subjt:  DE

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]8.6e-13843.55Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGV-------SDSFVIQGVSRDALRLTLFPYSLTDGTKAWL
        R IRAY  P V+ELNP II P I+ T FE+K VMF MLQ I QFHGL  EDPHLHLK FLGV       SDSF  QGV +D +RL+LFPY L DG K+WL
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGV-------SDSFVIQGVSRDALRLTLFPYSLTDGTKAWL

Query:  SSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFN
        ++ APGTI SWN L ENFL KYFPP RNA+ ++EIV F+Q E+ET SEA ERFKE+LRKCP+H LPHCIQ++ FYNGLNI T+ +VDASA GA+L+KT+N
Subjt:  SSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFN

Query:  EAYEIFERISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQV-----------------------VYY
        EAYEI ERI++ +CQW+DVR +  +K + V+ VD++S+I A +A++ N L+N+ +  +      V +A  +NQ                        ++Y
Subjt:  EAYEIFERISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQV-----------------------VYY

Query:  ---------------------GWRNHPNFAWGGQG-SNLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSN
                             GWRNHPNF+W GQ   N Q P         ++ NQ  ++  Q   Q       Q  SE+S+E+ +KEYMA+ DA IQS 
Subjt:  ---------------------GWRNHPNFAWGGQG-SNLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSN

Query:  QVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNV
        Q S+R L++ +G   N  +   H    +DT+    E      AV     K   E +E  K Q       ++          +  +   ++A    F+ ++
Subjt:  QVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNV

Query:  EPPPYVPPPPYNSPLPFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIG
            ++  P   +    P      KD+L  +++  E + V+L EECS ILKN +P K +DP SFTIP+SIGGKELGRALCD GA+INLMPLS Y+KLGIG
Subjt:  EPPPYVPPPPYNSPLPFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIG

Query:  EVRPTTVTLQLAYRSIIYPEGKIENVLVQV-----------------HD------------------------------QKVKFNVFDAMKYPNDIEDCS
        E RPTTVTLQLA RSI YPEGKIE++L+QV                 HD                              QKV+FN+ D+MKYP  IE+CS
Subjt:  EVRPTTVTLQLAYRSIIYPEGKIENVLVQV-----------------HD------------------------------QKVKFNVFDAMKYPNDIEDCS

Query:  CIQVL
         +  L
Subjt:  CIQVL

XP_024025307.1 uncharacterized protein LOC112092710 [Morus notabilis]3.9e-10643.66Show/hide
Query:  QFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEA
        QF  +++EDPH+HL+ F+ VSD+F + GV++D LRL LFPYSL D  + WL+S  P +++SW+EL E FL KYFPP +NAKLR++I  F+Q + E+  EA
Subjt:  QFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEA

Query:  WERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFERISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIA------
        W+RFKELLRKCP+H +PH IQ++ FYNGLN  T+ +VDA+AGGA+LAK++N+AYEI ER++  + QW   R +++K   +  VD+++ + A ++      
Subjt:  WERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFERISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIA------

Query:  ---------------TLANTLKNINVVSNVYQPPVVESAVVVNQVVYYGWRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSE
                         A  + N N  ++ Y  P        NQ    G R HPN +W    SNL A  +Q    +++ +       QQ  QA+ Q    
Subjt:  ---------------TLANTLKNINVVSNVYQPPVVESAVVVNQVVYYGWRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSE

Query:  SSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELE
        SSLE+ +KE+++R++A IQS+  ++R L+  +G +SN L  RP G++ S+T+ PRR+G +  +A+TLR GK LE + +P      E  S +    +K L 
Subjt:  SSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELE

Query:  SGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPLPFPQKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGA
          +  R  +K    +  +P VE    +P     +   F  KD++TKK+R  E ETV LTEECS IL+N LP K++DP SFTIP SIG +  G+ALCD GA
Subjt:  SGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPLPFPQKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGA

Query:  SINLMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPE
        SINLMP+S ++KLGIGE  PTTV+LQLA RSI  PE
Subjt:  SINLMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPE

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.7e-11742.61Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R IR Y +P+ +ELNP I+ P I+A  FE+K VMF MLQ + QF  + +EDPHLHL+ FL +SDSF IQGVS +  RL LFP+SL D  ++WL++ +P +
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        +++WN+  E FL KYFPP RNAK RSEI+ F Q E+E+ S+AWERFKELLRKCP+H +PHCIQ++ FYNGLN T+Q ++DASA GA+L+K++NEA+EI E
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRG-SSKKVKSVIAVDSVSTIRADIATLANTLKNINV--VSNVYQPPVVESAVV-----------------VNQVVYYG----------
         I++ + QWS+ R   S+KV  V+ VD+++ +   +A++ N LKN+++    N+     ++S  V                    V Y G          
Subjt:  RISTYSCQWSDVRG-SSKKVKSVIAVDSVSTIRADIATLANTLKNINV--VSNVYQPPVVESAVV-----------------VNQVVYYG----------

Query:  --------WRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKA
                W+NHPN +WG +       Q +Q      GF++    PQ  + + P     SSLE+ M++YMA+ DA IQS    +R L+L +G L+NELKA
Subjt:  --------WRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKA

Query:  RPHGNIPSDTEHPRREGTKQVQAVTLRSGKSL----EERK---EPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPY--
        RP G++PSDTE+PRR+G +Q +++ LRSGK L    EE K   EP+  Q  EK S K +    +        G   ++  S+ V +   PP   P  +  
Subjt:  RPHGNIPSDTEHPRREGTKQVQAVTLRSGKSL----EERK---EPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPY--

Query:  -------------------NSPL-----PFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASIN
                           N PL       P      KDILTKK+RL E E+  LTE    +LKN +P K++DP SFTIPISIGG               
Subjt:  -------------------NSPL-----PFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASIN

Query:  LMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPEGKIENVLVQV
               R LGIGE RPTTVTLQLA RS+ +P+GKIE+VLVQV
Subjt:  LMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPEGKIENVLVQV

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.9e-10837.32Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R +R YV P+V  L+  I  P I A NFE+K     M+Q   QF GL S+DP+ HL  FL + D+F   GV+ DA+RL LFP+SL D  K+WL+S   G+
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        I++W +L + FL K+FPP + AK+R++I  F Q + E+  EAWERFKELLR+CP+H +P  +Q++ FYNGL  + + ++DA+AGGAL++K   +AY + E
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIATLANTLKNINV--------------VSNVYQ--PPVVESAVVV-----------NQVVYYGWRN
         +++ + QW   R  S+K      +D++ T+   +A L+  L  + V               S+ Y   P   ES   V           +     GWRN
Subjt:  RISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIATLANTLKNINV--------------VSNVYQ--PPVVESAVVV-----------NQVVYYGWRN

Query:  HPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTE
        HPNF+W    SN   P +  K +   GF + QA PQ     +P++ S+  LE  + +Y+++TDA IQS   S+R L+  +GQL+N +  RP G++PSDT+
Subjt:  HPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRALKLHMGQLSNELKARPHGNIPSDTE

Query:  -HPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPL---------------
         +P+  G +Q QA+TLRSGK +E   + +   E+E   +K  + E E+E  + D    ++ G S     + PPP  P       L               
Subjt:  -HPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPL---------------

Query:  ---PFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPT
           PF +            KDIL+KK++L E ETV LTEECS IL+N LP K++DP SFTIP +IG     +AL D GASINLMP S + KLG+GE +PT
Subjt:  ---PFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPT

Query:  TVTLQLAYRSIIYPEGKIENVLVQVH-----------------------------------------------DQKVKFNVFDAMKYPNDIEDCSCIQVL
        +VTLQLA RS +YP G IE+VLV+V                                                ++ V+FN+F+A K+P+    C  ++++
Subjt:  TVTLQLAYRSIIYPEGKIENVLVQVH-----------------------------------------------DQKVKFNVFDAMKYPNDIEDCSCIQVL

Query:  DE
        DE
Subjt:  DE

A0A6J1CPJ3 uncharacterized protein LOC1110129476.3e-10236.96Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R +R Y + I+++LN  +         FE K +M  ML  I QF GL  EDP  HLK F+ V+++F + G+S DALRLTLFP+SL+    AWL++F   T
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        I + +++V+ FL KYFPP RNA +R EI+ FRQ E E  + AWERFK+L+R CP   +P C+QI+ F+   +I T  M++ +A G   +K+FNE  EI +
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRGSSKKVKS----VIAVDSVSTIRADIATLANTLKNINV-------------VSNVYQPPVVESAVVVNQ--------VVYYGWRNHP
        ++S ++ QW   +  ++  ++    V+A+D++++++  I T+   LKN+                S VYQ  + ES   +NQ        +   GW+ HP
Subjt:  RISTYSCQWSDVRGSSKKVKS----VIAVDSVSTIRADIATLANTLKNINV-------------VSNVYQPPVVESAVVVNQ--------VVYYGWRNHP

Query:  NFAWGGQGSNLQAPQAQQ--KVVNQSGFAKSQALP-------QQNKQALPQQNSESSLE-----------AKMKEYMARTDAAIQS----------NQVS
        NF+W GQGS+    Q QQ  +      F  S A P       QQ     P Q + S++E           A MKE M RTDA I+           N V+
Subjt:  NFAWGGQGSNLQAPQAQQ--KVVNQSGFAKSQALP-------QQNKQALPQQNSESSLE-----------AKMKEYMARTDAAIQS----------NQVS

Query:  MRALKLHMGQLSNELKARPHGNIPSDTEHPRR------EGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSN----KNSVVEKELESGKNDRGDIKDAGA
        +R L++ +GQL+NE++ RP G++PS TE PRR         K  Q V     K +E     S   +V  C +       +V K  ++      DI     
Subjt:  MRALKLHMGQLSNELKARPHGNIPSDTEHPRR------EGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSN----KNSVVEKELESGKNDRGDIKDAGA

Query:  SSFVPNVEPPPYVPPPPYNSPLPFPQKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLG
           +P VE    +  P Y   L    KDI+T+KK+L E ETV LTE  S + K+  P K++DP SFTI   IGGK++GRALCD GA INLMPLS ++KL 
Subjt:  SSFVPNVEPPPYVPPPPYNSPLPFPQKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLG

Query:  IGEVRPTTVTLQLAYRSIIYPEGKIENVLV-----------------------------------------------QVHDQKVKFNVFDAMKYPNDIED
        IG+  PTTVTL LA RSI  PEGKIE+VLV                                               +V DQKV FN+ DAMKYP+D E+
Subjt:  IGEVRPTTVTLQLAYRSIIYPEGKIENVLV-----------------------------------------------QVHDQKVKFNVFDAMKYPNDIED

Query:  CSCIQV--------LDEMVEDHFEKELMEYHTQKFGEIQIEDLE
        C  I +        LD+++    E EL E   +K G I I  L+
Subjt:  CSCIQV--------LDEMVEDHFEKELMEYHTQKFGEIQIEDLE

A0A6J1DY39 uncharacterized protein LOC1110256533.9e-10435.1Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R +R Y + I+++LN  +I        FE K +M  ML  I QF GL  EDP  HLK F+ V+++F + G+S DALRLTLFP+S++    AWL++F   T
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        I++W+++V+ FL KYFPP RNA +R EI+ FRQ E E  + AWERFK+L+  CP   +P C+QI+ F+ G +I T+ M++ +A G   +K+FNE  EI +
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRGSSKKVKS----VIAVDSVSTIRADIATLANTLKNINV-------------VSNVYQ------------------PPVVESAVVVNQ
        ++S ++ QW   +  ++  ++    V+A+D++++++  I T+   LKN+                S VYQ                  P    S   V Q
Subjt:  RISTYSCQWSDVRGSSKKVKS----VIAVDSVSTIRADIATLANTLKNINV-------------VSNVYQ------------------PPVVESAVVVNQ

Query:  VVYY-----------GWRNHPNFAWGGQGSNLQAPQAQQ--KVVNQSGFAKSQALP-------QQNKQALPQQNSESSLE-----------AKMKEYMAR
        +              GW+ HPNF+W GQGS+      QQ  +     GF  S A P       QQ     P Q + S++E           A MKE M R
Subjt:  VVYY-----------GWRNHPNFAWGGQGSNLQAPQAQQ--KVVNQSGFAKSQALP-------QQNKQALPQQNSESSLE-----------AKMKEYMAR

Query:  TDAAIQS----------NQVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEP--SKHQEVEKCSNKNSVVEKELES
        TD  ++           N V++R L++ +GQL NE++ RP G++PS TE PRR G +   ++  RSG   E  + P  S H    +  +  +V +K +E 
Subjt:  TDAAIQS----------NQVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEP--SKHQEVEKCSNKNSVVEKELES

Query:  GKNDRGDIKDAGASSFVPNVEPPPYVPP------------------PPYNSPLPFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLP
          +          +  V N  PPP  P                      +  +PF +            KDI+T+KK+L E ETV LTE  S + K+ +P
Subjt:  GKNDRGDIKDAGASSFVPNVEPPPYVPP------------------PPYNSPLPFPQ------------KDILTKKKRLRESETVTLTEECSVILKNGLP

Query:  SKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPEGKIENVLV---------------------------
         K++DP SFTIP  IGGK++GRALCD GASINLMPLS ++K  IG+  PTTVTLQLA RSI  PEGKIE+VLV                           
Subjt:  SKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPEGKIENVLV---------------------------

Query:  --------------------QVHDQKVKFNVFDAMKYPNDIEDCSCIQV--------LDEMVEDHFEKELME
                            +V DQKV FN+ DAMKY +D+E+C+ I +        LD+++    E EL E
Subjt:  --------------------QVHDQKVKFNVFDAMKYPNDIEDCSCIQV--------LDEMVEDHFEKELME

A0A6J1EQ90 uncharacterized protein LOC1114364114.2e-13843.55Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGV-------SDSFVIQGVSRDALRLTLFPYSLTDGTKAWL
        R IRAY  P V+ELNP II P I+ T FE+K VMF MLQ I QFHGL  EDPHLHLK FLGV       SDSF  QGV +D +RL+LFPY L DG K+WL
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGV-------SDSFVIQGVSRDALRLTLFPYSLTDGTKAWL

Query:  SSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFN
        ++ APGTI SWN L ENFL KYFPP RNA+ ++EIV F+Q E+ET SEA ERFKE+LRKCP+H LPHCIQ++ FYNGLNI T+ +VDASA GA+L+KT+N
Subjt:  SSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFN

Query:  EAYEIFERISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQV-----------------------VYY
        EAYEI ERI++ +CQW+DVR +  +K + V+ VD++S+I A +A++ N L+N+ +  +      V +A  +NQ                        ++Y
Subjt:  EAYEIFERISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINVVSNVYQPPVVESAVVVNQV-----------------------VYY

Query:  ---------------------GWRNHPNFAWGGQG-SNLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSN
                             GWRNHPNF+W GQ   N Q P         ++ NQ  ++  Q   Q       Q  SE+S+E+ +KEYMA+ DA IQS 
Subjt:  ---------------------GWRNHPNFAWGGQG-SNLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSN

Query:  QVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNV
        Q S+R L++ +G   N  +   H    +DT+    E      AV     K   E +E  K Q       ++          +  +   ++A    F+ ++
Subjt:  QVSMRALKLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNV

Query:  EPPPYVPPPPYNSPLPFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIG
            ++  P   +    P      KD+L  +++  E + V+L EECS ILKN +P K +DP SFTIP+SIGGKELGRALCD GA+INLMPLS Y+KLGIG
Subjt:  EPPPYVPPPPYNSPLPFPQ-----KDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIG

Query:  EVRPTTVTLQLAYRSIIYPEGKIENVLVQV-----------------HD------------------------------QKVKFNVFDAMKYPNDIEDCS
        E RPTTVTLQLA RSI YPEGKIE++L+QV                 HD                              QKV+FN+ D+MKYP  IE+CS
Subjt:  EVRPTTVTLQLAYRSIIYPEGKIENVLVQV-----------------HD------------------------------QKVKFNVFDAMKYPNDIEDCS

Query:  CIQVL
         +  L
Subjt:  CIQVL

A0A6J1G7Q6 uncharacterized protein LOC1114515989.1e-10150Show/hide
Query:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT
        R IRAY  P V+ELNP II P ++AT FE+K VMF MLQ I QFHGLSS+DPHLHLK FLGVSDSF  QGV +D +RL+ F YSL DG K+WL+  A G 
Subjt:  RVIRAYVSPIVDELNPVIICPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGT

Query:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE
        I SWN L E FL KYFPP R+A+ R+EIV F++ E ET SEAWERFKE LRKCP+H LPHCIQI+ FYNGLN  T+ +VDASA G +L+KT+NEAYEI E
Subjt:  ISSWNELVENFLTKYFPPIRNAKLRSEIVRFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFE

Query:  RISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINV-----------VSNVYQPPVVESAVVVNQ------------VVYY-------
        RI++ +CQW DVR +  KK + V+ VD++S+I A +A++ N L+N+              + V      ES V   +             ++Y       
Subjt:  RISTYSCQWSDVRGS-SKKVKSVIAVDSVSTIRADIATLANTLKNINV-----------VSNVYQPPVVESAVVVNQ------------VVYY-------

Query:  --------------GWRNHPNFAWGGQGS-NLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRAL
                      GWRNHPNF   GQGS N Q P          + NQ  +   QA  Q    +  Q  S + LE+ +KEYMAR DA IQS QVS+R L
Subjt:  --------------GWRNHPNFAWGGQGS-NLQAPQAQQ-----KVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRAL

Query:  KLHMGQLSNELKARPHGNIPSDTEHPRREG
        ++ +GQL+NEL+ RP G +P+DTE P+REG
Subjt:  KLHMGQLSNELKARPHGNIPSDTEHPRREG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAAGTATGGTTCGACCTCATGCCAAGGTCGAGGCCGACCACCACAGCTCCGAGGTCGAGCTTTGTCCTCAAAATGCCAGGACCACACTGACGCACAAGGAGCTGA
CGAGGACAACCTGACACAGGTAGGACAAGAAAACCGACCCAGAGGAAGACCAGACCAAGGGTCGGGCCATGATGGTCGGCCTCGGGGCCCGCTTGCGTGGGTCGAGCATG
GTCACCTTCTTTCGGTCTCTGATGCCCCTGATCGCCCCGATTCTGCCTGGGGTAGAGTCATTAGAGCGTATGTTTCTCCAATAGTTGACGAGTTAAATCCAGTGATAATA
TGTCCAGTTATTGAGGCAACAAACTTTGAAATGAAAACGGTCATGTTTCACATGTTACAGATGATTCGCCAATTTCATGGCTTATCATCTGAGGACCCTCATTTACATCT
TAAGTATTTTCTAGGAGTTAGTGATTCATTTGTTATCCAGGGAGTGTCTAGAGATGCCCTTAGGTTAACCCTATTTCCTTATTCCCTTACAGATGGAACAAAGGCATGGT
TGAGTTCTTTTGCTCCAGGGACGATAAGTAGTTGGAATGAGTTAGTGGAGAATTTTCTTACTAAGTATTTTCCACCAATTAGAAATGCCAAGTTAAGGAGTGAGATAGTG
AGATTTAGGCAGAATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCTACCACTATTTACCACATTGTATTCAAATAAAAAT
ATTTTACAATGGGTTGAACATAACAACCCAGTGTATGGTTGATGCTTCTGCGGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCTTATGAGATTTTTGAGAGAATAT
CAACCTACAGTTGTCAATGGTCAGATGTTAGAGGCTCTAGTAAGAAAGTTAAGAGTGTGATAGCAGTTGATAGTGTGTCTACCATTAGGGCCGATATTGCAACATTAGCT
AACACTCTTAAAAACATAAATGTGGTAAGCAATGTTTACCAACCGCCAGTGGTGGAGTCTGCTGTAGTTGTAAACCAAGTGGTCTATTATGGTTGGCGCAATCACCCCAA
CTTTGCGTGGGGAGGACAAGGAAGCAATTTGCAAGCCCCCCAAGCACAGCAAAAGGTGGTGAACCAGTCCGGATTTGCTAAATCGCAGGCATTGCCCCAGCAAAATAAGC
AGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAAAGATGAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGTTTCAATGAGAGCCTTG
AAATTGCATATGGGTCAGCTATCTAATGAGCTAAAGGCACGACCTCACGGGAACATTCCTTCAGATACTGAACACCCTAGAAGGGAGGGTACGAAGCAGGTGCAGGCAGT
AACTCTAAGGAGCGGTAAGTCACTAGAAGAGAGGAAAGAGCCTAGTAAACATCAAGAGGTAGAGAAATGTAGCAATAAAAATAGTGTTGTTGAGAAAGAGTTGGAGTCTG
GTAAAAATGATAGAGGCGACATTAAAGATGCTGGAGCATCTAGTTTTGTTCCAAATGTAGAACCACCACCTTATGTACCGCCTCCACCCTATAACTCACCCCTACCTTTT
CCACAAAAGGACATCTTGACTAAAAAGAAGAGATTAAGAGAGTCTGAAACCGTGACTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCTTCCAAGGTTGA
GGATCCAGAATCATTCACTATTCCTATATCCATAGGTGGAAAAGAGTTGGGGAGAGCACTTTGTGATTCAGGCGCAAGCATTAACCTTATGCCTCTTTCGGCTTATCGAA
AGCTAGGTATTGGTGAAGTTAGGCCTACCACAGTTACACTCCAATTAGCATACAGATCTATCATATATCCTGAGGGTAAGATTGAGAATGTTCTAGTCCAGGTGCATGAC
CAAAAGGTGAAGTTCAATGTGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTGTTGGATGAAATGGTTGAGGACCACTTTGAGAAGGA
ATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTAGATTGAAGTATGAGCATAAAGATACAGGTAAGATTTCTAGTTTTAAGA
GGAATTTTGAATCCTTAGAGTCAATAGATGAGAAATCCAAGCCTATTGAACCTTGTAATTCATTGACATTGCTTCAGCAACCTGAGATTAGGAACTCCTTCATTGATGAG
TGGTTATTTACTGTAGCTCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCGTGGTTTATCAAAAGAACAAAT
GAAAGAATATTTCCATGGGGTGAAGTTTTATTTATCTAATGATGCATCTATGGTTAGACAATGTGTTGATGAGGATGGTTGTGAGTTGAAAGTAACAGGGCAGCAAATAA
AGCATTATTGGAGAGAGGAGTTTCGTTCCAAATATCTCTCTTTGATGTTGCTGGCAGATTTAGGGAGCACAAGTTTTGAGGGAGCAAAAATTTCAGCATTCGAATCAGAA
GCTCCTATTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAAAGTATGGTTCGACCTCATGCCAAGGTCGAGGCCGACCACCACAGCTCCGAGGTCGAGCTTTGTCCTCAAAATGCCAGGACCACACTGACGCACAAGGAGCTGA
CGAGGACAACCTGACACAGGTAGGACAAGAAAACCGACCCAGAGGAAGACCAGACCAAGGGTCGGGCCATGATGGTCGGCCTCGGGGCCCGCTTGCGTGGGTCGAGCATG
GTCACCTTCTTTCGGTCTCTGATGCCCCTGATCGCCCCGATTCTGCCTGGGGTAGAGTCATTAGAGCGTATGTTTCTCCAATAGTTGACGAGTTAAATCCAGTGATAATA
TGTCCAGTTATTGAGGCAACAAACTTTGAAATGAAAACGGTCATGTTTCACATGTTACAGATGATTCGCCAATTTCATGGCTTATCATCTGAGGACCCTCATTTACATCT
TAAGTATTTTCTAGGAGTTAGTGATTCATTTGTTATCCAGGGAGTGTCTAGAGATGCCCTTAGGTTAACCCTATTTCCTTATTCCCTTACAGATGGAACAAAGGCATGGT
TGAGTTCTTTTGCTCCAGGGACGATAAGTAGTTGGAATGAGTTAGTGGAGAATTTTCTTACTAAGTATTTTCCACCAATTAGAAATGCCAAGTTAAGGAGTGAGATAGTG
AGATTTAGGCAGAATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCTACCACTATTTACCACATTGTATTCAAATAAAAAT
ATTTTACAATGGGTTGAACATAACAACCCAGTGTATGGTTGATGCTTCTGCGGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCTTATGAGATTTTTGAGAGAATAT
CAACCTACAGTTGTCAATGGTCAGATGTTAGAGGCTCTAGTAAGAAAGTTAAGAGTGTGATAGCAGTTGATAGTGTGTCTACCATTAGGGCCGATATTGCAACATTAGCT
AACACTCTTAAAAACATAAATGTGGTAAGCAATGTTTACCAACCGCCAGTGGTGGAGTCTGCTGTAGTTGTAAACCAAGTGGTCTATTATGGTTGGCGCAATCACCCCAA
CTTTGCGTGGGGAGGACAAGGAAGCAATTTGCAAGCCCCCCAAGCACAGCAAAAGGTGGTGAACCAGTCCGGATTTGCTAAATCGCAGGCATTGCCCCAGCAAAATAAGC
AGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAAAGATGAAAGAATATATGGCTCGTACAGATGCCGCAATTCAAAGTAATCAAGTTTCAATGAGAGCCTTG
AAATTGCATATGGGTCAGCTATCTAATGAGCTAAAGGCACGACCTCACGGGAACATTCCTTCAGATACTGAACACCCTAGAAGGGAGGGTACGAAGCAGGTGCAGGCAGT
AACTCTAAGGAGCGGTAAGTCACTAGAAGAGAGGAAAGAGCCTAGTAAACATCAAGAGGTAGAGAAATGTAGCAATAAAAATAGTGTTGTTGAGAAAGAGTTGGAGTCTG
GTAAAAATGATAGAGGCGACATTAAAGATGCTGGAGCATCTAGTTTTGTTCCAAATGTAGAACCACCACCTTATGTACCGCCTCCACCCTATAACTCACCCCTACCTTTT
CCACAAAAGGACATCTTGACTAAAAAGAAGAGATTAAGAGAGTCTGAAACCGTGACTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCTTCCAAGGTTGA
GGATCCAGAATCATTCACTATTCCTATATCCATAGGTGGAAAAGAGTTGGGGAGAGCACTTTGTGATTCAGGCGCAAGCATTAACCTTATGCCTCTTTCGGCTTATCGAA
AGCTAGGTATTGGTGAAGTTAGGCCTACCACAGTTACACTCCAATTAGCATACAGATCTATCATATATCCTGAGGGTAAGATTGAGAATGTTCTAGTCCAGGTGCATGAC
CAAAAGGTGAAGTTCAATGTGTTTGATGCAATGAAATATCCTAATGATATTGAGGATTGCTCGTGCATTCAGGTGTTGGATGAAATGGTTGAGGACCACTTTGAGAAGGA
ATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTAGATTGAAGTATGAGCATAAAGATACAGGTAAGATTTCTAGTTTTAAGA
GGAATTTTGAATCCTTAGAGTCAATAGATGAGAAATCCAAGCCTATTGAACCTTGTAATTCATTGACATTGCTTCAGCAACCTGAGATTAGGAACTCCTTCATTGATGAG
TGGTTATTTACTGTAGCTCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCGTGGTTTATCAAAAGAACAAAT
GAAAGAATATTTCCATGGGGTGAAGTTTTATTTATCTAATGATGCATCTATGGTTAGACAATGTGTTGATGAGGATGGTTGTGAGTTGAAAGTAACAGGGCAGCAAATAA
AGCATTATTGGAGAGAGGAGTTTCGTTCCAAATATCTCTCTTTGATGTTGCTGGCAGATTTAGGGAGCACAAGTTTTGAGGGAGCAAAAATTTCAGCATTCGAATCAGAA
GCTCCTATTATTTGA
Protein sequenceShow/hide protein sequence
MQKYGSTSCQGRGRPPQLRGRALSSKCQDHTDAQGADEDNLTQVGQENRPRGRPDQGSGHDGRPRGPLAWVEHGHLLSVSDAPDRPDSAWGRVIRAYVSPIVDELNPVII
CPVIEATNFEMKTVMFHMLQMIRQFHGLSSEDPHLHLKYFLGVSDSFVIQGVSRDALRLTLFPYSLTDGTKAWLSSFAPGTISSWNELVENFLTKYFPPIRNAKLRSEIV
RFRQNEEETFSEAWERFKELLRKCPYHYLPHCIQIKIFYNGLNITTQCMVDASAGGALLAKTFNEAYEIFERISTYSCQWSDVRGSSKKVKSVIAVDSVSTIRADIATLA
NTLKNINVVSNVYQPPVVESAVVVNQVVYYGWRNHPNFAWGGQGSNLQAPQAQQKVVNQSGFAKSQALPQQNKQALPQQNSESSLEAKMKEYMARTDAAIQSNQVSMRAL
KLHMGQLSNELKARPHGNIPSDTEHPRREGTKQVQAVTLRSGKSLEERKEPSKHQEVEKCSNKNSVVEKELESGKNDRGDIKDAGASSFVPNVEPPPYVPPPPYNSPLPF
PQKDILTKKKRLRESETVTLTEECSVILKNGLPSKVEDPESFTIPISIGGKELGRALCDSGASINLMPLSAYRKLGIGEVRPTTVTLQLAYRSIIYPEGKIENVLVQVHD
QKVKFNVFDAMKYPNDIEDCSCIQVLDEMVEDHFEKELMEYHTQKFGEIQIEDLEIGRLKYEHKDTGKISSFKRNFESLESIDEKSKPIEPCNSLTLLQQPEIRNSFIDE
WLFTVAHIKAVKTPWYDDFSNYLDFGNLPRGLSKEQMKEYFHGVKFYLSNDASMVRQCVDEDGCELKVTGQQIKHYWREEFRSKYLSLMLLADLGSTSFEGAKISAFESE
APII