; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005764 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005764
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr6:28916465..28923288
RNA-Seq ExpressionLag0005764
SyntenyLag0005764
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]2.3e-11137.48Show/hide
Query:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------
        MA  + +   + ++D  +P      S I+  PINANNFELK  LI M +                                                   
Subjt:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------

Query:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI
                         + F  + FPPAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SK 
Subjt:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI

Query:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSL-------------------ANASMKFSSTGSAQ---------SIEGHHNSSP
         E A  LLE+MA+ +YQWPTER+  KK+ AGI+E++ +++L AQ+ +L                   A+ SM   S  ++Q         +     N  P
Subjt:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSL-------------------ANASMKFSSTGSAQ---------SIEGHHNSSP

Query:  THYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKST
         +YHP +RNHEN SY NTKN+L   +PPG+ SQ +E K SLED + +F+ E++ R  K +  +  I T  +   A + N+E Q+G L   +N  Q+G   
Subjt:  THYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKST

Query:  AEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEAEKLTSDP----LIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIF
        +  E    E CKAIT+   +E++ A  +E            +K++VEE    N+  + T  P       +P +L   +P P++ +K+    QF KFLDIF
Subjt:  AEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEAEKLTSDP----LIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIF

Query:  MSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFGE
          ++IN+PFA+ALEQMP YV+F+K+ +S+K++ ++ ET+ L+  CSA LQK +P KL DP SF+ PC                               GE
Subjt:  MSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFGE

Query:  IKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF
        +K T + LQLAD+S+  P  ++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR +ID+++ ELT+RV  E+ +F
Subjt:  IKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.7e-11237.63Show/hide
Query:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------
        MA  + +   + ++D  +P      S I+  PINANNFELK  LI M +                                                   
Subjt:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------

Query:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI
                         + F  + FPPAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SK 
Subjt:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI

Query:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSL-------------------ANASMKFSSTGSAQ---------SIEGHHNSSP
         E A  LLE+MA+ +YQWPTER+  KK+ AGI+E++ +++L AQ+ +L                   A+ SM   S  ++Q         +     N  P
Subjt:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSL-------------------ANASMKFSSTGSAQ---------SIEGHHNSSP

Query:  THYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKST
         +YHP +RNHEN SY NTKN+L   +PPG+ SQ +E K SLED + +F+ E++ R  K +  +  I T  +   A + N+E Q+G L   +N  Q+G   
Subjt:  THYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKST

Query:  AEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEAEKLTSDP----LIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIF
        +  E    E CKAIT+   +E++ A  +E            +K++VEE    N+  + T  P       +P +L   +P P++ +K+    QF KFLDIF
Subjt:  AEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEAEKLTSDP----LIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIF

Query:  MSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFGE
          ++IN+PFA+ALEQMP YV+F+K+ +S+K++ ++ ET+ L+  CSA LQK +P KL DPGSF+ PC                               GE
Subjt:  MSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFGE

Query:  IKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF
        +K T + LQLAD+S+  P  ++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR +ID+++ ELT+RV  E+ +F
Subjt:  IKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.5e-11838.02Show/hide
Query:  RRIDNKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR-----------------
        RR  ++D  ++P D EIE+T    R  +R ++ +MA  + +   + ++D  +P      S I+  PINANNFELK  LI M +                 
Subjt:  RRIDNKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR-----------------

Query:  ---------------------------------------------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHG
                                                           + F  + FPPAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG
Subjt:  ---------------------------------------------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHG

Query:  YPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTS-------------------
         PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SK  E A  LLE+MA+ +YQWPTER+  KK+ AGI++++ +++L AQ+ +                   
Subjt:  YPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTS-------------------

Query:  LANASMKFSSTGSAQ---------SIEGHHNSSPTHYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAI
        LA+ SM   S  ++Q         +     N  P +YHP +RNHEN SY NTKN+L   +PPG+ SQ +E K SLED + +F+ E++ R  K +  +  I
Subjt:  LANASMKFSSTGSAQ---------SIEGHHNSSPTHYHPNVRNHENFSYANTKNIL---NPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAI

Query:  NTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEA-EKLTSDPLI-
         T  +   AAI NIE Q+G L   +N  Q+G   +  E    E CKAIT+   +E++ +  +E            +K KVEE    N+  E+    P I 
Subjt:  NTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEV------TKEKVEEGSSSNEA-EKLTSDPLI-

Query:  --PSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFP
           +P +L   +P P++ +K+    QF KFLDIF  ++IN+PFA+ALEQMP YV+F+K+ +S+K++ ++ ET+ L+  CSA LQK +P KL DPGSF+ P
Subjt:  --PSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFP

Query:  CNFG-----------------------------EIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIID
        C  G                             E+KPT + LQLAD+S+  P  ++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR +ID
Subjt:  CNFG-----------------------------EIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIID

Query:  IERRELTMRVLHEKEVFK
        +++ ELT+RV  E+ +FK
Subjt:  IERRELTMRVLHEKEVFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]5.2e-10837.21Show/hide
Query:  DTASRRH-----VDSVETLSRRIDNKDALLVPFDFEIEKTCKKNRKEKRERLASMAN-----PNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELK
        D   RRH        +    RR  N D L V  D E E+T +  R  +R    +MA       N  ++Q+ IRD  +P   D  SGI    I A NFELK
Subjt:  DTASRRH-----VDSVETLSRRIDNKDALLVPFDFEIEKTCKKNRKEKRERLASMAN-----PNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELK

Query:  TGLIQMARDS--------------------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQ
         GLI M + +                                                                    F  + FPP+K+ +LR EI  F+
Subjt:  TGLIQMARDS--------------------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQ

Query:  QLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSL
        QL  E  +EAWER+K+LLR+CPQHG+  W+QI++FYNGLN  T+T+VDAAAGG L++K  E A  LL+D+AT SYQWP+ERS  KK+ AG++E+D +++L
Subjt:  QLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSL

Query:  QAQMTSLANASMKFSSTGSAQSIEGHHNSSPT----------------------------HYHPNVRNHENFSYANTKNILN-PPGYTSQKAEIKPSLED
         AQ+ SL N  +  ++ G+ Q+++   ++S +                            HYHP +RNHEN SY N +N L  PPG+ +Q ++ KP LED
Subjt:  QAQMTSLANASMKFSSTGSAQSIEGHHNSSPT----------------------------HYHPNVRNHENFSYANTKNILN-PPGYTSQKAEIKPSLED

Query:  TIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPE----VTKEK
         +  FI+E+ +R NK E  +  I T V+   A + N+E Q+G L   + + QKGK  ++ E    E+C AIT+   + V+ +  ++I  P     VT E+
Subjt:  TIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPE----VTKEK

Query:  VEEGSSSNEAE-----KLTSDPLIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTST
          E     EAE     K  S     +P +L   +P P++  KK +  QF KFL++F  ++IN+PFAE L QMP Y +F+KE +S KKK ++ ETI LT  
Subjt:  VEEGSSSNEAE-----KLTSDPLIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTST

Query:  CSARLQKNVPDKLADPGSFSFPCN-----------------------------FGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVK
        CS  LQK +P KL DPGSF+ PCN                              GE+KPT + LQLAD+S+  P  ++E+VL++V KF LPVDF V+D++
Subjt:  CSARLQKNVPDKLADPGSFSFPCN-----------------------------FGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVK

Query:  ENPLVPVILGRPFLATGRVIIDI
        EN  +P+ILGRPFLATGR +ID+
Subjt:  ENPLVPVILGRPFLATGRVIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]3.0e-11137.57Show/hide
Query:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------
        MA      + + ++D  +P   D  SGI    INANNFELK  LI M +                                                   
Subjt:  MANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMAR---------------------------------------------------

Query:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI
                         + F  + FPPAKT +LR+EIG F+Q   E L+EAWERYK+L+R CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SK 
Subjt:  -----------------DSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKI

Query:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLAN-----------------------ASMKFSSTGSAQSIEGHH-----NSSP
         E A +LLE+MA+ +YQWPTER+  KK+ AGI+E++  ++L AQ+ SL++                         M  +S    Q I   +     N  P
Subjt:  VENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLAN-----------------------ASMKFSSTGSAQSIEGHH-----NSSP

Query:  THYHPNVRNHENFSYANTKNILN-PPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAE
         +YHP +RNHENFSY NTKN+L  PPG+ SQ +E K SLED + +F+ E+     K +  +  I T  +   A + N+E Q+G L   +N  Q+G   + 
Subjt:  THYHPNVRNHENFSYANTKNILN-PPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAE

Query:  PEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEVTKEKVEEGSSSN--EAEKLTSDPL----IP-------SPTVL---VPKPKKKKKKNYSTQFKKFLDI
         E    E CKAIT+    E++ + ++   E E T      G S N  E E++  D L    +P       +P +L   +P P++ +K+    QF KFLDI
Subjt:  PEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEVTKEKVEEGSSSN--EAEKLTSDPL----IP-------SPTVL---VPKPKKKKKKNYSTQFKKFLDI

Query:  FMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFG
        F  ++IN+PFA+ALEQMP Y +F+K+ +S+K++ ++ ET+ L+  CSA +QK +P KL DPGSF+ PC                               G
Subjt:  FMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPC-----------------------------NFG

Query:  EIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF
        E+K T + LQLAD+S+  P  ++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR ++D+++ ELT+RV  E+  F
Subjt:  EIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.5e-9233.87Show/hide
Query:  LVPFDFEIEKTCKKNRKEK------RERLASMANPNPQ-------DEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------
        LVPFD +IE+T +++R+E        + +A   N N         +  + +RD   P        I    INANNFE+K   IQM + S           
Subjt:  LVPFDFEIEKTCKKNRKEK------RERLASMANPNPQ-------DEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------

Query:  ----------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRK
                                                                  F  + FPPAKT K+R +I +F Q   E L+EAWER+KELLR+
Subjt:  ----------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRK

Query:  CPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLA------------
        CP HG PDWLQ+Q FYNGL  + KTI+DAAAGG L+SK   +A  LLE+MA+ +YQWP+ERS  +K A G YEID + +L  Q+ +L+            
Subjt:  CPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLA------------

Query:  NASMKFSSTGSAQSIE------------GHHNSSPTH-----YHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNK-LEEA
        N+ +     G + S +            G+ N    +     Y+P  RNH NFS++N     NP      K  + P  +   R  I E  ++  + L + 
Subjt:  NASMKFSSTGSAQSIE------------GHHNSSPTH-----YHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNK-LEEA

Query:  VIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQM-----EYCKAITVHQVEEVQVAGAQEIHEPEVTKEKVEEGSSSNEAE-------
        +   +  +    A++ N+ETQ+G L N++N   +G   + P  TQ+     E C+AIT+   +E++    Q+  E E+ +   +EG   NE E       
Subjt:  VIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQM-----EYCKAITVHQVEEVQVAGAQEIHEPEVTKEKVEEGSSSNEAE-------

Query:  ----KLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLAD
            + TS  + P P    P P++ +K+    QF+KFL++F  L+IN+PFAEALEQMP YV+F+K+ LS+K+K  + ET+FLT  CSA LQ  +P KL D
Subjt:  ----KLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLAD

Query:  PGSFSFPC-----------------------------NFGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLA
        PGSF+ PC                               GE KPT+V LQLAD+S V P  ++E+VL++V KF  PVDF ++D++E+  +P+ILGRPFLA
Subjt:  PGSFSFPC-----------------------------NFGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLA

Query:  TGRVIIDIERRELTMRVLHEKEVFKAVGVS------TGCSEVMLVGYRKG
        T   IID+   +++ +V  E   F     S        C  V L+   KG
Subjt:  TGRVIIDIERRELTMRVLHEKEVFKAVGVS------TGCSEVMLVGYRKG

A0A6J1DU19 uncharacterized protein LOC1110243611.7e-9638.82Show/hide
Query:  QDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARD---------------------------------------------------SFSCQ
        Q+ Q  IRD  QP F     GI+  PINANN ELK GLIQM R+                                                   +F   
Subjt:  QDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARD---------------------------------------------------SFSCQ

Query:  IFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERS
         FPPAKT +LRTEI +F++   EQLFE WERYKELLRKCPQHG  +WLQIQ+FYNGLN  T+TI+DAAAGGTLLS+  ENA  LL+DMA  S+QWP+ERS
Subjt:  IFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERS

Query:  APKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGSAQSIE-GHHNSSPTHYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESS
          KK+ AG+YEID +SSL+AQ+ +L NA  K S  G++ S E      + ++Y P +   +               +TS  AE K SLED + AFI E  
Subjt:  APKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGSAQSIE-GHHNSSPTHYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESS

Query:  NRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEVTKEK-------VEEGSSS
        +R +++E  V  +   + G++ +I N+E Q+G +   +NT+QKGK  ++ E    E+CKA+T+   +E+Q    +++ EP +T E+       V+E + +
Subjt:  NRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEVTKEK-------VEEGSSS

Query:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLAD
         +A+K TS  ++ SP   +P P+                             ALEQMP YV+F+K+ ++ K+K +  ET+ LT  CSA LQ+ +P KL D
Subjt:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLAD

Query:  PGSFSFPCNFGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF
        PGSF+ PC             +     + P  V+E+VL++V +   P DF V+  +E+  +P+ILGR FLATG  +ID++   LT+RV  E  VF
Subjt:  PGSFSFPCNFGEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF

A0A6P6SQ09 uncharacterized protein LOC1136935786.7e-7733.75Show/hide
Query:  SFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQW
        +F  + FPP KT KLR +I  F Q+  E L+EAWER+++LL KCP HG P+WL IQ FYNGL+ +TKTI+DAAAGG L+ K  +  + L+E+MA  +YQW
Subjt:  SFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQW

Query:  PTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGSAQS---------------------------IEGHHNSSPTH-----YHPNVRNHENFS
          ER   ++  AG+ E+D ++ L AQM ++     +    G + S                           +  ++ ++P +     Y+P  RNH NF 
Subjt:  PTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGSAQS---------------------------IEGHHNSSPTH-----YHPNVRNHENFS

Query:  Y---ANTKNILNPPGY--TSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCK
        +    N +   NPPG+     +AE KP+ E  +      +S+R  ++E  +  +       +    N+E Q+G + N +N    G+  ++ E    E+  
Subjt:  Y---ANTKNILNPPGY--TSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIAINTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCK

Query:  AITVHQVEEVQVAGAQEIHEPEVTKEKVE-EGSSSNEAEKLTSDPLIPS-PTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFI
        AI +   + V+    +     +  K+ +E E  S N+   +  D L P   + ++P P + KK     +F+KF  +F  L+IN+PF +A+ Q+  Y +F+
Subjt:  AITVHQVEEVQVAGAQEIHEPEVTKEKVE-EGSSSNEAEKLTSDPLIPS-PTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFI

Query:  KEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPCNFG-----------------------------EIKPTAVKLQLADQSVVSPYEVVE
        K+ +S+K+K    E I LT  CSA ++  +P KL D GSFS PC  G                             E+K T + LQLAD+S+  P  ++E
Subjt:  KEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPCNFG-----------------------------EIKPTAVKLQLADQSVVSPYEVVE

Query:  NVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF
        NVLI+V +  +PVDF V+D++E+  +P+ILGRPFLAT R +ID+E+ +L +RV  E+  F
Subjt:  NVLIQVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVF

A0A6P8DD93 uncharacterized protein LOC1162064531.8e-7731.77Show/hide
Query:  NKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------
        ++ A L+P D EIE+T  + R+E R R    +  MA    N   Q   + +RD   PT +   S I    I ANNFELK  LIQM + +           
Subjt:  NKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------

Query:  ---------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC
                                                                 F  + FPPA+T +LR EI  F +   E L+EAWER+KE +RKC
Subjt:  ---------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC

Query:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGS--
        P HG PD L I++FY  L+   +++VDAAAGG L+ K  + A  L+E+MA+ ++ W  ERS  K   A + ++D +++L  Q+++L     K +S  S  
Subjt:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGS--

Query:  ----------------AQSIEGHHNSSP--------------------THYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNR
                         + + G+ ++SP                      Y+P  RNH NFS+ N  N L PP          P  +    A  A     
Subjt:  ----------------AQSIEGHHNSSP--------------------THYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNR

Query:  TNKLEEAVIA----INTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAG-----AQEIHEPEVTKEKVEEGSSS
         +++EE +++     +T +    A I N+E Q+  +   ++    G   +  E+   +   AI +   +E+++        +E  E +  K+KVEE    
Subjt:  TNKLEEAVIA----INTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAG-----AQEIHEPEVTKEKVEEGSSS

Query:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQK---NVPDK
        +   K    P +P     VP P++ K++    QF KFLD+F  L IN+PFAEAL+QMP Y +F+K+ L++K+K    E + LT  CS  LQK   N+P K
Subjt:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQK---NVPDK

Query:  LADPGSFSFPC---NF--------------------------GEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRP
          D GSF+ PC   NF                          GE K T V LQLAD+S+  P  +VENVL++V KF  PVDF V++++E+  VP+ILGRP
Subjt:  LADPGSFSFPC---NF--------------------------GEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRP

Query:  FLATGRVIIDIERRELTMRVLHEKEVF
        FLATG+ +ID+E+ +LT+RV++E+  F
Subjt:  FLATGRVIIDIERRELTMRVLHEKEVF

A0A6P8DKJ2 uncharacterized protein LOC1162042316.7e-7731.64Show/hide
Query:  NKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------
        ++ A L+P D EIE+T  + R+E R R    +  MA    N   Q   + +RD   PT +   S I    I ANNFELK  LIQM + +           
Subjt:  NKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQMARDS-----------

Query:  ---------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC
                                                                 F  + FPPA+T +LR EI  F +   E L+EAWER+KE +RKC
Subjt:  ---------------------------------------------------------FSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC

Query:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGS--
        P HG PD L I++FY  L+   +++VDAAAGG L+ K  + A  L+E+MA+ ++ W  ERS  K   A + ++D +++L  Q+++L     K +S  S  
Subjt:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGS--

Query:  ----------------AQSIEGHHNSSP--------------------THYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNR
                         + + G+ ++SP                      Y+P  RNH NFS+ N  N L PP          P  +    A  A     
Subjt:  ----------------AQSIEGHHNSSP--------------------THYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNR

Query:  TNKLEEAVIA----INTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAG-----AQEIHEPEVTKEKVEEGSSS
         +++EE +++     +T +    A I N+E Q+  +   ++    G   +  E+   +   AI +   +E+++        +E  E +  K+KVEE    
Subjt:  TNKLEEAVIA----INTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAG-----AQEIHEPEVTKEKVEEGSSS

Query:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQK---NVPDK
           + L   P +P     VP P + K++    QF KFLD+F  L IN+PFAEAL+QMP Y +F+K+ L++K+K    E + LT  CS  LQK   N+P K
Subjt:  NEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQK---NVPDK

Query:  LADPGSFSFPC---NF--------------------------GEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRP
          D GSF+ PC   NF                          GE K T + LQLAD+S+  P  +VENVL++V KF  PVDF V++++E+  VP+ILGRP
Subjt:  LADPGSFSFPC---NF--------------------------GEIKPTAVKLQLADQSVVSPYEVVENVLIQVGKFFLPVDFFVMDVKENPLVPVILGRP

Query:  FLATGRVIIDIERRELTMRVLHEKEVF
        FLATG+ +ID+E+ +LT+RV++E+  F
Subjt:  FLATGRVIIDIERRELTMRVLHEKEVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCTGATGAGTCATATGATAGCCTAGTTGAGTCTGAGGAATCCACTGAGCATGTAGATATTGAATCTGATGAATGGACTGGCATGATTGATAGGCCACCTTTAGA
TCCCATATCGGTTGATATCATAACTCTTGATGACTCTGTTAACAATTGTTTTGGCCCAGAGGATGATGAAACTTTTATTTCGAGCCTCCGTAAGCGTGCCGAGCCTCGTC
TAGCTAAAGACGTTAAAGAAAGCGCTGCTTGGGAGGCAACCCAAGGATTAAAGTGTTGTTTTTTCGGAGGCGCGCGCATACGAGGCAGCGTCGAGACGCTACCTCCACAG
CGTCTCGACGCTGCAAACTTTCCGAAAGTAATTAGGGCGTACATTGATACAGCGTCGAGACGCCATGTCGACAGCGTCGAGACGCTGTCACGCAGAATCGACAATAAGGA
CGCTCTGCTAGTGCCCTTTGATTTTGAAATCGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAAAGGGAAAGACTTGCATCCATGGCCAATCCAAATCCACAAGATGAGC
AGAAGCCGATACGGGACGATTTCCAGCCTACTTTTCTTGATCAACAATCTGGAATAGTCTACGCGCCGATTAACGCAAATAATTTCGAGCTCAAAACTGGCCTCATCCAG
ATGGCTAGAGATAGCTTTTCTTGCCAAATATTTCCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAATTATTTGAGGC
TTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTACAATGGTCTAAATCCGAACACTAAAACCATTG
TTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGATCGTAGAAAACGCACGTACTTTACTGGAGGATATGGCCACGAAAAGCTACCAGTGGCCAACTGAGCGGTCTGCA
CCTAAGAAGATTGCAGCTGGGATCTATGAGATTGACAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTCCATGAAATTCTCAAGTACAGGGAGTGC
ACAATCTATTGAGGGACACCATAATTCTTCACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATCTTGAACCCTCCTG
GATATACATCTCAAAAGGCTGAAATTAAACCTTCTCTTGAGGATACAATTCGAGCTTTTATTGCAGAGTCTAGTAATAGAACTAACAAGTTGGAGGAAGCTGTGATAGCA
ATAAATACCACAGTAACAGGCCACAGTGCAGCAATTAATAACATAGAAACTCAGTTGGGTCCGTTGGTGAATGCTGTAAACACTTTGCAGAAAGGTAAATCCACAGCTGA
GCCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATCACTGTGCATCAAGTGGAGGAAGTTCAAGTAGCTGGGGCACAGGAGATTCATGAGCCTGAAGTAACTAAGGAAA
AAGTTGAAGAGGGATCGTCTTCAAACGAAGCTGAAAAGCTTACTTCTGACCCTCTTATTCCTTCGCCTACTGTTCTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAAT
TACTCAACTCAATTTAAGAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATTAAGGA
ATGGCTTTCGAGGAAAAAGAAGGAAAAGAAAGTCGAGACAATATTCCTCACCTCTACCTGTAGTGCCCGACTTCAAAAGAATGTACCGGATAAACTTGCTGATCCAGGGA
GTTTTTCTTTTCCCTGCAATTTCGGAGAGATTAAACCTACTGCAGTGAAACTTCAGTTAGCTGACCAGTCTGTAGTTAGTCCATACGAAGTAGTAGAGAATGTTTTAATT
CAAGTAGGGAAATTTTTTCTGCCAGTTGACTTCTTTGTAATGGATGTAAAGGAAAATCCGTTAGTGCCTGTTATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTAT
CATTGACATCGAGCGTAGGGAGCTGACTATGCGAGTCCTACATGAAAAAGAAGTTTTTAAAGCGGTTGGAGTCTCTACAGGTTGTTCTGAAGTTATGCTGGTCGGTTACA
GAAAAGGTGCAAATTTACAGAGGTTTAAGCTCGTGGGTCTCATGATTTTCGGTGCATTTTCGCCATTTTCGGTTCGAGTTCGCCATGGGTTTAGTTTTGACACAATGCAC
AAAAGATCTAGGAGACACGCTGCAGCTTCTTCCTCCTCTTCCACTCCTTTTGACCCAGATAGGTTCATTAGTAGTGAAGCCTCAGAAAGATTTGACAAATATGTGGAGAG
TCGGAACTTTATTCCTGAGCGAGGTTTCAGCTCTAATCCCGAGATGCAACCAAACTTAGTGAACAATATTGTTGCACGGGGTTGGGGCGACTTTGTTCATCATCCTGCGC
CCGGGGTAGCGTCTATAGTGAGGGAGTTCTATGCCAACATGGAAGACTCTTCTTCCTCATCCTTTGTTCGAGGACACCGTGTCCCCTACGACTCCCTCACCATCAATAGG
TTTTATGGGCTGCCCGATTTGAGAGGGATGATTACAGCACCTACCTACATGGCCACTTGGACGTTCATGAGGAGCGGAGGTGGTGAGATTCAGGACCACCGACCTGTTCG
TAGATTACAGGGCGTGGCACACGTTCCTCTACGCCAAGTTGATGCCTGTGGCGCATTTAAGCGATGTTACCAAGAGTCGTGCCGTTCTTCTATTCGCTATCGCCACAAGT
CGTTCGGTGAATGTCGGGCAGCCCTTTGTCGAGCTGCTGGTGTCGTGTGGGACGCCCAGGAGGAGATGGTCCACCCCGGGGCGATTATCGACAAGAATTTCATCAGTCGC
TACCGAGGACCTGGACCACAGGGTGCGCAGCCAGCTCTCCCCATCCACGTAAAGCCAGAGCATCTTGAGCAGCCTGCAGAGCCTGAGCACCAAGAGCAGGAGGTTCCGCA
TTCCACCATCGAGGAGCTGCTGCAGCAGCTGCACATGGAGTTCCAAAGCCACCGGCTGGAGTTCCATACTCTCCAACAAGGGTTCCAGAGTCATCAACAGGAGCTCCAGA
GTCAGCAGCGCGAGCACCAGAGAGTGAGGAGTAGAGATCGTCGTCATTTCCTCTACACGACGAGCATGCATGCCCACACCTATCAGTGTCAGGTGGCTATTAGTACGGGT
CAGCCTTTACCGCCACCTTTACCACCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACCCTGATGAGTCATATGATAGCCTAGTTGAGTCTGAGGAATCCACTGAGCATGTAGATATTGAATCTGATGAATGGACTGGCATGATTGATAGGCCACCTTTAGA
TCCCATATCGGTTGATATCATAACTCTTGATGACTCTGTTAACAATTGTTTTGGCCCAGAGGATGATGAAACTTTTATTTCGAGCCTCCGTAAGCGTGCCGAGCCTCGTC
TAGCTAAAGACGTTAAAGAAAGCGCTGCTTGGGAGGCAACCCAAGGATTAAAGTGTTGTTTTTTCGGAGGCGCGCGCATACGAGGCAGCGTCGAGACGCTACCTCCACAG
CGTCTCGACGCTGCAAACTTTCCGAAAGTAATTAGGGCGTACATTGATACAGCGTCGAGACGCCATGTCGACAGCGTCGAGACGCTGTCACGCAGAATCGACAATAAGGA
CGCTCTGCTAGTGCCCTTTGATTTTGAAATCGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAAAGGGAAAGACTTGCATCCATGGCCAATCCAAATCCACAAGATGAGC
AGAAGCCGATACGGGACGATTTCCAGCCTACTTTTCTTGATCAACAATCTGGAATAGTCTACGCGCCGATTAACGCAAATAATTTCGAGCTCAAAACTGGCCTCATCCAG
ATGGCTAGAGATAGCTTTTCTTGCCAAATATTTCCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAATTATTTGAGGC
TTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTACAATGGTCTAAATCCGAACACTAAAACCATTG
TTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGATCGTAGAAAACGCACGTACTTTACTGGAGGATATGGCCACGAAAAGCTACCAGTGGCCAACTGAGCGGTCTGCA
CCTAAGAAGATTGCAGCTGGGATCTATGAGATTGACAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTCCATGAAATTCTCAAGTACAGGGAGTGC
ACAATCTATTGAGGGACACCATAATTCTTCACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATCTTGAACCCTCCTG
GATATACATCTCAAAAGGCTGAAATTAAACCTTCTCTTGAGGATACAATTCGAGCTTTTATTGCAGAGTCTAGTAATAGAACTAACAAGTTGGAGGAAGCTGTGATAGCA
ATAAATACCACAGTAACAGGCCACAGTGCAGCAATTAATAACATAGAAACTCAGTTGGGTCCGTTGGTGAATGCTGTAAACACTTTGCAGAAAGGTAAATCCACAGCTGA
GCCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATCACTGTGCATCAAGTGGAGGAAGTTCAAGTAGCTGGGGCACAGGAGATTCATGAGCCTGAAGTAACTAAGGAAA
AAGTTGAAGAGGGATCGTCTTCAAACGAAGCTGAAAAGCTTACTTCTGACCCTCTTATTCCTTCGCCTACTGTTCTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAAT
TACTCAACTCAATTTAAGAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATTAAGGA
ATGGCTTTCGAGGAAAAAGAAGGAAAAGAAAGTCGAGACAATATTCCTCACCTCTACCTGTAGTGCCCGACTTCAAAAGAATGTACCGGATAAACTTGCTGATCCAGGGA
GTTTTTCTTTTCCCTGCAATTTCGGAGAGATTAAACCTACTGCAGTGAAACTTCAGTTAGCTGACCAGTCTGTAGTTAGTCCATACGAAGTAGTAGAGAATGTTTTAATT
CAAGTAGGGAAATTTTTTCTGCCAGTTGACTTCTTTGTAATGGATGTAAAGGAAAATCCGTTAGTGCCTGTTATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTAT
CATTGACATCGAGCGTAGGGAGCTGACTATGCGAGTCCTACATGAAAAAGAAGTTTTTAAAGCGGTTGGAGTCTCTACAGGTTGTTCTGAAGTTATGCTGGTCGGTTACA
GAAAAGGTGCAAATTTACAGAGGTTTAAGCTCGTGGGTCTCATGATTTTCGGTGCATTTTCGCCATTTTCGGTTCGAGTTCGCCATGGGTTTAGTTTTGACACAATGCAC
AAAAGATCTAGGAGACACGCTGCAGCTTCTTCCTCCTCTTCCACTCCTTTTGACCCAGATAGGTTCATTAGTAGTGAAGCCTCAGAAAGATTTGACAAATATGTGGAGAG
TCGGAACTTTATTCCTGAGCGAGGTTTCAGCTCTAATCCCGAGATGCAACCAAACTTAGTGAACAATATTGTTGCACGGGGTTGGGGCGACTTTGTTCATCATCCTGCGC
CCGGGGTAGCGTCTATAGTGAGGGAGTTCTATGCCAACATGGAAGACTCTTCTTCCTCATCCTTTGTTCGAGGACACCGTGTCCCCTACGACTCCCTCACCATCAATAGG
TTTTATGGGCTGCCCGATTTGAGAGGGATGATTACAGCACCTACCTACATGGCCACTTGGACGTTCATGAGGAGCGGAGGTGGTGAGATTCAGGACCACCGACCTGTTCG
TAGATTACAGGGCGTGGCACACGTTCCTCTACGCCAAGTTGATGCCTGTGGCGCATTTAAGCGATGTTACCAAGAGTCGTGCCGTTCTTCTATTCGCTATCGCCACAAGT
CGTTCGGTGAATGTCGGGCAGCCCTTTGTCGAGCTGCTGGTGTCGTGTGGGACGCCCAGGAGGAGATGGTCCACCCCGGGGCGATTATCGACAAGAATTTCATCAGTCGC
TACCGAGGACCTGGACCACAGGGTGCGCAGCCAGCTCTCCCCATCCACGTAAAGCCAGAGCATCTTGAGCAGCCTGCAGAGCCTGAGCACCAAGAGCAGGAGGTTCCGCA
TTCCACCATCGAGGAGCTGCTGCAGCAGCTGCACATGGAGTTCCAAAGCCACCGGCTGGAGTTCCATACTCTCCAACAAGGGTTCCAGAGTCATCAACAGGAGCTCCAGA
GTCAGCAGCGCGAGCACCAGAGAGTGAGGAGTAGAGATCGTCGTCATTTCCTCTACACGACGAGCATGCATGCCCACACCTATCAGTGTCAGGTGGCTATTAGTACGGGT
CAGCCTTTACCGCCACCTTTACCACCGTAG
Protein sequenceShow/hide protein sequence
MDPDESYDSLVESEESTEHVDIESDEWTGMIDRPPLDPISVDIITLDDSVNNCFGPEDDETFISSLRKRAEPRLAKDVKESAAWEATQGLKCCFFGGARIRGSVETLPPQ
RLDAANFPKVIRAYIDTASRRHVDSVETLSRRIDNKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDDFQPTFLDQQSGIVYAPINANNFELKTGLIQ
MARDSFSCQIFPPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKIVENARTLLEDMATKSYQWPTERSA
PKKIAAGIYEIDNVSSLQAQMTSLANASMKFSSTGSAQSIEGHHNSSPTHYHPNVRNHENFSYANTKNILNPPGYTSQKAEIKPSLEDTIRAFIAESSNRTNKLEEAVIA
INTTVTGHSAAINNIETQLGPLVNAVNTLQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAGAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTVLVPKPKKKKKKN
YSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFIKEWLSRKKKEKKVETIFLTSTCSARLQKNVPDKLADPGSFSFPCNFGEIKPTAVKLQLADQSVVSPYEVVENVLI
QVGKFFLPVDFFVMDVKENPLVPVILGRPFLATGRVIIDIERRELTMRVLHEKEVFKAVGVSTGCSEVMLVGYRKGANLQRFKLVGLMIFGAFSPFSVRVRHGFSFDTMH
KRSRRHAAASSSSSTPFDPDRFISSEASERFDKYVESRNFIPERGFSSNPEMQPNLVNNIVARGWGDFVHHPAPGVASIVREFYANMEDSSSSSFVRGHRVPYDSLTINR
FYGLPDLRGMITAPTYMATWTFMRSGGGEIQDHRPVRRLQGVAHVPLRQVDACGAFKRCYQESCRSSIRYRHKSFGECRAALCRAAGVVWDAQEEMVHPGAIIDKNFISR
YRGPGPQGAQPALPIHVKPEHLEQPAEPEHQEQEVPHSTIEELLQQLHMEFQSHRLEFHTLQQGFQSHQQELQSQQREHQRVRSRDRRHFLYTTSMHAHTYQCQVAISTG
QPLPPPLPP