; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029754 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029754
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:41804925..41813803
RNA-Seq ExpressionLag0029754
SyntenyLag0029754
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]3.3e-15044.62Show/hide
Query:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD
        MA  + +   + ++DY +P      S I+  PINANNFELK  LI M + + F G P +DPN HL  FL+IC TVK+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD

Query:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT
        WL+SL  GSI +W  + + FLAK+F PAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SKT
Subjt:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS
         E A  LLE+MA+N+YQWPTER+  KK+ AGI+E++ +++L AQ+ +L++     +     QS E  A  S      E + EQV YV  RN NYRG  N 
Subjt:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS

Query:  TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGK
         P +YHP +RNHEN SY NTKN+L   +PPG+ SQ  E K SLED + +F+ E++ R  K +  +  I T  +  GA +KN+E Q+GQL   +++ Q+G 
Subjt:  TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGK

Query:  TTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKK
          +  E    E CKAIT+   +E++    +E                + E E+  + +EE           + P++  P   +P P++ +K+    QF K
Subjt:  TTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKK

Query:  FLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAIL
        FLDIF  ++IN+PFA+ALEQMP YV+F+K+ +S+K++                 +KKL Q       ++LP      F  + +C     I  +  F    
Subjt:  FLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAIL

Query:  LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF
        L +GE+K T + LQLAD+S+  P G++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR +ID+++ ELT+RV  E+ +F
Subjt:  LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.2e-14943.9Show/hide
Query:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD
        MA  + +   + ++DY +P      S I+  PINANNFELK  LI M + + F G P +DPN HL  FL+IC TVK+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD

Query:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT
        WL+SL  GSI +W  + + FLAK+F PAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SKT
Subjt:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS
         E A  LLE+MA+N+YQWPTER+  KK+ AGI+E++ +++L AQ+ +L++     +     QS E  A  S      E + EQV YV  RN NYRG  N 
Subjt:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS

Query:  TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGK
         P +YHP +RNHEN SY NTKN+L   +PPG+ SQ  E K SLED + +F+ E++ R  K +  +  I T  +  GA +KN+E Q+GQL   +++ Q+G 
Subjt:  TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGK

Query:  TTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKK
          +  E    E CKAIT+   +E++    +E                + E E+  + +EE           + P++  P   +P P++ +K+    QF K
Subjt:  TTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKK

Query:  FLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------
        FLDIF  ++IN+PFA+ALEQMP YV+F+K+ +S+K++ ++     L        ++     L   G F  P  +                          
Subjt:  FLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------

Query:  LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF
        L +GE+K T + LQLAD+S+  P G++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR +ID+++ ELT+RV  E+ +F
Subjt:  LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]8.9e-15644.46Show/hide
Query:  MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKS
        MR+ +   ++P D EIE+T    R  +R ++ +MA  + +   + ++DY +P      S I+  PINANNFELK  LI M + + F G P +DPN HL  
Subjt:  MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKS

Query:  FLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGY
        FL+IC TVK+NGV+ED IRLRLFPFSL+DKAR WL+SL  GSI +W  + + FLAK+F PAKT +LR+EIG F+Q   E L+EAWERYK+L+R+CPQHG 
Subjt:  FLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGY

Query:  PDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESA
        PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWPTER+  KK+ AGI++++ +++L AQ+ +L++     +     QS E  
Subjt:  PDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESA

Query:  ALASQC--QEETTTEQVHYV-ERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIA
        A  S      E + EQV YV  RN NYRG  N  P +YHP +RNHEN SY NTKN+L   +PPG+ SQ  E K SLED + +F+ E++ R  K +  +  
Subjt:  ALASQC--QEETTTEQVHYV-ERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIA

Query:  INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEA
        I T  +  GAAIKNIE Q+GQL   +++ Q+G   +  E    E CKAIT+   +E++    +E                + E E+  + +EE   +   
Subjt:  INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEA

Query:  EKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------
            + P++  P   +P P++ +K+    QF KFLDIF  ++IN+PFA+ALEQMP YV+F+K+ +S+K++                 +KKL Q       
Subjt:  EKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------

Query:  YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGR
        ++LP      F  K +C     I  + L     L + E+KPT + LQLAD+S+  P G++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR
Subjt:  YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGR

Query:  VIIDIERRELTVRVLHEKEIFK
         +ID+++ ELT+RV  E+ +FK
Subjt:  VIIDIERRELTVRVLHEKEIFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]7.3e-15045.09Show/hide
Query:  MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMAN-----PNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPN
        MR+ ++  L+  D E E+T +  R  +R    +MA       N  ++Q+ IRDY +P   D  SGI    I A NFELK GLI M + + F G   EDPN
Subjt:  MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMAN-----PNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPN

Query:  SHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC
        +HL SFL+IC TVK+NGV+EDAIRLRLF FSL+DKA+ W +SLP GSITTWD L Q FL KYF P+K+ +LR EI  F+QL  E  +EAWER+K+LLR+C
Subjt:  SHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKC

Query:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQ
        PQHG+  W+QI++FYNGLN  T+T+VDAAAGG L++KT E A  LL+D+ATNSYQWP+ERS  KK+ AG++E+D +++L AQ+ SL N  +  +  G+ Q
Subjt:  PQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQ

Query:  SIESAALASQCQEET--TTEQVHYVE-RNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEA
        +++S    S   +ET    EQV Y++ RN N RG + +   HYHP +RNHEN SY N +N L  PPG+ +Q  + KP LEDI+G FI+E+  R NK E  
Subjt:  SIESAALASQCQEET--TTEQVHYVE-RNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEA

Query:  VIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHEPE---VTKEEVEEGSSSTEAE-----KLTS
        +  I T V+  GA +KN+E Q+GQL   + S QKGK  ++ E    E+C AIT+   + V+    ++I  P    +  +E +     TEAE     K  S
Subjt:  VIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHEPE---VTKEEVEEGSSSTEAE-----KLTS

Query:  DPLIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFL
             +P +L   +P P++  KK +  QF KFL++F  ++IN+PFAE L QMP Y +F+KE +S KKK ++     L      D  +     L   G F 
Subjt:  DPLIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFL

Query:  FPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVI
         P  +                          L +GE+KPT + LQLAD+S+  P G++E+VL++V KF LPVDF V+D++EN  +P+ILGRPFLATGR +
Subjt:  FPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVI

Query:  IDI
        ID+
Subjt:  IDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.8e-14844.46Show/hide
Query:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD
        MA      + + ++DY +P   D  SGI    INANNFELK  LI M + + F G P +DPN HL  FL+IC T+K+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD

Query:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT
        WL+SL  GSIT+W  + + FLAK+F PAKT +LR+EIG F+Q   E L+EAWERYK+L+R CPQHG PDWLQ+Q+FYNGLN  T+TIVDAA+GGTL+SKT
Subjt:  WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS
         E A +LLE+MA+N+YQWPTER+  KK+ AGI+E++  ++L AQ+ SL++     +     Q  E  A +S      E + EQV Y+  RN NYRG  N 
Subjt:  VENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS

Query:  TPTHYHPNVRNHENFSYANTKNILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTT
         P +YHP +RNHENFSY NTKN+L  PPG+ SQ  E K SLED + +F+ E+     K +  +  I T  +  GA +KN+E Q+GQL   +++ Q+G   
Subjt:  TPTHYHPNVRNHENFSYANTKNILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTT

Query:  AEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFL
        +  E    E CKAIT+    E++   ++E                + E E+ ++ + E           + P++ +P   +P P++ +K+    QF KFL
Subjt:  AEPEKTQMEYCKAITVHQVEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFL

Query:  DIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLN
        DIF  ++IN+PFA+ALEQMP Y +F+K+ +S+K++                 +KKL Q       ++LP      F  K +C     I  + L     L 
Subjt:  DIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLN

Query:  IGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF
        +GE+K T + LQLAD+S+  P G++E+VL++V KF  P DF V+D++E+  VP+ILGRPFLATGR ++D+++ ELT+RV  E+  F
Subjt:  IGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIF

TrEMBL top hitse value%identityAlignment
A0A5N6LUB5 Retrotrans_gag domain-containing protein8.8e-10935.3Show/hide
Query:  EIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDS-AFRGFPSEDPNSHLKSFLDICGTVKLNGV
        E EK      +   E +A   N     E++ I DY +P+  D  S IV   INANNFE++   IQM +++  F G   EDP++H+ SF++IC T K NGV
Subjt:  EIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDS-AFRGFPSEDPNSHLKSFLDICGTVKLNGV

Query:  SEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGL
        S+DAI+LR+FPFSL+D+A+ WL SLP GS+TTW+ L Q FL KYF P+KT +LR  I +F Q   E L++AWERYK+L+RKCP HG   W+Q+  FYNGL
Subjt:  SEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGL

Query:  NPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQ---MTSLANAFMKFSGTGSAQSIESAALASQCQE--
         P  + ++DA AGG    KT E    LLE +A N++QW   R    K   G++++D+ +SL AQ   MT   N                  L+  CQ   
Subjt:  NPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQ---MTSLANAFMKFSGTGSAQSIESAALASQCQE--

Query:  -ETTTEQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSY-ANTKNILNPPGY--------------ASQKVENKP------------------------
         +T+ EQV ++   S  R  +N     Y+P  +NH NFS+ A+  N   PPG+              A Q+ +++P                        
Subjt:  -ETTTEQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSY-ANTKNILNPPGY--------------ASQKVENKP------------------------

Query:  SLEDIVGAFIA-------ESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHE
        +LE ++  F++       +S  R ++ +         +   G+AI+ IE Q+GQ+   ++  +KGK  +  E    E+CKA+T+   +  +  D     +
Subjt:  SLEDIVGAFIA-------ESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHE

Query:  PEVTKE-EVEEGSSSTEAEKLTSDPL-----IPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQY
        P V +E EV++   +T+ +     P+     +  PT  +P P + K +N    + KFLD+F  L+INLPF EAL QMPKY +F+K+ L+ K+K ++L   
Subjt:  PEVTKE-EVEEGSSSTEAEKLTSDPL-----IPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQY

Query:  SLPLHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFF
         L        +      +   G F  P ++                          L++GE KPT + +QLAD+SV  P G+VEN+L+++GKF  PVDF 
Subjt:  SLPLHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFF

Query:  VMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFK
        ++D+ E+  VP+ILGRPFLAT R ++D+   +LT+RV  E+ +F+
Subjt:  VMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.2e-12739.01Show/hide
Query:  MRKDKDALLVPFDFEIEKTCKKNRKEK------RERLASMANPNPQ-------DEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSA-F
        M++  +  LVPFD +IE+T +++R+E        + +A   N N         +  + +RDY  P        I    INANNFE+K   IQM + S  F
Subjt:  MRKDKDALLVPFDFEIEKTCKKNRKEK------RERLASMANPNPQ-------DEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSA-F

Query:  RGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWE
         G PS+DPNSHL +FL+IC T K NGV++DAIRLRLFPFSL+DKA+ WL SLP+GSITTW+ L Q FLAK+F PAKT K+R +I +F Q   E L+EAWE
Subjt:  RGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWE

Query:  RYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFM
        R+KELLR+CP HG PDWLQ+Q FYNGL  + KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWP+ERS  +K A G YEID + +L  Q+ +L+    
Subjt:  RYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFM

Query:  KFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTH-----YHPNVRNHENFSYANTKNILN-----PPGYASQK----VENKPSLED
        K    G      S  +   C +  + +Q  Y   +  + G+ N    +     Y+P  RNH NFS++N     N     PPG+  Q      E K  LE+
Subjt:  KFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTH-----YHPNVRNHENFSYANTKNILN-----PPGYASQK----VENKPSLED

Query:  IVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQM-----EYCKAITVHQVEEVQVVDTQEIHE--PEVT
        ++  +I+++              +  +   GA+++N+ETQ+GQL N++++  +G     P  TQ+     E C+AIT+   +E++ V+ + +      V 
Subjt:  IVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQM-----EYCKAITVHQVEEVQVVDTQEIHE--PEVT

Query:  KE-------EVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLP
        KE       E+++        + TS  + P P    P P++ +K+    QF+KFL++F  L+IN+PFAEALEQMP YV+F+K+ LS+K+K  +     L 
Subjt:  KE-------EVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLP

Query:  LHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMD
               +      L   G F  P  +                          L +GE KPT+V LQLAD+S V P G++E+VL++V KF  PVDF ++D
Subjt:  LHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMD

Query:  VKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCS
        ++E+  +P+ILGRPFLAT   IID+   +++ +V  E   F    AS + S
Subjt:  VKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCS

A0A6J1DU19 uncharacterized protein LOC1110243612.6e-11640.39Show/hide
Query:  QDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLP
        Q+ Q  IRDY QP FP+   GI+  PINANN ELK GLIQM R++ FRG  +EDPN+HL  FLD+CGTVK+NGV +DAIRLRLFP SLQDK         
Subjt:  QDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLP

Query:  SGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENART
                 +VQAFL  +F PAKT +LRTEI +F++   EQLFE WERYKELLRKCPQHG  +WLQIQ+FYNGLN  T+TI+DAAAGGTLLS+T ENA  
Subjt:  SGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENART

Query:  LLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTHYHPNV
        LL+DMA NS+QWP+ERS  KK+ AG+YEID +SSL+AQ+ +L NA  K SG G++ S E  A                             T ++Y P +
Subjt:  LLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTHYHPNV

Query:  RNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEY
           +               + S   E K SLED++GAFI E   R +++E  V  +   + G+  +IKN+E Q+GQ+   +++MQKGK  ++ E    E+
Subjt:  RNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEY

Query:  CKAITVHQVEEVQVVDTQEIHEPEVTKEE-------VEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQM
        CKA+T+   +E+Q  + +++ EP +T EE       V+E + + +A+K TS  ++ SP   +P P+                             ALEQM
Subjt:  CKAITVHQVEEVQVVDTQEIHEPEVTKEE-------VEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQM

Query:  PKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMD
        P YV+FMK+ ++ K+K +     +L        +R     L   G F  P  + +    K       +     + P GV+E+VL++V +   P DF V+ 
Subjt:  PKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMD

Query:  VKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCSEVMMFRSVCAVGCSI
         +E+  +P+ILGR FLATG  +ID++   LT+RV  E  +F    A     EV     +  V  ++
Subjt:  VKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCSEVMMFRSVCAVGCSI

A0A6P8DD93 uncharacterized protein LOC1162064534.7e-11036.99Show/hide
Query:  MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSE
        MR+ + A L+P D EIE+T  + R+E R R    +  MA    N   Q   + +RDY  PT     S I    I ANNFELK  LIQM + + F G+P+E
Subjt:  MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSE

Query:  DPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELL
         P+ H+  FL  C TVK+N V++D IRL+LFPFSL+DKAR W  SLP  SITTW  L   FL ++F PA+T +LR EI  F +   E L+EAWER+KE +
Subjt:  DPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELL

Query:  RKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG
        RKCP HG PD L I++FY  L+   +++VDAAAGG L+ K  + A  L+E+MA++++ W  ERS  K   A + ++D +++L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG

Query:  SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFI
        S  +    A    C    +T              EQV++V  N+  R +       Y+P  RNH NFS+ N  N L PP          P  +    A  
Subjt:  SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFI

Query:  AESSKRTNKLEEAVIA----INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVD-----TQEIHEPEVTKEEV
        A   +  +++EE +++     +T +    A I+N+E Q+ Q+   +S+   G   +  E+   +   AI +   +E+++V+      +E  E +  K++V
Subjt:  AESSKRTNKLEEAVIA----INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVD-----TQEIHEPEVTKEEV

Query:  EEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL
        EE       + L   P +P     VP P++ K++    QF KFLD+F  L IN+PFAEAL+QMP Y +FMK+ L++K+K                 +K L
Subjt:  EEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL

Query:  RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVP
                    F   C        N+LI     + L P  +   L +GE K T V LQLAD+S+  P G+VENVL++V KF  PVDF V++++E+  VP
Subjt:  RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVP

Query:  VILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGA
        +ILGRPFLATG+ +ID+E+ +LT+RV++E+  F    A
Subjt:  VILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGA

A0A6P8DKJ2 uncharacterized protein LOC1162042311.8e-10936.86Show/hide
Query:  MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSE
        MR+ + A L+P D EIE+T  + R+E R R    +  MA    N   Q   + +RDY  PT     S I    I ANNFELK  LIQM + + F G+P+E
Subjt:  MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSE

Query:  DPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELL
         P+ H+  FL  C TVK+N V++D IRL+LFPFSL+DKAR W  SLP  SITTW  L   FL ++F PA+T +LR EI  F +   E L+EAWER+KE +
Subjt:  DPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELL

Query:  RKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG
        RKCP HG PD L I++FY  L+   +++VDAAAGG L+ K  + A  L+E+MA++++ W  ERS  K   A + ++D +++L  Q+++L     K +   
Subjt:  RKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG

Query:  SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFI
        S  +    A    C    +T              EQV++V  N+  R +       Y+P  RNH NFS+ N  N L PP          P  +    A  
Subjt:  SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFI

Query:  AESSKRTNKLEEAVIA----INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVD-----TQEIHEPEVTKEEV
        A   +  +++EE +++     +T +    A I+N+E Q+ Q+   +S+   G   +  E+   +   AI +   +E+++V+      +E  E +  K++V
Subjt:  AESSKRTNKLEEAVIA----INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVD-----TQEIHEPEVTKEEV

Query:  EEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL
        EE       + L   P +P     VP P + K++    QF KFLD+F  L IN+PFAEAL+QMP Y +FMK+ L++K+K                 +K L
Subjt:  EEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL

Query:  RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVP
                    F   C        N+LI     + L P  +   L +GE K T + LQLAD+S+  P G+VENVL++V KF  PVDF V++++E+  VP
Subjt:  RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVP

Query:  VILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGA
        +ILGRPFLATG+ +ID+E+ +LT+RV++E+  F    A
Subjt:  VILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAAGGATAAGGACGCTCTGCTAGTGCCCTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAGAGGGAAAGACTTGCATCAATGGCTAATCC
AAATCCACAAGATGAGCAAAAGCCGATACGGGACTATTTCCAGCCTACTTTTCCTGATCAACAATCTGGGATAGTCTACGCGCCGATTAATGCAAATAATTTTGAGCTGA
AAACTGGCCTCATCCAGATGGCTAGAGATAGTGCATTTAGAGGATTCCCCTCTGAGGATCCTAATTCTCATTTAAAATCTTTTCTTGACATCTGTGGGACTGTGAAGTTG
AATGGTGTCTCTGAAGATGCCATACGCTTACGATTGTTTCCTTTCTCTTTGCAGGACAAAGCTAGAGACTGGCTCAGATCACTTCCATCTGGAAGCATAACTACGTGGGA
CGCGTTGGTTCAAGCTTTTCTTGCCAAATATTTCTCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAACTATTTGAGG
CTTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTATAATGGTCTAAATCCAAATACTAAAACCATT
GTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTAGAGAATGCAAGAACTTTACTTGAGGATATGGCCACGAACAGCTACCAGTGGCCAACTGAGCGGTCTGC
ACCTAAGAAAATTGCAGCTGGGATTTATGAGATCGATAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTTCATGAAATTCTCAGGTACAGGGAGTG
CACAATCTATTGAGTCAGCTGCTCTTGCATCTCAGTGTCAGGAGGAGACCACTACTGAACAGGTTCATTATGTTGAAAGAAATTCTAACTATAGGGGACACCATAATTCT
ACACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATATTGAACCCTCCTGGTTATGCATCTCAAAAGGTTGAAAATAA
GCCTTCTCTTGAAGATATAGTTGGAGCTTTTATTGCAGAATCGAGCAAAAGGACCAACAAGTTGGAGGAAGCAGTGATTGCAATAAATACCACTGTGACAGGCCACGGAG
CAGCAATTAAAAACATCGAAACTCAATTGGGTCAACTGGTGAATGCTGTAAGCAGTATGCAGAAAGGTAAAACCACAGCTGAACCAGAGAAAACCCAAATGGAGTATTGC
AAGGCCATTACCGTACACCAAGTGGAGGAAGTTCAAGTAGTTGATACACAGGAGATTCATGAGCCTGAAGTCACTAAGGAAGAAGTTGAAGAGGGTTCATCTTCAACCGA
AGCTGAAAAACTCACTTCTGACCCTCTTATTCCTTCACCTACTGTTTTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTCAAAAAGTTTCTTG
ATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATGAAGGAATGGCTTTCGAGGAAAAAGAAGGAAAAG
AAGTTGAGACAATATTCCTTACCTCTACATGCAGTGCCCGACTTCAAAAGAATGTGCCTGACAAACTTGCTGATCCAGGGAGTTTTTCTTTTCCCTGCAATTTTGTTAAA
CATAGGAGAGATTAAACCTACTGCAGTAAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCGTATGGAGTAGTAGAGAATGTATTGATTCAAGTAGGGAAATTTTTTC
TGCCAGTTGACTTCTTTGTGATGGATGTAAAGGAGAACCCTGTAGTGCCTGTGATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTATCATTGACATTGAGCGTAGG
GAGCTGACTGTGCGAGTCTTACATGAAAAAGAAATTTTTAAAGCAGTTGGAGCCTCTACAAATTGCTCTGAAGTTATGATGTTTCGAAGTGTTTGTGCCGTGGGTTGCTC
GATTTTCGGCCGGTTGCAGCGTTTAGTTAGTGGAGATTGTCTAGCATTTGAAGCCTCTTCTCGCCGCCAGTTGAAGTTTTTTGGTCGAGTTGCAGCCGTTGTATGCTCGA
GTTCGCCGTGGGTTTCGAGTCCCTCAAGTTTGGGCGAGCTTTTGGGTTGTTTTCTTGCGTTTTGTCCGAGTTCGCCGTGGGTTGCAAGTTTCGAGATTTTCAGCAAGTTT
CCGGCGATTCTTCAGCTTGTTTTAGCCGTGGCTTTTAGTACCTCTCCATTAGTTACAATGCATAAAAGATCTAGGAGACATGCTGCCTCTTCTTCCTCCTCCCCTCCACC
ACCTTTTGACCAAGATAGGTTCGTTAGTGCTGAAGCCTCAGCGAGGTTTGATAAATATGTGGAGGGTAGAAATTTTATTCCTGAGCGTGGTTTTAGCCCCGACCCCGAGA
TGCAACCAAACCTTGTTAATAACATTGTTGCGCGGGGTTGGGGCGACTTTGTTCATCATCCTGCACCTGGGGTAGCAACTATAGTGAGGGAGTTCTATGCCAACATGGAA
ACCTCCTCTTCATCATCCTTTGTTCGAGGACACCGTGTCCTCTACGACCCCCTCACCATAAATAGGTTTTATAGTTTGCCCAACTTTGACAGGGATGAATATAGTACCTA
TCTTCATGGCCACTTGGATGTAAACGAAGTCATTCAGACTATTTGTAGGCCAGGGGCAGAATGGATTATGACTGGCGCAGAGGTGGTGAGATTCAAAACCACTGATTTGT
TCGTAGATTACAGAGCGTGGCATACCTTCCTCTGTGCCAAGTTGATGCCTGTGGCGCATCTTAGCGATGTTACCAAGAGTCGTGCCATCCTGCTTTTCGCTATCGCCACA
GGTCGCTCCGTCAATGTCGGTCAGGTCATCCATCAGTCTATGAACCATATTCGCCGACGTTACACGACAGTGGGGCTCGGGCATCCTTCACTGATCACAGCCCTTTGCCG
AGCTGCTGGTTTCGTGTGGGACGCCCAAGAGGAGTTGGTTCATCCTGGAGCGCTAATCGACAAGAACTTCATCAGTCGCTACAGAGGACCTGGACCACAGGGCGCACAAC
CACCCCTCACTATCCATGCACCGCCACAGCATCATGAGCAGCACGAGCAGCCTGCAGAGCCTGAGCAGCAGGAGCAGGAGATTCCACATCCCTCCATCGAGGAGCAGCTG
CAGCAGCTGCGTATGGAGTTTCAGAGCCATCGCCTAGACTTCCAGACTCTCCAGCAGGGCATCCAGAGCCAGCAGCGAGAGCACCAGAGGAGAGGCGTAGAGATCGTCGC
CATTTCCTCTACTCCACGAGCATGCATGCCCACACCTATCAGTGTCAGGAGGCGCGCGCATCGGGACAGCGTCGAGACGCTACCTCCACAGCGTCTCGACGCTGCGAACT
TTCCAGATCTAATTAGGGCATACGCGCGACAGCGTCGCGACGCTATGTCCTCAGCGTCTCGACGCTGTCACGCAGATTCGACAGTATTTAAAGGAAACGATGCTTTCAGA
ATGGGGAGCTGTTTTGGAGAGAGAAAAACGACTTCTAGAGAGAGAAAAACGACCAGAACAAGGCAGAGAGCGAGGGGGCTTGATTCGAGGAGCTTGGATCGAGAGAGGTA
TGCCAAGGATAAGCACGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCG
AACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCACTAAGGGATCCGACGAAAAGACGAAAGGGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACA
AAGCGGTCGAAACAGACTAAAAACACAGATAAGGCGAGTCATGTGACACCAGAGGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATC
TGATAGTGTGACGAATGAAAACTCCACGAGTGATGACGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGACGATGATTACTTTATGTCACCATCGAAAA
GAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCCTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGC
CACCTACTTGACTTCACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACTTGATCCAGCATCAGTGCAAACCCAACTTTACTTCAAGATTGGAGGGAAAATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAAAGGATAAGGACGCTCTGCTAGTGCCCTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAGAGGGAAAGACTTGCATCAATGGCTAATCC
AAATCCACAAGATGAGCAAAAGCCGATACGGGACTATTTCCAGCCTACTTTTCCTGATCAACAATCTGGGATAGTCTACGCGCCGATTAATGCAAATAATTTTGAGCTGA
AAACTGGCCTCATCCAGATGGCTAGAGATAGTGCATTTAGAGGATTCCCCTCTGAGGATCCTAATTCTCATTTAAAATCTTTTCTTGACATCTGTGGGACTGTGAAGTTG
AATGGTGTCTCTGAAGATGCCATACGCTTACGATTGTTTCCTTTCTCTTTGCAGGACAAAGCTAGAGACTGGCTCAGATCACTTCCATCTGGAAGCATAACTACGTGGGA
CGCGTTGGTTCAAGCTTTTCTTGCCAAATATTTCTCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAACTATTTGAGG
CTTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTATAATGGTCTAAATCCAAATACTAAAACCATT
GTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTAGAGAATGCAAGAACTTTACTTGAGGATATGGCCACGAACAGCTACCAGTGGCCAACTGAGCGGTCTGC
ACCTAAGAAAATTGCAGCTGGGATTTATGAGATCGATAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTTCATGAAATTCTCAGGTACAGGGAGTG
CACAATCTATTGAGTCAGCTGCTCTTGCATCTCAGTGTCAGGAGGAGACCACTACTGAACAGGTTCATTATGTTGAAAGAAATTCTAACTATAGGGGACACCATAATTCT
ACACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATATTGAACCCTCCTGGTTATGCATCTCAAAAGGTTGAAAATAA
GCCTTCTCTTGAAGATATAGTTGGAGCTTTTATTGCAGAATCGAGCAAAAGGACCAACAAGTTGGAGGAAGCAGTGATTGCAATAAATACCACTGTGACAGGCCACGGAG
CAGCAATTAAAAACATCGAAACTCAATTGGGTCAACTGGTGAATGCTGTAAGCAGTATGCAGAAAGGTAAAACCACAGCTGAACCAGAGAAAACCCAAATGGAGTATTGC
AAGGCCATTACCGTACACCAAGTGGAGGAAGTTCAAGTAGTTGATACACAGGAGATTCATGAGCCTGAAGTCACTAAGGAAGAAGTTGAAGAGGGTTCATCTTCAACCGA
AGCTGAAAAACTCACTTCTGACCCTCTTATTCCTTCACCTACTGTTTTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTCAAAAAGTTTCTTG
ATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATGAAGGAATGGCTTTCGAGGAAAAAGAAGGAAAAG
AAGTTGAGACAATATTCCTTACCTCTACATGCAGTGCCCGACTTCAAAAGAATGTGCCTGACAAACTTGCTGATCCAGGGAGTTTTTCTTTTCCCTGCAATTTTGTTAAA
CATAGGAGAGATTAAACCTACTGCAGTAAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCGTATGGAGTAGTAGAGAATGTATTGATTCAAGTAGGGAAATTTTTTC
TGCCAGTTGACTTCTTTGTGATGGATGTAAAGGAGAACCCTGTAGTGCCTGTGATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTATCATTGACATTGAGCGTAGG
GAGCTGACTGTGCGAGTCTTACATGAAAAAGAAATTTTTAAAGCAGTTGGAGCCTCTACAAATTGCTCTGAAGTTATGATGTTTCGAAGTGTTTGTGCCGTGGGTTGCTC
GATTTTCGGCCGGTTGCAGCGTTTAGTTAGTGGAGATTGTCTAGCATTTGAAGCCTCTTCTCGCCGCCAGTTGAAGTTTTTTGGTCGAGTTGCAGCCGTTGTATGCTCGA
GTTCGCCGTGGGTTTCGAGTCCCTCAAGTTTGGGCGAGCTTTTGGGTTGTTTTCTTGCGTTTTGTCCGAGTTCGCCGTGGGTTGCAAGTTTCGAGATTTTCAGCAAGTTT
CCGGCGATTCTTCAGCTTGTTTTAGCCGTGGCTTTTAGTACCTCTCCATTAGTTACAATGCATAAAAGATCTAGGAGACATGCTGCCTCTTCTTCCTCCTCCCCTCCACC
ACCTTTTGACCAAGATAGGTTCGTTAGTGCTGAAGCCTCAGCGAGGTTTGATAAATATGTGGAGGGTAGAAATTTTATTCCTGAGCGTGGTTTTAGCCCCGACCCCGAGA
TGCAACCAAACCTTGTTAATAACATTGTTGCGCGGGGTTGGGGCGACTTTGTTCATCATCCTGCACCTGGGGTAGCAACTATAGTGAGGGAGTTCTATGCCAACATGGAA
ACCTCCTCTTCATCATCCTTTGTTCGAGGACACCGTGTCCTCTACGACCCCCTCACCATAAATAGGTTTTATAGTTTGCCCAACTTTGACAGGGATGAATATAGTACCTA
TCTTCATGGCCACTTGGATGTAAACGAAGTCATTCAGACTATTTGTAGGCCAGGGGCAGAATGGATTATGACTGGCGCAGAGGTGGTGAGATTCAAAACCACTGATTTGT
TCGTAGATTACAGAGCGTGGCATACCTTCCTCTGTGCCAAGTTGATGCCTGTGGCGCATCTTAGCGATGTTACCAAGAGTCGTGCCATCCTGCTTTTCGCTATCGCCACA
GGTCGCTCCGTCAATGTCGGTCAGGTCATCCATCAGTCTATGAACCATATTCGCCGACGTTACACGACAGTGGGGCTCGGGCATCCTTCACTGATCACAGCCCTTTGCCG
AGCTGCTGGTTTCGTGTGGGACGCCCAAGAGGAGTTGGTTCATCCTGGAGCGCTAATCGACAAGAACTTCATCAGTCGCTACAGAGGACCTGGACCACAGGGCGCACAAC
CACCCCTCACTATCCATGCACCGCCACAGCATCATGAGCAGCACGAGCAGCCTGCAGAGCCTGAGCAGCAGGAGCAGGAGATTCCACATCCCTCCATCGAGGAGCAGCTG
CAGCAGCTGCGTATGGAGTTTCAGAGCCATCGCCTAGACTTCCAGACTCTCCAGCAGGGCATCCAGAGCCAGCAGCGAGAGCACCAGAGGAGAGGCGTAGAGATCGTCGC
CATTTCCTCTACTCCACGAGCATGCATGCCCACACCTATCAGTGTCAGGAGGCGCGCGCATCGGGACAGCGTCGAGACGCTACCTCCACAGCGTCTCGACGCTGCGAACT
TTCCAGATCTAATTAGGGCATACGCGCGACAGCGTCGCGACGCTATGTCCTCAGCGTCTCGACGCTGTCACGCAGATTCGACAGTATTTAAAGGAAACGATGCTTTCAGA
ATGGGGAGCTGTTTTGGAGAGAGAAAAACGACTTCTAGAGAGAGAAAAACGACCAGAACAAGGCAGAGAGCGAGGGGGCTTGATTCGAGGAGCTTGGATCGAGAGAGGTA
TGCCAAGGATAAGCACGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCG
AACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCACTAAGGGATCCGACGAAAAGACGAAAGGGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACA
AAGCGGTCGAAACAGACTAAAAACACAGATAAGGCGAGTCATGTGACACCAGAGGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATC
TGATAGTGTGACGAATGAAAACTCCACGAGTGATGACGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGACGATGATTACTTTATGTCACCATCGAAAA
GAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCCTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGC
CACCTACTTGACTTCACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACTTGATCCAGCATCAGTGCAAACCCAACTTTACTTCAAGATTGGAGGGAAAATCTTAA
Protein sequenceShow/hide protein sequence
MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKL
NGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTI
VDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNS
TPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYC
KAITVHQVEEVQVVDTQEIHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEK
KLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERR
ELTVRVLHEKEIFKAVGASTNCSEVMMFRSVCAVGCSIFGRLQRLVSGDCLAFEASSRRQLKFFGRVAAVVCSSSPWVSSPSSLGELLGCFLAFCPSSPWVASFEIFSKF
PAILQLVLAVAFSTSPLVTMHKRSRRHAASSSSSPPPPFDQDRFVSAEASARFDKYVEGRNFIPERGFSPDPEMQPNLVNNIVARGWGDFVHHPAPGVATIVREFYANME
TSSSSSFVRGHRVLYDPLTINRFYSLPNFDRDEYSTYLHGHLDVNEVIQTICRPGAEWIMTGAEVVRFKTTDLFVDYRAWHTFLCAKLMPVAHLSDVTKSRAILLFAIAT
GRSVNVGQVIHQSMNHIRRRYTTVGLGHPSLITALCRAAGFVWDAQEELVHPGALIDKNFISRYRGPGPQGAQPPLTIHAPPQHHEQHEQPAEPEQQEQEIPHPSIEEQL
QQLRMEFQSHRLDFQTLQQGIQSQQREHQRRGVEIVAISSTPRACMPTPISVRRRAHRDSVETLPPQRLDAANFPDLIRAYARQRRDAMSSASRRCHADSTVFKGNDAFR
MGSCFGERKTTSRERKTTRTRQRARGLDSRSLDRERYAKDKHVSTRASDRLKAAGVTPGRKPPEQTSPITLGSEQDSEEAMSTTVSVTKGSDEKTKGIEKPTEARSNKKT
KRSKQTKNTDKASHVTPEVAPETSEDTAKHDTEDTESDSVTNENSTSDDGEEQGKKKASLAKKDDDYFMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFG
HLLDFTFKKTSSQLLLHLIQHQCKPNFTSRLEGKS