; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015686 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015686
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr12:20185565..20192655
RNA-Seq ExpressionLag0015686
SyntenyLag0015686
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR025846 - PMR5 N-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.3e-11942.77Show/hide
Query:  MADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY + V  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV++D IRLRLFPFSL+DKAR 
Subjt:  MADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R+ PQHG PDWLQVQ+FYNGL   T+TIVD A+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--SAAALASRPQEETVEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK V G+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--SAAALASRPQEETVEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA
        P +YHP  RNHEN SY NTKNVL   +PP F  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA

Query:  PAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPTG----------EAEE--DTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDK
        P+  E    E CKAIT+   +E E  P  E   TPT           E EE  + +L+E + P        PPI +  L  P+  +K+K  K    QF K
Subjt:  PAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPTG----------EAEE--DTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQY-------------------------------------------------------NRALCDLGARINIIPLSLCKK
        F++ F  ++INIPFA+ALE MP Y                                                       +R LCDLGA IN++P  +C+K
Subjt:  FMNAFMNLNINIPFAEALE-MPQY-------------------------------------------------------NRALCDLGARINIIPLSLCKK

Query:  LDIGEIKSTPVKLQLADQSVVRPVGKVQE
        L +GE+K T + LQLAD+S+  P G +++
Subjt:  LDIGEIKSTPVKLQLADQSVVRPVGKVQE

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.6e-12042.93Show/hide
Query:  MADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY + V  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV++D IRLRLFPFSL+DKAR 
Subjt:  MADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R+ PQHG PDWLQVQ+FYNGL   T+TIVD A+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--SAAALASRPQEETVEQVQYVSNFNSRGYNNSST
         E A  LLE+MA+N+YQWP+ER+  KK V G+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--SAAALASRPQEETVEQVQYVSNFNSRGYNNSST

Query:  PTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA
        P +YHP  RNHEN SY NTKNVL   +PP F  Q  + K  LED + +F+ E++ R  K +  +  I T  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKA

Query:  PAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPTG----------EAEE--DTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDK
        P+  E    E CKAIT+   +E E  P  E   TPT           E EE  + +L+E + P        PPI +  L  P+  +K+K  K    QF K
Subjt:  PAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPTG----------EAEE--DTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQY-------------------------------------------------------NRALCDLGARINIIPLSLCKK
        F++ F  ++INIPFA+ALE MP Y                                                       +R LCDLGA IN++P S+C+K
Subjt:  FMNAFMNLNINIPFAEALE-MPQY-------------------------------------------------------NRALCDLGARINIIPLSLCKK

Query:  LDIGEIKSTPVKLQLADQSVVRPVGKVQE
        L +GE+K T + LQLAD+S+  P G +++
Subjt:  LDIGEIKSTPVKLQLADQSVVRPVGKVQE

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]5.3e-12642.81Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRKNRENVQMADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF
        MRR +   + P+DPEIERT+  LRR   + + MA+++    PR ++DY + V  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  F
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRKNRENVQMADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSF

Query:  LDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYP
        L+IC TVKINGV++D IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R+ PQHG P
Subjt:  LDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYP

Query:  DWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--S
        DWLQVQ+FYNGL   T+TIVD A+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK V G+ +++ ++AL AQ+ +L++     +     QS E  +
Subjt:  DWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIE--S

Query:  AAALASRPQEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAIN
        + ++     E + EQVQYV+N N   Y  +  P +YHP  RNHEN SY NTKNVL   +PP F  Q  + K  LED + +F+ E++ R  K +  +  I 
Subjt:  AAALASRPQEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVL---NPPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEEAVIAIN

Query:  TTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSLDEAEKPNPE----------
        T  +   AAIKNIE Q+GQL   ++   +G  P+  E    E CKAIT+   +E E  P  E   TPT    G+++     DE      E          
Subjt:  TTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSLDEAEKPNPE----------

Query:  ---PPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY--------------------------------------------
           PPI +  L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y                                            
Subjt:  ---PPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY--------------------------------------------

Query:  -----------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE
                   ++ LCDLGA IN++PLS+C+KL + E+K T + LQLAD+S+  P G +++
Subjt:  -----------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.0e-12153.78Show/hide
Query:  LAPLDPEIERTIHR-LRRKNRENVQMADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTV
        L PLDPEI+RT  R LR    +  +MA+    E P+ IRDYFQ      Q GI+  PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+SFL+ICGTV
Subjt:  LAPLDPEIERTIHR-LRRKNRENVQMADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTV

Query:  KINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQL
        K+NGVS DAI+LRLFPFSLQD+A+DWL++I P SITTW+ L QAFL K+FP AK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+ PQHGYPDWLQ+QL
Subjt:  KINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQL

Query:  FYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK-KIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQ----SIESAAAL
        FYNGL  STK+I+D  AGG++ SK  + A T+LED+AT SY WP ER++P      G++EVD+V++L+AQM SL NA  K +  G AQ    SI S AAL
Subjt:  FYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK-KIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQ----SIESAAAL

Query:  ASR-PQEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPP--CFAPQTQDNKKLEDLVGAFIPESSNRTTKLEEAVIAINTTVNG
        AS        E   YV   + R Y +   PTHYHPN RNHENFSYAN KNVL  P             LED++  F+ ES +RTT LE +V AI +TV  
Subjt:  ASR-PQEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPP--CFAPQTQDNKKLEDLVGAFIPESSNRTTKLEEAVIAINTTVNG

Query:  HSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQEESEEEP----ESEDYE
           A++N+E QL Q+   + TM KGK P+  E    E CKA+T+   +    P    E E+ E
Subjt:  HSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQEESEEEP----ESEDYE

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]2.8e-11943.29Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRKNR-ENVQMADQ-----NPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN
        MRR ++L L  +DPE ERT   LR   R E   MA+Q     N   + R IRDY + V     SGI    I A NFELK GLI M +   + G+  EDPN
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRKNR-ENVQMADQ-----NPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN

Query:  SHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKF
        +HL SFL+IC TVK+NGV++DAIRLRLF FSL+DKA+ W QS+  GSITTWD L Q FL K+FP +K+ +LR EI  F+Q   E  +EAWERFK+LLR+ 
Subjt:  SHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKF

Query:  PQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQ
        PQHG+  W+Q+++FYNGL   T+T+VD AAGG L++KT E A  LL+D+ATNSYQWPSERS  KK V G+ EVD ++AL AQ+ SL N  +  +  G+ Q
Subjt:  PQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQ

Query:  SIESAAALASRPQEETV--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEE
        +++S  + +S  QE  V  EQVQY+   N+N RG   ++   HYHP  RNHEN SY N +N L  PP F  Q  D K  LED++G FI E+ +R  K E 
Subjt:  SIESAAALASRPQEETV--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPCFAPQTQDNK-KLEDLVGAFIPESSNRTTKLEE

Query:  AVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSLDEAE-----KPN
         +  I T V+   A +KN+E Q+GQL  ++ +  KGK P++ E    E+C AIT+   +  EE + +    P      T E + +    EAE     KP 
Subjt:  AVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSLDEAE-----KPN

Query:  ----PEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQY-----------------------------------------
            P+ P P L   +P  ++  KKK ++  QF KF+  F  ++INIPFAE L +MP Y                                         
Subjt:  ----PEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQY-----------------------------------------

Query:  -------------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE
                     +RALCD GA IN++PLS+ KKL +GE+K T + LQLAD+S+  P G +++
Subjt:  -------------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE

TrEMBL top hitse value%identityAlignment
A0A3S3N117 Retrotrans_gag domain-containing protein1.9e-9240.14Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRKNRENVQMADQNPPEEP-RPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQM-ARDCAYRGSPTEDPNSHLK
        MRRN++L L PLDPEIERT+ RL+++ ++  +       E+  R + DY   +  G  S I    I ANNFE+K  +IQM A    + G P +DPN+H+ 
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRKNRENVQMADQNPPEEP-RPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQM-ARDCAYRGSPTEDPNSHLK

Query:  SFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHG
        +FL++C T K NGV+ DA+RLRL PFSL+DKA+ WL S+   +ITTWD L + FL KFFP  KTVK+R +I TF Q   E L+EAWER+KELLRK P HG
Subjt:  SFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHG

Query:  YPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAF--MKFSGIGSAQSI
         P W+QVQ FYNGL  +T+T +D A GGTL+ K+ E A  L+E+MATN+YQWPS+    KKI  GV E+D +SAL AQ+ +L+     MK   + S   +
Subjt:  YPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAF--MKFSGIGSAQSI

Query:  ESAAALASRPQE-------ETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKN-VLNPPCF-APQTQDNKKLEDLVGAFIPESSNRTTKLE
            A      +        + EQV YVSN++ +    S+T   Y+P  RNH NFS+ N +N    PP F  PQ ++   LE ++  FI +  ++    +
Subjt:  ESAAALASRPQE-------ETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKN-VLNPPCF-APQTQDNKKLEDLVGAFIPESSNRTTKLE

Query:  EAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVH-----------QEESEEEPESEDYETPTGEAEEDTSLDEAEKP
         A+      +     AI+NIE  +GQL N+++   +G  P+  E    E  +AIT+            + E E+ P     +     +EE    ++   P
Subjt:  EAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITVH-----------QEESEEEPESEDYETPTGEAEEDTSLDEAEKP

Query:  -------NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE
               NP P +P +    P+  +K K  K    QF KF++ F  L++NIPFA+ALE
Subjt:  -------NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.4e-10340.12Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRKNRE----NVQMADQN----------PPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y
        M+R  +L L P DP+IERT  R RR+N +    N  MA+ N           PE  R +RDY   + QG    I    INANNFE+K   IQM +    +
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRKNRE----NVQMADQN----------PPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-Y

Query:  RGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWE
         G P++DPNSHL +FL+IC T K NGV+ DAIRLRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFFP AKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWE

Query:  RFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFM
        RFKELLR+ P HG PDWLQVQ FYNGL  S KTI+D AAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K  VG +E+D +  L  Q+ +L+   +
Subjt:  RFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFM

Query:  KFSGIGSAQS----IESAAALASRPQ-EETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNP-PCFAPQTQDNKKLEDLVGAFIPES
           G+ + Q+     E      S  Q     E VQ+V NFN R  NN  + T Y+P  RNH NFS++N     NP P   P  Q   +         P+ 
Subjt:  KFSGIGSAQS----IESAAALASRPQ-EETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNP-PCFAPQTQDNKKLEDLVGAFIPES

Query:  SNRTTKLEEAVI----AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-IEYCKAIT---------VHQEESEEEPESEDYETPTGEA
          + ++LEE ++      +  +    A+++N+ETQ+GQL N ++   +G  P++ Q  P+  E C+AIT         V+Q+  E E E  D E   G  
Subjt:  SNRTTKLEEAVI----AINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-IEYCKAIT---------VHQEESEEEPESEDYETPTGEA

Query:  EEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY--------------------------------
        E +  + + +    E    S  +  P    ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y                                
Subjt:  EEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQY--------------------------------

Query:  -----------------------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE
                                +AL DLGA IN++P S+ +KL +GE K T V LQLAD+S V P G +++
Subjt:  -----------------------NRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGKVQE

A0A6J1DU19 uncharacterized protein LOC1110243611.1e-8940.85Show/hide
Query:  IRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITT
        IRDY Q  F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+CGTVK+NGV  DAIRLRLFP SLQDK               
Subjt:  IRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITT

Query:  WDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMA
           +VQAFL  FFP AKT +LRTEI +F++   EQLFE WER+KELLRK PQHG  +WLQ+Q+FYNGL   T+TI+D AAGGTLLS+T ENA  LL+DMA
Subjt:  WDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRP-QEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEN
         NS+QWPSERS  KK V G++E+D++S+L+AQ+ +L NA  K SG G++ S E  AA  +    E T+EQ Q+ S                HP       
Subjt:  TNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRP-QEETVEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEN

Query:  FSYANTKNVLNPPCFAPQTQDNKKLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITV
                           +    LEDL+GAFI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+
Subjt:  FSYANTKNVLNPPCFAPQTQDNKKLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKAITV

Query:  HQEESEEEPESEDYETPTGEAEEDTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFM----------NAFMNLNINIPFAEALE-
           +  +EPE +  E P    EE  + +E  K        + P  S+    P      +        + +FM           A+  +N+    +  L+ 
Subjt:  HQEESEEEPESEDYETPTGEAEEDTSLDEAEKP-----NPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFM----------NAFMNLNINIPFAEALE-

Query:  -MPQ------------------YNRALCDLGARINIIPLSL
         +PQ                  +N+ALCD+ A IN++PL +
Subjt:  -MPQ------------------YNRALCDLGARINIIPLSL

A0A6P6X688 uncharacterized protein LOC1137397912.4e-8434.71Show/hide
Query:  IRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITT
        +RD+     QG Q+ I    +NAN FE++  LIQM +   Y G+ TED +SHL +F +IC T+K NGVS DAI+ RLFPFSL+DKA+ WLQ  +P + T 
Subjt:  IRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSITT

Query:  WDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMA
        W  L + FL KFF   KT K R +I +F QQ +E L+E WER++EL R+ P HG PDWL VQ FYNGLT  TKT VD AAGG L+ KTV+ A+ L+E+MA
Subjt:  WDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRPQEE-----TVEQVQYVSNFNSRGYNNSSTPTHYHPNNR
         N+YQW +ER   ++   G+ EVD ++ L A+M ++     ++ G  S + +  A         +     +  QVQY++N+N    NN  + T Y+P  R
Subjt:  TNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRPQEE-----TVEQVQYVSNFNSRGYNNSSTPTHYHPNNR

Query:  NHENFSY---ANTKNVLNPPCFAPQ--TQDNKKLEDLVGAFIPESSNRTTKLEEAVIA-------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP
        NH NF +    N +  +NPP F P+    ++K    L    +   SN   K+E+ V A       I   ++  +   +N+E QLGQ+ NVV+  N+   P
Subjt:  NHENFSY---ANTKNVLNPPCFAPQ--TQDNKKLEDLVGAFIPESSNRTTKLEEAVIA-------INTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAP

Query:  AEQEKPQIEYCKAITVHQEES-------EEEPESEDYET-------------PTGEAEEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNNQVQFD
        ++ E    E+ KAIT+  ++        E E E E  E                 E  E+  L   +     PP+P      P+  K  K    N  +F+
Subjt:  AEQEKPQIEYCKAITVHQEES-------EEEPESEDYET-------------PTGEAEEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNNQVQFD

Query:  KFMNAFMNLNINIPFAEA-LEMPQY-------------------------------------------------------NRALCDLGARINIIPLSLCK
        KF+N F  L+INIPF +A L++P Y                                                       ++  CD GA +++IPL++ +
Subjt:  KFMNAFMNLNINIPFAEA-LEMPQY-------------------------------------------------------NRALCDLGARINIIPLSLCK

Query:  KLDIGEIKSTPVKLQLADQSVVRPVGKVQER
        +L + E+K   + LQLAD+S+  P+G ++ +
Subjt:  KLDIGEIKSTPVKLQLADQSVVRPVGKVQER

A0A6P6XAQ1 Reverse transcriptase1.3e-9339.55Show/hide
Query:  RPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSI
        R +RD+     QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T+K NGVS+DAI+LRLFPFSL+DKA+ WLQS  P + 
Subjt:  RPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSKDAIRLRLFPFSLQDKARDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLED
        TTWD L +AFL KFFP  KT KLR +I +F QQ  E L+EAWER++EL R+ P HG PDWL VQ FYNGLT  TKT VD AAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIVDVAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRPQEE-----TVEQVQYVSNFNSRGYNNSSTPTHYHPN
        MA N+YQW +ER   ++   G+ EVD ++ L A+M ++     +  G  S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Subjt:  MATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRPQEE-----TVEQVQYVSNFNSRGYNNSSTPTHYHPN

Query:  NRNHENFSY---ANTKNVLNPPCF-APQTQDNKK------LEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNK
         RNH NF +    N +  +NPP F   QT    K      +E L  A    S+++  KL  A       + G    +    +N+E QLGQ+ N V+  N+
Subjt:  NRNHENFSY---ANTKNVLNPPCF-APQTQDNKK------LEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAI----KNIETQLGQLVNVVSTMNK

Query:  GKAPAEQEKPQIEYCKAITVHQEESEEEP---------------------ESEDYETPTGEAEEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNN
        G  P++ E    E+ KAIT+   +   EP                     E    E    + EE+    E   P P PPIPS    + +   KK+K  ++
Subjt:  GKAPAEQEKPQIEYCKAITVHQEESEEEP---------------------ESEDYETPTGEAEEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNN

Query:  Q-VQFDKFMNAFM--NLNINIPFAEALEMP------QYNRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVG
        + +   +  +A +   L   +    +  +P      ++++ALCDLGA +++IPL++ ++L + E+K T + LQLAD+S+  P+G
Subjt:  Q-VQFDKFMNAFM--NLNINIPFAEALEMP------QYNRALCDLGARINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVG

SwissProt top hitse value%identityAlignment
F4IH21 Protein trichome birefringence-like 332.1e-0860.98Show/hide
Query:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE
        E CDVFSGKWV D  S PLY+E +CPY+  QL C +HGR +
Subjt:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE

O80919 Protein trichome birefringence-like 341.1e-0961.9Show/hide
Query:  KCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSEVS
        +C++F GKWVFDN SYPLY E  C +MS+QLAC K GR ++S
Subjt:  KCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSEVS

Q1PFD9 Protein trichome birefringence-like 311.0e-0753.66Show/hide
Query:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE
        E C+VF G+WV+DN SYPLY E  CPY+  Q  C ++GR +
Subjt:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE

Q8RXQ1 Protein trichome birefringence-like 351.6e-1947.9Show/hide
Query:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP
        G+   L R  KCN TK+YSG+KI W D  +    +     +KCDVFSGKWVFDN +SYPL+ ESQCPYMS+QLAC KHGR +   LE +  R  P     
Subjt:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP

Query:  PLCRGSGVDPYAALSGNRV
         L R + ++ +  L G R+
Subjt:  PLCRGSGVDPYAALSGNRV

Q94K00 Protein trichome birefringence-like 284.0e-0742.86Show/hide
Query:  SSRRKVRSEKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE
        SS  ++  ++CD+F+G+WVFDN +YPLY E +C +++ Q+ C ++GR +
Subjt:  SSRRKVRSEKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE

Arabidopsis top hitse value%identityAlignment
AT2G38320.1 TRICHOME BIREFRINGENCE-LIKE 347.9e-1161.9Show/hide
Query:  KCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSEVS
        +C++F GKWVFDN SYPLY E  C +MS+QLAC K GR ++S
Subjt:  KCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSEVS

AT2G40320.1 TRICHOME BIREFRINGENCE-LIKE 331.5e-0960.98Show/hide
Query:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE
        E CDVFSGKWV D  S PLY+E +CPY+  QL C +HGR +
Subjt:  EKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSE

AT5G01620.1 TRICHOME BIREFRINGENCE-LIKE 351.1e-2047.9Show/hide
Query:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP
        G+   L R  KCN TK+YSG+KI W D  +    +     +KCDVFSGKWVFDN +SYPL+ ESQCPYMS+QLAC KHGR +   LE +  R  P     
Subjt:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP

Query:  PLCRGSGVDPYAALSGNRV
         L R + ++ +  L G R+
Subjt:  PLCRGSGVDPYAALSGNRV

AT5G01620.2 TRICHOME BIREFRINGENCE-LIKE 351.1e-2047.9Show/hide
Query:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP
        G+   L R  KCN TK+YSG+KI W D  +    +     +KCDVFSGKWVFDN +SYPL+ ESQCPYMS+QLAC KHGR +   LE +  R  P     
Subjt:  GSASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPP

Query:  PLCRGSGVDPYAALSGNRV
         L R + ++ +  L G R+
Subjt:  PLCRGSGVDPYAALSGNRV

AT5G01620.3 TRICHOME BIREFRINGENCE-LIKE 358.4e-2148.31Show/hide
Query:  SASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPPP
        +A  L R  KCN TK+YSG+KI W D  +    +     +KCDVFSGKWVFDN +SYPL+ ESQCPYMS+QLAC KHGR +   LE +  R  P      
Subjt:  SASTLPRSRKCNGTKDYSGRKISW-DGQKSESSRRKVRSEKCDVFSGKWVFDN-TSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPPP

Query:  LCRGSGVDPYAALSGNRV
        L R + ++ +  L G R+
Subjt:  LCRGSGVDPYAALSGNRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGGAATAAAGATTTAATTTTAGCACCTTTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTCGAAGGAAGAATAGAGAAAACGTTCAAATGGCTGACCAAAA
TCCACCTGAAGAGCCTAGGCCTATTAGAGATTATTTTCAGCATGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTCAAAA
CCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAAT
GGAGTCTCTAAGGATGCTATTCGTTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACTTGGGATGC
TTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCTTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGTTGTTCGAAGCTT
GGGAGCGGTTCAAAGAGCTACTGAGGAAGTTTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTT
GATGTAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACC
TAAAAAGATTGTTGTTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTATAGGGAGCGCAC
AGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCGTCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACA
CCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAATCCTCCTTGTTTTGCCCCTCAAACTCAAGATAATAAAAA
GTTAGAAGATCTTGTTGGAGCTTTCATTCCAGAGTCTAGTAACAGGACAACAAAGTTAGAGGAGGCAGTCATTGCCATCAACACCACAGTGAATGGCCACAGCGCTGCCA
TCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATAGAGTATTGTAAGGCA
ATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATTAGATGAGGCTGAAAAGCCTAA
CCCTGAGCCTCCTATTCCTTCTCTCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGAGCATTATGTGACTTAGGTGCTAGAATTAATATCATTCCTCTATCGTTGTGT
AAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAACTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGA
TTCACAGAACGAAAGCCTCCTTGAAGCACGATCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAAAAGTTGCAGCGTCGAGACGCTGTG
GAGGTAGCGCCTCGACGCTGCCTCGTAGCCGTAAGTGCAATGGAACGAAAGATTACAGTGGGCGGAAAATCTCGTGGGATGGCCAAAAATCGGAGTCAAGTCGTCGGAAA
GTGAGGTCGGAGAAGTGTGATGTGTTTTCAGGCAAATGGGTTTTTGATAATACTTCGTATCCACTGTACGATGAGTCGCAATGTCCATACATGTCCAACCAGTTGGCTTG
CCACAAGCATGGCAGATCTGAGGTATCAGTACTGGAGATGGAAACCAGTAGGGAGGCTCCTCCAATCCTCCCTCCTCCTCTTTGTAGAGGAAGTGGCGTCGATCCCTACG
CTGCTCTCTCTGGTAATCGCGTTGCTGACTCTAGAGCTCCTGCCGATGACTCTGGAACTCAAGCTGGTGAGTCTGAAACTCCAGCCGGTGACTCTGGAACTCCATGCGCA
GCTGCTGCAGCTGCGCCTCGATTGTGGAAGGTGGAACCTCCTGCTCCTGATGCTCGGGCTCAGCAGACTGCTCAGGATACTCTGGCGGTGCAGAAATGGGAAGAGCTGGC
TGTGCACCCTGTTGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGGAATAAAGATTTAATTTTAGCACCTTTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTCGAAGGAAGAATAGAGAAAACGTTCAAATGGCTGACCAAAA
TCCACCTGAAGAGCCTAGGCCTATTAGAGATTATTTTCAGCATGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTCAAAA
CCGGTCTCATTCAGATGGCTCGAGATTGTGCTTATAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAAT
GGAGTCTCTAAGGATGCTATTCGTTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACTTGGGATGC
TTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCTTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGTTGTTCGAAGCTT
GGGAGCGGTTCAAAGAGCTACTGAGGAAGTTTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTT
GATGTAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACC
TAAAAAGATTGTTGTTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTATAGGGAGCGCAC
AGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCGTCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACA
CCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAATCCTCCTTGTTTTGCCCCTCAAACTCAAGATAATAAAAA
GTTAGAAGATCTTGTTGGAGCTTTCATTCCAGAGTCTAGTAACAGGACAACAAAGTTAGAGGAGGCAGTCATTGCCATCAACACCACAGTGAATGGCCACAGCGCTGCCA
TCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATAGAGTATTGTAAGGCA
ATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCATTAGATGAGGCTGAAAAGCCTAA
CCCTGAGCCTCCTATTCCTTCTCTCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGAGCATTATGTGACTTAGGTGCTAGAATTAATATCATTCCTCTATCGTTGTGT
AAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAACTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGAAAGGTGCAAGAAAGAGCACCTCTGTTGGA
TTCACAGAACGAAAGCCTCCTTGAAGCACGATCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAAAAGTTGCAGCGTCGAGACGCTGTG
GAGGTAGCGCCTCGACGCTGCCTCGTAGCCGTAAGTGCAATGGAACGAAAGATTACAGTGGGCGGAAAATCTCGTGGGATGGCCAAAAATCGGAGTCAAGTCGTCGGAAA
GTGAGGTCGGAGAAGTGTGATGTGTTTTCAGGCAAATGGGTTTTTGATAATACTTCGTATCCACTGTACGATGAGTCGCAATGTCCATACATGTCCAACCAGTTGGCTTG
CCACAAGCATGGCAGATCTGAGGTATCAGTACTGGAGATGGAAACCAGTAGGGAGGCTCCTCCAATCCTCCCTCCTCCTCTTTGTAGAGGAAGTGGCGTCGATCCCTACG
CTGCTCTCTCTGGTAATCGCGTTGCTGACTCTAGAGCTCCTGCCGATGACTCTGGAACTCAAGCTGGTGAGTCTGAAACTCCAGCCGGTGACTCTGGAACTCCATGCGCA
GCTGCTGCAGCTGCGCCTCGATTGTGGAAGGTGGAACCTCCTGCTCCTGATGCTCGGGCTCAGCAGACTGCTCAGGATACTCTGGCGGTGCAGAAATGGGAAGAGCTGGC
TGTGCACCCTGTTGTCTAG
Protein sequenceShow/hide protein sequence
MRRNKDLILAPLDPEIERTIHRLRRKNRENVQMADQNPPEEPRPIRDYFQHVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKIN
GVSKDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPLAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKFPQHGYPDWLQVQLFYNGLTPSTKTIV
DVAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVVGVFEVDKVSALQAQMTSLANAFMKFSGIGSAQSIESAAALASRPQEETVEQVQYVSNFNSRGYNNSST
PTHYHPNNRNHENFSYANTKNVLNPPCFAPQTQDNKKLEDLVGAFIPESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQIEYCKA
ITVHQEESEEEPESEDYETPTGEAEEDTSLDEAEKPNPEPPIPSLTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRALCDLGARINIIPLSLC
KKLDIGEIKSTPVKLQLADQSVVRPVGKVQERAPLLDSQNESLLEARSTRRANDVKQALMGGNPRKVAASRRCGGSASTLPRSRKCNGTKDYSGRKISWDGQKSESSRRK
VRSEKCDVFSGKWVFDNTSYPLYDESQCPYMSNQLACHKHGRSEVSVLEMETSREAPPILPPPLCRGSGVDPYAALSGNRVADSRAPADDSGTQAGESETPAGDSGTPCA
AAAAAPRLWKVEPPAPDARAQQTAQDTLAVQKWEELAVHPVV