CuGenDBv2

Tan0008965 (gene) of Snake gourd v1 genome

Gene ID: Tan0008965
Organism: Trichosanthes anguina (Snake gourd v1)
Description: 1-aminocyclopropane-1-carboxylate oxidase
Genome location: LG04:86255926..86260570
RNA-Seq Expression: Tan0008965
Synteny: Tan0008965
Gene Ontology terms:
    GO:0016021 - integral component of membrane (cellular component)
    GO:0051213 - dioxygenase activity (molecular function)
InterPro domains:
    IPR026992 - Non-haem dioxygenase N-terminal domain
    IPR027443 - Isopenicillin N synthase-like superfamily
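The genome location field uses a `SEQID:START..END` layout. The following is a minimal Python sketch of how such a string could be split and the locus length computed. The function names `parse_location` and `span_length` are illustrative only and are not part of any CuGenDB tool, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:86255926..86260570")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 4645 bp on LG04, which is longer than the mRNA listed further down, as would be expected if the gene contains introns.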


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606752.1 hypothetical protein SDJN03_00094, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.0e-157 | % identity: 76.61
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        M MMSPE TAA+PFDFRAPPPSPI TSRRSSVTNDDVLT+FLEHSLRVP+LVLPENIFPRQRFI+NPPRIDFRSIES D+DSV+++LDSMGSIGCFQL N
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIP+ELIGA+A+AAA GVFG+S EKK  V RSPEK YGFEEYW GEDESE+SEEFVWSRDEGL+ EMEAIS FGYS+FS KMEALTQVTEK+GEKIL+I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS--------LFHVYSKRGWVSFVPDQSAIVVT
        FRENCGKVAEKEV    GSVWCV+KQKQ    NEELE C+KHDV+RMLIRG+D SHA CFHFCHGSS+S        +FHVYSK+GWV FVP++SAI+VT
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS--------LFHVYSKRGWVSFVPDQSAIVVT

Query:  VGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNN-----GISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL
        VGDQIQAWSGGQYKHVIGRPIY    KEE GNN  N      GISMAF F+P SS SN      TLSL HQALFALFL LLYN   Y L
Subjt:  VGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNN-----GISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL

KAG7036466.1 hypothetical protein SDJN02_00083, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.7e-157 | % identity: 77.55
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        M MMSPE TAA+PFDFRAPPPSPI TSRRSSVTNDDVLT+FLEHSLRVP+LVLPENIFPRQRFI+NPPRIDFRSIES D+DSV+++LDSMGSIGCFQL N
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIP+ELIGA+A+AAA GVFG+S EKK  V RSPEK YGFEEYW GEDESE+SEEFVWSRDEGL+ EMEAIS FGYS+FS KMEALTQVTEK+GEKIL+I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS------LFHVYSKRGWVSFVPDQSAIVVTVG
        FRENCGKVAEKEV    GSVWCV+KQKQ    N+ELE C+KHDV+RMLIRG+D SHA CFHFCHGSS+S      +FHVYSK+GWV FVP++SAI+VTVG
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS------LFHVYSKRGWVSFVPDQSAIVVTVG

Query:  DQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNN-GISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL
        DQIQAWSGGQYKHVIGRPIY    KEE GNN  N  GISMAF F+P SS SN      TLSL HQALFALFL LLYN   Y L
Subjt:  DQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNN-GISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL

XP_004151051.2 uncharacterized protein LOC101203918 [Cucumis sativus] | e value: 1.2e-168 | % identity: 79.79
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        MVMMSPE  AA+PF+FRAPPPSPI TSRRSSVTND+VLTEFLEHSLRVPDLVLP+ IFPR+RF+E+PPRID+R IES D DSVLK+LDSM S G FQLVN
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIPVELIGA+A AA  GVFG+SPEKKV V RSPEKAYGFEEYWHGEDESE+SEEFVWSRDE LK+EME ISP GYSNFSKKME LTQ+TEK+GEK+L I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMS---WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI
        F EN GK+A  EV+LGHGSVWCVYK K +     WN+ELENC KHDVIRMLIRGTDFSHAFCFHFCHGSS  LFH YSKRGWVSFVPD SAIVVTVGD I
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMS---WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI

Query:  QAWSGGQYKHVIGRPIYRDHKKE-EIGNNNKNNGISMAFHFSPNSSSSNSH-----NEIRTLSLAHQALFALFLTLLYNFLFYILK
        Q WSGGQYKHVIGRPIY+DH KE + G+NN  NGISMAF FSP  SSS+S+     NEIRTLSLAHQALFALFLTL YNF FYILK
Subjt:  QAWSGGQYKHVIGRPIYRDHKKE-EIGNNNKNNGISMAFHFSPNSSSSNSH-----NEIRTLSLAHQALFALFLTLLYNFLFYILK

XP_022948619.1 uncharacterized protein LOC111452244 [Cucurbita moschata] | e value: 6.4e-159 | % identity: 78.48
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        M MMSPE TAA+PFDFRAPPPSPI TSRRSSVTNDDVLT+FLEHSLRVP+LVLPENIFPRQRFI+NPPRIDFRSIES D+DSV+++LDSMGSIGCFQL N
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIP+ELIGA+A+AAA GVFG+S EKK  V RSPEK YGFEEYW GEDESE+SEEFVWSRDEGL+ EME IS FGYS+FS KMEALTQVTEK+GEKIL+I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS----LFHVYSKRGWVSFVPDQSAIVVTVGDQ
        FRENCGKVAEKEV    GSVWCV+KQKQR   N+ELENCLKHDV+RMLIRG+D SHA CFHFCHGSS+S    +FHVYSK+GWV FVP++SAI+VTVGDQ
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS----LFHVYSKRGWVSFVPDQSAIVVTVGDQ

Query:  IQAWSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL
        IQAWSGGQYKHVIGRPIY    KEE GNN N   GISMAF F+P SS SN      TLSL HQALFALFL LLYN   Y L
Subjt:  IQAWSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL

XP_038882380.1 uncharacterized protein LOC120073645 [Benincasa hispida] | e value: 8.9e-177 | % identity: 84.2
Query:  MMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHG
        MMSPE +A +PF+FRAPPPSPIGTSRRSSVTND+VLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFR IES D DSVLK+LDSM SIG FQLVNHG
Subjt:  MMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHG

Query:  IPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFR
        IP +LIGA+AAAAA GVFG+SPEKKVAVARSPEKAYGFEEYWHGEDESEL EEFVWSRDEGLKVEMEAISPFGYS+FSKKME +TQ+TEK+GEKIL+I  
Subjt:  IPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFR

Query:  ENC-GKVAEKEVLLGHGSVWCVYKQ-KQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAW
        EN  GKV +K+V+LGHGSVWCVYKQ KQRM+WN+ELENC KHDVIRMLIRGTDFSHAFCFHFCHGSS  LFHVY+KRGWVSFVPD+SAIVVTVG+ IQAW
Subjt:  ENC-GKVAEKEVLLGHGSVWCVYKQ-KQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAW

Query:  SGGQYKHVIGRPIYRDHKKEE-----IGNNNKNNGISMAFHFSPNSSSSNSH----NEIRTLSLAHQALFALFLTLLYNFLFYILK
        SGGQYKHVIGRPIYRDH KEE       NNN NNGISMAF FSP SSSS++     NEIRTLSLAHQALFALFLTLLYNF FYILK
Subjt:  SGGQYKHVIGRPIYRDHKKEE-----IGNNNKNNGISMAFHFSPNSSSSNSH----NEIRTLSLAHQALFALFLTLLYNFLFYILK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBM9 DIOX_N domain-containing protein | e value: 5.7e-169 | % identity: 79.79
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        MVMMSPE  AA+PF+FRAPPPSPI TSRRSSVTND+VLTEFLEHSLRVPDLVLP+ IFPR+RF+E+PPRID+R IES D DSVLK+LDSM S G FQLVN
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIPVELIGA+A AA  GVFG+SPEKKV V RSPEKAYGFEEYWHGEDESE+SEEFVWSRDE LK+EME ISP GYSNFSKKME LTQ+TEK+GEK+L I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMS---WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI
        F EN GK+A  EV+LGHGSVWCVYK K +     WN+ELENC KHDVIRMLIRGTDFSHAFCFHFCHGSS  LFH YSKRGWVSFVPD SAIVVTVGD I
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMS---WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI

Query:  QAWSGGQYKHVIGRPIYRDHKKE-EIGNNNKNNGISMAFHFSPNSSSSNSH-----NEIRTLSLAHQALFALFLTLLYNFLFYILK
        Q WSGGQYKHVIGRPIY+DH KE + G+NN  NGISMAF FSP  SSS+S+     NEIRTLSLAHQALFALFLTL YNF FYILK
Subjt:  QAWSGGQYKHVIGRPIYRDHKKE-EIGNNNKNNGISMAFHFSPNSSSSNSH-----NEIRTLSLAHQALFALFLTLLYNFLFYILK

A0A1S3BIA4 1-aminocyclopropane-1-carboxylate oxidase | e value: 1.4e-143 | % identity: 80.97
Query:  FPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFV
        FPR+RF+ENPPRIDFR IES D DSVLK+LDSM S G FQLVNHGIPVELIGA+AAAA  GVFG+SPEKKVAV RSPEKAYGFEE WHGEDESE+SEEFV
Subjt:  FPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFV

Query:  WSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQ--KQRMS-WNEELENCLKHDVIRMLIRGTDF
        W+RDE LK+EMEAISP GYSNFSKKME LTQ+TEK+GEKIL+IF EN GK+A  EV  GHGSVWCVYKQ  KQR+S WN+ELENC KHDVIRMLIRGTDF
Subjt:  WSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQ--KQRMS-WNEELENCLKHDVIRMLIRGTDF

Query:  SHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEE-IGNNNKNNGISMAFHFSPNSSSSNSH------
        SHAFCFHFCHGSS  LFH YSKRGWVSFV D+SA+VVTVGD IQ WSGGQYKHVIGRPIY+DH KEE  G+NN NNGISMAF FSP SSSS+S       
Subjt:  SHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEE-IGNNNKNNGISMAFHFSPNSSSSNSH------

Query:  -NEIRTLSLAHQALFALFLTLLYNFLFYILK
         NEIRTLSLAHQALFALFLTLLYNF FYILK
Subjt:  -NEIRTLSLAHQALFALFLTLLYNFLFYILK

A0A5A7U6N0 1-aminocyclopropane-1-carboxylate oxidase | e value: 1.8e-138 | % identity: 83.39
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        MVMMSPE  AA+PF+FRAPPPSPIGTSRRSSVTND+VLTEFLEHSLRVPDLVLPE IFPR+RF+ENPPRIDFR IES D DSVLK+LDSM S G FQLVN
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIPVELIGA+AAAA  GVFG+SPEKKVAV RSPEKAYGFEE WHGEDESE+SEEFVW+RDE LK+EMEAISP GYSNFSKKME LTQ+TEK+GEKIL+I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQ--KQRMS-WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI
        F EN GK+A  EV  GHGSVWCVYKQ  KQR+S WN+ELENC KHDVIRMLIRGTDFSHAFCFHFCHGSS  LFH YSKRGWVSFV D+SA+VVTVGD I
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQ--KQRMS-WNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQI

Query:  Q
        Q
Subjt:  Q

A0A6J1GAF1 uncharacterized protein LOC111452244 | e value: 3.1e-159 | % identity: 78.48
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        M MMSPE TAA+PFDFRAPPPSPI TSRRSSVTNDDVLT+FLEHSLRVP+LVLPENIFPRQRFI+NPPRIDFRSIES D+DSV+++LDSMGSIGCFQL N
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIP+ELIGA+A+AAA GVFG+S EKK  V RSPEK YGFEEYW GEDESE+SEEFVWSRDEGL+ EME IS FGYS+FS KMEALTQVTEK+GEKIL+I
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS----LFHVYSKRGWVSFVPDQSAIVVTVGDQ
        FRENCGKVAEKEV    GSVWCV+KQKQR   N+ELENCLKHDV+RMLIRG+D SHA CFHFCHGSS+S    +FHVYSK+GWV FVP++SAI+VTVGDQ
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS----LFHVYSKRGWVSFVPDQSAIVVTVGDQ

Query:  IQAWSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL
        IQAWSGGQYKHVIGRPIY    KEE GNN N   GISMAF F+P SS SN      TLSL HQALFALFL LLYN   Y L
Subjt:  IQAWSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL

A0A6J1K8P0 uncharacterized protein LOC111492693 | e value: 9.4e-156 | % identity: 78.31
Query:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN
        M MMSPE TAA PFDFRAPPPSPI TSRRSSVTNDDVLT+FLEHSLRVP+LVLPENIFPRQRFI+ PPRIDFRSIES D+DSV+++LDSMGSIGCFQL N
Subjt:  MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVN

Query:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI
        HGIP ELIGA A+AAAVGVFG+S EKK+ V RSPEK YGFEEYW GEDESE+SEEFVWSRDEGL+ EME IS FGYS+FS KMEALTQVTEK+GEKI+KI
Subjt:  HGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKI

Query:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS-LFHVYSKRGWVSFVPDQSAIVVTVGDQIQA
        FRENCGKVAEKEV  G GSVWCVYKQKQ    N+ELE CLKHDV+RMLIRG+D SHA CFHF HGSS+S +FHV SKRGWV FVP++SAI+VTVGDQIQA
Subjt:  FRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNS-LFHVYSKRGWVSFVPDQSAIVVTVGDQIQA

Query:  WSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL
        WSGGQYKHVIGRPIY    KEE GN+ N   GISMAF F+P SS S++     TLSL HQALFALFL LLYN   Y L
Subjt:  WSGGQYKHVIGRPIYRDHKKEEIGNN-NKNNGISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYIL

SwissProt top hits (e value, % identity, alignment)
Q08506 1-aminocyclopropane-1-carboxylate oxidase 1 | e value: 5.5e-04 | % identity: 22.57
Query:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDE
        +EN P I    +  ++  + ++++ D+  + G F+LVNHGIP E++  +      G +    E++     + +   G         ++E++ +  W    
Subjt:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDE

Query:  GLK-VEMEAIS--PFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSH
         LK + +  IS  P     + + M    +  EK+ E++L +  EN G        L  G +   +   +  ++  ++ N   C K D+I+ L   TD   
Subjt:  GLK-VEMEAIS--PFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSH

Query:  AFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI
                    S   +     W+   P + +IVV +GDQ++  + G+YK V+ R I
Subjt:  AFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI

Q08507 1-aminocyclopropane-1-carboxylate oxidase 3 | e value: 1.1e-04 | % identity: 23.05
Query:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPE--KAYGFEEYWHGEDESELSEEFVWSR
        +EN P I+   +   + D+ ++++ D+  + G F+LVNHGIP E++  +              KK    R  E   + G E       + +    F    
Subjt:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPE--KAYGFEEYWHGEDESELSEEFVWSR

Query:  DEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSHA
           L V   +  P     + + M    +  EK+ E++L +  EN G        L  G +   +   +  ++  ++ N   C K D+I+ L   TD    
Subjt:  DEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSHA

Query:  FCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI
                   S   +     W+   P + +IVV +GDQ++  + G+YK V+ R I
Subjt:  FCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI

Q08508 1-aminocyclopropane-1-carboxylate oxidase 4 | e value: 2.5e-04 | % identity: 23.05
Query:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPE--KAYGFEEYWHGEDESELSEEFVWSR
        +EN P I+  ++   + D+ ++++ D+  + G F+LVNHGIP E++  +        F     KK    R  E   + G E       + +    F    
Subjt:  IENPPRIDFRSIESLDYDSVLKLL-DSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPE--KAYGFEEYWHGEDESELSEEFVWSR

Query:  DEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSHA
           L V   +  P     + + M    +  EK+ E++L +  EN G        L  G +   +   +  ++  ++ N   C K D+I+ L   TD    
Subjt:  DEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELEN---CLKHDVIRMLIRGTDFSHA

Query:  FCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI
                   S   +     W+   P + +IV+ +GDQ++  + G+YK V  R I
Subjt:  FCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI

Q41452 Flavonol synthase/flavanone 3-hydroxylase | e value: 7.2e-04 | % identity: 24.32
Query:  PRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVE
        P ID  +++  +   V +++++    G FQ++NHGIP E+I  +          V  E+K  +A+ P  A   E Y     + E+  +  W   + L  +
Subjt:  PRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVE

Query:  MEAISPFGYSNFSKKMEALTQVTEKIGEKILK----IFRE-NCGKVAE-KEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFH
        +   S   Y  + K   +  +  E+  + + K    IFR  + G   E  E++   GS   VY  K           C + D+   ++  TD S+     
Subjt:  MEAISPFGYSNFSKKMEALTQVTEKIGEKILK----IFRE-NCGKVAE-KEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFH

Query:  FCHGSSNSLFHVYSKRGW--VSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHK
               +   V+    W  V+++P  +AI+V +GDQ++  S G+YK V  R     +K
Subjt:  FCHGSSNSLFHVYSKRGW--VSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHK

Q9FR99 1-aminocyclopropane-1-carboxylate oxidase | e value: 5.9e-06 | % identity: 25.48
Query:  PRIDFRSIESLD-YDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGL--
        P IDF  ++  +  +++ ++ +     G FQLVNHGIPVEL+  +              KKV    S E     EE + G    +L +  V   D     
Subjt:  PRIDFRSIESLD-YDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGL--

Query:  KVEMEAIS--------PFGYSNFSKKMEALTQVTEKIGEKILKIFRENCG--KVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDF
         V+ E +         P    +F + M+   +   K+ EK++++  EN G  K   K+   G G     +  K           C + D+++ L   TD 
Subjt:  KVEMEAIS--------PFGYSNFSKKMEALTQVTEKIGEKILKIFRENCG--KVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDF

Query:  SHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDH
               F       L  +   R W+   P   AIV+  GDQI+  S G+YK    R +   H
Subjt:  SHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDH

Arabidopsis top hits (e value, % identity, alignment)
AT1G79760.1 downstream target of AGL15-4 | e value: 6.4e-48 | % identity: 36.71
Query:  DFRAPPPSPIGTSRRSSVT-NDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAA
        DFRAPPPSP+ +SRR+S T N+DVL+EFL+   RVP+LVLP+ +FP+  F+ NPP  DF  ++SL       LLD++ +IGCFQLVNHG+P  ++     
Subjt:  DFRAPPPSPIGTSRRSSVT-NDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAA

Query:  AAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEV
                              KA   +  +H E     +EEFV+++D              YS+  + M    ++ + + EK+        G  ++KE 
Subjt:  AAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEV

Query:  LLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI
            G   C  K+         ++N  K + IRML+RG D  H+ C +FCH      FHVYSKRGWVSF P   A++VT+GD  Q WS G++K V+GRP+
Subjt:  LLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI

Query:  YRDHKKEEIGNNNKNNGISMAFHFSPNSSSSNSHNEI------RTLSLAHQALFALFLTLLYNFL
                  ++  +N IS++F ++  ++  N+  +I      +T+SL  Q LFALFLTLL+ FL
Subjt:  YRDHKKEEIGNNNKNNGISMAFHFSPNSSSSNSHNEI------RTLSLAHQALFALFLTLLYNFL

AT2G38500.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 1.8e-18 | % identity: 29.67
Query:  PPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIEN---PPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAA
        PPPSPI  +R S     ++LTE +E S++VP+L LPE+    +        P  IDFR + S    SV +L+ S    G F++  HGI  E + ++   +
Subjt:  PPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIEN---PPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAA

Query:  AVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRD--EGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEV
           VFGV   +            GF     G       +E VW R   E ++   E I P  Y  FS++ME +    E I  K+ +I  EN  +  +K++
Subjt:  AVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRD--EGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEV

Query:  LLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI
          G  SV  VY+     +  E+     K     ML       +    H    + N  F V S +G +SF  D   I+VT G Q++ WS G++K   G  I
Subjt:  LLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPI

Query:  YRDHKKEEIGNNNKNNG--ISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYI
        Y     +  G+    +     M+   S  S ++ S    +T SL HQ +F  FL L +   F+I
Subjt:  YRDHKKEEIGNNNKNNG--ISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYI

AT3G11180.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 5.1e-05 | % identity: 23.53
Query:  NPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLK
        N P ID  S+ S + D   ++ ++    G FQ++NHG+  EL+ A A       F +  E K   + SP    G+      E  + L     W+    L 
Subjt:  NPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLK

Query:  VEMEAISPFGY-----SNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCF
            A+  F       SN  +  +   +   K+G +++ I   N G  AE               Q Q     E++  CL+ +      +  + +     
Subjt:  VEMEAISPFGY-----SNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCF

Query:  HFCHGSSNSL--------FHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNNGISMAFHFSPNS
        H   G    L          V     W++  P + A +V +GDQIQ  S  +YK V  R I           N++   +S+AF ++P S
Subjt:  HFCHGSSNSL--------FHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNNGISMAFHFSPNS

AT3G11180.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 5.1e-05 | % identity: 23.53
Query:  NPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLK
        N P ID  S+ S + D   ++ ++    G FQ++NHG+  EL+ A A       F +  E K   + SP    G+      E  + L     W+    L 
Subjt:  NPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGAIAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLK

Query:  VEMEAISPFGY-----SNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCF
            A+  F       SN  +  +   +   K+G +++ I   N G  AE               Q Q     E++  CL+ +      +  + +     
Subjt:  VEMEAISPFGY-----SNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSVWCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCF

Query:  HFCHGSSNSL--------FHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNNGISMAFHFSPNS
        H   G    L          V     W++  P + A +V +GDQIQ  S  +YK V  R I           N++   +S+AF ++P S
Subjt:  HFCHGSSNSL--------FHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNNGISMAFHFSPNS
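In the alignment blocks above, the unlabelled middle line of each triplet marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. The Python sketch below shows one way to count identities from those midlines. The helper names `midline_stats` and `percent_identity` are hypothetical, and the sketch assumes each midline is padded with spaces to the full alignment width. The database may compute its reported % identity differently (for example over a different length), so the result is not guaranteed to match the listed values.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline.

    Letters are identical residues, '+' marks a conservative
    substitution, and a space is a mismatch or a gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


def percent_identity(midlines: list[str]) -> float:
    """Percent identity over all alignment columns in the given midlines."""
    identities = sum(midline_stats(m)[0] for m in midlines)
    columns = sum(len(m) for m in midlines)
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100 * identities / columns


# First 16 columns of the midline of the first GenBank hit.
example = "M MMSPE TAA+PFDF"
print(midline_stats(example), percent_identity([example]))
```

For a whole hit, pass every midline of that hit to `percent_identity` in one list so the ratio is taken over the full alignment rather than averaged per block.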


Sequences
CDS sequence
ATGGTGATGATGTCGCCTGAGCCCACGGCGGCGTCCCCCTTCGACTTCCGGGCGCCGCCACCGTCCCCCATCGGCACCAGCCGCCGTTCCTCCGTCACAAACGACGACGT
CCTCACCGAGTTCCTCGAGCACTCTCTCCGGGTCCCCGATCTTGTTTTGCCCGAAAACATTTTCCCGAGGCAGAGATTTATTGAGAACCCACCAAGGATTGATTTTCGGT
CGATTGAATCATTGGATTATGATTCAGTTTTGAAGCTTTTGGATTCGATGGGCAGCATTGGGTGTTTCCAATTGGTGAACCATGGGATTCCGGTGGAGCTGATTGGAGCA
ATTGCGGCGGCGGCGGCAGTGGGTGTTTTTGGGGTGTCGCCGGAGAAGAAGGTGGCGGTGGCGAGGTCACCGGAGAAGGCGTATGGGTTTGAAGAGTATTGGCATGGAGA
GGATGAGAGTGAACTGAGTGAAGAGTTTGTGTGGAGCAGAGATGAAGGTTTGAAGGTGGAAATGGAGGCAATTTCGCCATTTGGATATTCGAATTTCAGCAAGAAAATGG
AAGCACTCACACAAGTAACAGAGAAAATTGGTGAGAAAATCTTGAAAATTTTCCGGGAAAATTGTGGGAAAGTTGCAGAAAAGGAAGTGCTTTTGGGGCATGGATCAGTG
TGGTGTGTGTACAAGCAGAAGCAGAGAATGAGTTGGAATGAAGAGTTGGAGAACTGTTTGAAACATGATGTGATCAGAATGCTTATAAGGGGAACTGATTTTTCTCATGC
TTTTTGCTTCCATTTCTGCCATGGATCTTCTAATTCTCTATTCCATGTTTACTCCAAGAGAGGTTGGGTTTCTTTTGTACCCGACCAATCCGCCATCGTCGTTACTGTCG
GAGATCAAATTCAGGCATGGAGTGGTGGGCAGTACAAGCATGTGATTGGTAGGCCAATTTACAGAGATCATAAAAAAGAAGAAATTGGGAATAATAATAAGAATAATGGC
ATCTCCATGGCTTTTCATTTTTCTCCAAATTCTTCTTCATCAAATTCCCACAATGAAATTAGGACTCTTTCTTTAGCCCATCAAGCTCTATTTGCTCTCTTTCTAACTCT
TCTTTACAATTTTTTGTTTTACATTCTCAAATAA
mRNA sequence
AACAGAGTAGACCCATCAAAAAGGCGTCCAATCCAGTGATGATTTCCGCCATGTCCCTCAATTTCTTGACAACCCATCATTGCTTCTTCATCTCTCCTTTTTTCCCTTTT
AATTTCTCCATTTCTTCAAAATCAAACCTCGTCGGATTCAGAATCGGAAAATCCCTCTTCAACAAATTCGCCATTTCCCCCTGTTTTTCTTCTGATTTTCAACGATGGTG
ATGATGTCGCCTGAGCCCACGGCGGCGTCCCCCTTCGACTTCCGGGCGCCGCCACCGTCCCCCATCGGCACCAGCCGCCGTTCCTCCGTCACAAACGACGACGTCCTCAC
CGAGTTCCTCGAGCACTCTCTCCGGGTCCCCGATCTTGTTTTGCCCGAAAACATTTTCCCGAGGCAGAGATTTATTGAGAACCCACCAAGGATTGATTTTCGGTCGATTG
AATCATTGGATTATGATTCAGTTTTGAAGCTTTTGGATTCGATGGGCAGCATTGGGTGTTTCCAATTGGTGAACCATGGGATTCCGGTGGAGCTGATTGGAGCAATTGCG
GCGGCGGCGGCAGTGGGTGTTTTTGGGGTGTCGCCGGAGAAGAAGGTGGCGGTGGCGAGGTCACCGGAGAAGGCGTATGGGTTTGAAGAGTATTGGCATGGAGAGGATGA
GAGTGAACTGAGTGAAGAGTTTGTGTGGAGCAGAGATGAAGGTTTGAAGGTGGAAATGGAGGCAATTTCGCCATTTGGATATTCGAATTTCAGCAAGAAAATGGAAGCAC
TCACACAAGTAACAGAGAAAATTGGTGAGAAAATCTTGAAAATTTTCCGGGAAAATTGTGGGAAAGTTGCAGAAAAGGAAGTGCTTTTGGGGCATGGATCAGTGTGGTGT
GTGTACAAGCAGAAGCAGAGAATGAGTTGGAATGAAGAGTTGGAGAACTGTTTGAAACATGATGTGATCAGAATGCTTATAAGGGGAACTGATTTTTCTCATGCTTTTTG
CTTCCATTTCTGCCATGGATCTTCTAATTCTCTATTCCATGTTTACTCCAAGAGAGGTTGGGTTTCTTTTGTACCCGACCAATCCGCCATCGTCGTTACTGTCGGAGATC
AAATTCAGGCATGGAGTGGTGGGCAGTACAAGCATGTGATTGGTAGGCCAATTTACAGAGATCATAAAAAAGAAGAAATTGGGAATAATAATAAGAATAATGGCATCTCC
ATGGCTTTTCATTTTTCTCCAAATTCTTCTTCATCAAATTCCCACAATGAAATTAGGACTCTTTCTTTAGCCCATCAAGCTCTATTTGCTCTCTTTCTAACTCTTCTTTA
CAATTTTTTGTTTTACATTCTCAAATAAACTTTTACAATATTTTTTATTATATATATTCATCTTGTTATTTGCTTTGTGTCTATTTATTTCTGGAATGAGTTGTAATATA
GTTTTTAAGAAAAATTACAGAGACCGCAACTAAAAATTATACGTATTGTTATAATCATATGTGTGAGCTTTTAGTGGAATAATTACTCCCTTAATTAATATGTTGCAATT
ATT
Protein sequence
MVMMSPEPTAASPFDFRAPPPSPIGTSRRSSVTNDDVLTEFLEHSLRVPDLVLPENIFPRQRFIENPPRIDFRSIESLDYDSVLKLLDSMGSIGCFQLVNHGIPVELIGA
IAAAAAVGVFGVSPEKKVAVARSPEKAYGFEEYWHGEDESELSEEFVWSRDEGLKVEMEAISPFGYSNFSKKMEALTQVTEKIGEKILKIFRENCGKVAEKEVLLGHGSV
WCVYKQKQRMSWNEELENCLKHDVIRMLIRGTDFSHAFCFHFCHGSSNSLFHVYSKRGWVSFVPDQSAIVVTVGDQIQAWSGGQYKHVIGRPIYRDHKKEEIGNNNKNNG
ISMAFHFSPNSSSSNSHNEIRTLSLAHQALFALFLTLLYNFLFYILK
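
The CDS and protein sequences above should correspond under the standard genetic code. The Python sketch below is one way to check that relationship. The `translate` function is an illustrative helper rather than a CuGenDB utility, it assumes the standard nuclear code (NCBI translation table 1), and it handles only unambiguous A/C/G/T bases.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nt and last 9 nt of the CDS listed above.
print(translate("ATGGTGATGATGTCGCCTGAGCCCACGGCG"))
print(translate("CTCAAATAA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check. A base outside A/C/G/T raises a `KeyError` in this sketch.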