CuGenDBv2

Lag0037148 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037148
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: chr2:3754180..3755326
RNA-Seq Expression: Lag0037148
Synteny: Lag0037148
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.2e-119; % identity: 71.47)
Query:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM
        MNSTDQLCNFEA +IP+PQP PHGE+K+QVRRRRQSRRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEPE RM
Subjt:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM

Query:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC-DNSHSFCSVPLLPSSSYICPPVSYGA-TH
        KSRRNPRIYP CSFY ENGS F         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC +N+HSF S+  LP SSYICP   Y A TH
Subjt:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC-DNSHSFCSVPLLPSSSYICPPVSYGA-TH

Query:  QEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQQH
        QEVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRSM+ K LEIDGQ  C+F        ++AMEFPDWLSINDDFLQ  
Subjt:  QEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQQH

Query:  SNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  SNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

KAG7037674.1 hypothetical protein SDJN02_01304, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.3e-117; % identity: 70)
Query:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP
        MNSTDQLCNFEA +I    P+PQP PHGE+K+QVRRRRQSRRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEP
Subjt:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP

Query:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC----DNSHSFCSVPLLPSSSYICPPV
        E RMKSRRNPRIYP CSFY ENGS F         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC    +N+HSF S+  LP SSYICP  
Subjt:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC----DNSHSFCSVPLLPSSSYICPPV

Query:  SYGA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSIN
         Y A THQEVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRSM+ K LEIDGQ  C+F        ++AMEFPDWLSIN
Subjt:  SYGA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSIN

Query:  DDFLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        DDFLQ  SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  DDFLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

XP_022132656.1 uncharacterized protein LOC111005459 [Momordica charantia] (e value: 2.7e-94; % identity: 61.35)
Query:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMK
        MNS DQ+              PHG+QK+QVRRRRQSRRLYKE PLNMAEARREI TALKLHRAST+E ++ Q+Q                P FEPEGRMK
Subjt:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMK

Query:  SRRNPRIYPGCSFYLENGSDFSHVT---------PSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLLPSSSYICPPV
        SRRNPRIYPGCS YL+N SDFSHV+         PSCP + P P++Q+LN++ PIQTLGLNLNF D   TVDT  VV +NSHSFCS+  L  SSY+CPPV
Subjt:  SRRNPRIYPGCSFYLENGSDFSHVT---------PSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLLPSSSYICPPV

Query:  SYGATHQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEEAMAEIRSMDV---KALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAE
        S  AT QEV +S++LS E GKL+AS     +N P+G   K+ QGAV+E+     ++M++   KALEIDG  SFDKA+EFPDWLSINDDFLQQHSN+HCAE
Subjt:  SYGATHQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEEAMAEIRSMDV---KALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAE

Query:  KDYLQDPDLFCLDIGEIEDVDGDWLA
         DY+QDPDL C++IGEIEDVDGDWLA
Subjt:  KDYLQDPDLFCLDIGEIEDVDGDWLA

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata] (e value: 4.9e-120; % identity: 71.34)
Query:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM
        MNSTDQLCNFEA +IP+PQP PHGE+K+QVRRRRQSRRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEPE RM
Subjt:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM

Query:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC---DNSHSFCSVPLLPSSSYICPPVSYGA-
        KSRRNPRIYP CSFY ENGSDF         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC   +N+HSF S+  LP SSYICP   Y A 
Subjt:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC---DNSHSFCSVPLLPSSSYICPPVSYGA-

Query:  THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQ
        THQEVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRS+D K LEIDGQ  C+F        ++AMEFPDWLSINDDFLQ
Subjt:  THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQ

Query:  QHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
          SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  QHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

XP_022981721.1 uncharacterized protein LOC111480786 [Cucurbita maxima] (e value: 3.3e-116; % identity: 69.53)
Query:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP
        MNSTDQLCNFEA +I    P+PQP PHGE+K+QVRRRR++RRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEP
Subjt:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP

Query:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC--DNSHSFCSVPLLPSSSYICPPVSY
        E RMKSRRNPRIYP CSFY +NGSDF         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC  +N+HSF S+  L  SSYICP   Y
Subjt:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC--DNSHSFCSVPLLPSSSYICPPVSY

Query:  GA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDD
         A TH+EVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRSMD K LEIDGQ  C+F        ++AMEFPDWLSINDD
Subjt:  GA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDD

Query:  FLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        FLQ  SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  FLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S4DZY0 uncharacterized protein LOC103493717 (e value: 4.5e-87; % identity: 61.75)
Query:  MNSTDQLCNFE-AAQIPEPQPNPHGEQKRQVRRRRQS-RRLYKEMPLNMAEARREIVTALKLHRA-STKE-AKKQQQQQDQQIKKSLPLFSQLLPCFEPE
        MNS DQL NFE AAQI +P   P    K+QVRRRR S RRLYKE+PL+MAEARREIVTALKLHRA STKE A++QQQ+QDQ+ K+S PLF +L  CFE E
Subjt:  MNSTDQLCNFE-AAQIPEPQPNPHGEQKRQVRRRRQS-RRLYKEMPLNMAEARREIVTALKLHRA-STKE-AKKQQQQQDQQIKKSLPLFSQLLPCFEPE

Query:  GRMKSRRNPRIYPG----CSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLL-PSSSYICPPV
        GR KS+RNPRIYP     CSFYLENGS F         + PPP  ++LN +IPIQT      F DDFKT+DTC        SFCS+    P SSYICP V
Subjt:  GRMKSRRNPRIYPG----CSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLL-PSSSYICPPV

Query:  SYGAT-HQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQ--GAVDEEAMA------EIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHS
        S   T HQE PKS+SL EEEG LMASDVFW NN PTG +EK+MQ    ++EEAMA      +  SMDVKALEID   S D AM FPDW+SINDD LQQ+S
Subjt:  SYGAT-HQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQ--GAVDEEAMA------EIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHS

Query:  NYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        NYHC E+D LQ+PDL C DIG+IED+  +WLA
Subjt:  NYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X2 (e value: 4.5e-87; % identity: 61.75)
Query:  MNSTDQLCNFE-AAQIPEPQPNPHGEQKRQVRRRRQS-RRLYKEMPLNMAEARREIVTALKLHRA-STKE-AKKQQQQQDQQIKKSLPLFSQLLPCFEPE
        MNS DQL NFE AAQI +P   P    K+QVRRRR S RRLYKE+PL+MAEARREIVTALKLHRA STKE A++QQQ+QDQ+ K+S PLF +L  CFE E
Subjt:  MNSTDQLCNFE-AAQIPEPQPNPHGEQKRQVRRRRQS-RRLYKEMPLNMAEARREIVTALKLHRA-STKE-AKKQQQQQDQQIKKSLPLFSQLLPCFEPE

Query:  GRMKSRRNPRIYPG----CSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLL-PSSSYICPPV
        GR KS+RNPRIYP     CSFYLENGS F         + PPP  ++LN +IPIQT      F DDFKT+DTC        SFCS+    P SSYICP V
Subjt:  GRMKSRRNPRIYPG----CSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLL-PSSSYICPPV

Query:  SYGAT-HQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQ--GAVDEEAMA------EIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHS
        S   T HQE PKS+SL EEEG LMASDVFW NN PTG +EK+MQ    ++EEAMA      +  SMDVKALEID   S D AM FPDW+SINDD LQQ+S
Subjt:  SYGAT-HQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQ--GAVDEEAMA------EIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHS

Query:  NYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        NYHC E+D LQ+PDL C DIG+IED+  +WLA
Subjt:  NYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

A0A6J1BUG5 uncharacterized protein LOC111005459 (e value: 1.3e-94; % identity: 61.35)
Query:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMK
        MNS DQ+              PHG+QK+QVRRRRQSRRLYKE PLNMAEARREI TALKLHRAST+E ++ Q+Q                P FEPEGRMK
Subjt:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMK

Query:  SRRNPRIYPGCSFYLENGSDFSHVT---------PSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLLPSSSYICPPV
        SRRNPRIYPGCS YL+N SDFSHV+         PSCP + P P++Q+LN++ PIQTLGLNLNF D   TVDT  VV +NSHSFCS+  L  SSY+CPPV
Subjt:  SRRNPRIYPGCSFYLENGSDFSHVT---------PSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLLPSSSYICPPV

Query:  SYGATHQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEEAMAEIRSMDV---KALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAE
        S  AT QEV +S++LS E GKL+AS     +N P+G   K+ QGAV+E+     ++M++   KALEIDG  SFDKA+EFPDWLSINDDFLQQHSN+HCAE
Subjt:  SYGATHQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEEAMAEIRSMDV---KALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAE

Query:  KDYLQDPDLFCLDIGEIEDVDGDWLA
         DY+QDPDL C++IGEIEDVDGDWLA
Subjt:  KDYLQDPDLFCLDIGEIEDVDGDWLA

A0A6J1FRD8 uncharacterized protein LOC111446225 (e value: 2.4e-120; % identity: 71.34)
Query:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM
        MNSTDQLCNFEA +IP+PQP PHGE+K+QVRRRRQSRRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEPE RM
Subjt:  MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEPEGRM

Query:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC---DNSHSFCSVPLLPSSSYICPPVSYGA-
        KSRRNPRIYP CSFY ENGSDF         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC   +N+HSF S+  LP SSYICP   Y A 
Subjt:  KSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC---DNSHSFCSVPLLPSSSYICPPVSYGA-

Query:  THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQ
        THQEVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRS+D K LEIDGQ  C+F        ++AMEFPDWLSINDDFLQ
Subjt:  THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDDFLQ

Query:  QHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
          SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  QHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

A0A6J1IXC1 uncharacterized protein LOC111480786 (e value: 1.6e-116; % identity: 69.53)
Query:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP
        MNSTDQLCNFEA +I    P+PQP PHGE+K+QVRRRR++RRLYK+MPLNMAEARREIVTALKLHRASTKEAK+QQQ+QDQQIK SLP++  Q  PCFEP
Subjt:  MNSTDQLCNFEAAQI----PEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLF-SQLLPCFEP

Query:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC--DNSHSFCSVPLLPSSSYICPPVSY
        E RMKSRRNPRIYP CSFY +NGSDF         I PPPVAQSL++DIPIQTLGLN NF       DT  VVC  +N+HSF S+  L  SSYICP   Y
Subjt:  EGRMKSRRNPRIYPGCSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVC--DNSHSFCSVPLLPSSSYICPPVSY

Query:  GA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDD
         A TH+EVPKSISLSEEEG+LMASD+FW NN PTGESEKE+ GAV+EE       +AEIRSMD K LEIDGQ  C+F        ++AMEFPDWLSINDD
Subjt:  GA-THQEVPKSISLSEEEGKLMASDVFWLNNVPTGESEKEMQGAVDEE------AMAEIRSMDVKALEIDGQ--CSF--------DKAMEFPDWLSINDD

Query:  FLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA
        FLQ  SNY  + +DYLQDPDL C+DIGEIEDVDGDWLA
Subjt:  FLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G21280.1 hydroxyproline-rich glycoprotein family protein (e value: 9.5e-21; % identity: 33.67)
Query:  KRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMKSRRNPRIYPGCSFYLENGSDFSHVTP
        K+QVRRR  + R Y+E  LNMAEARREIVTALK HRAS ++A +    Q     + L LFS   P   P+                        FS   P
Subjt:  KRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMKSRRNPRIYPGCSFYLENGSDFSHVTP

Query:  SCPIITPPPVAQSLNMDIPIQTLGLNLNFGD--DF-KTVDTCPVVCDNSHSFCSVPLLPSSSYIC-----PPVSYGATHQEVPKSISLSEEEGKLMASDV
                    SLN  +P Q LGLNLNF D  DF +T  T      +S S  S  + P++ +I      PP    AT    P+  S S  E  ++ S  
Subjt:  SCPIITPPPVAQSLNMDIPIQTLGLNLNFGD--DF-KTVDTCPVVCDNSHSFCSVPLLPSSSYIC-----PPVSYGATHQEVPKSISLSEEEGKLMASDV

Query:  FWLNNVPTGESEKEMQGAVDEEAMAEIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDG-DWLA
         W + +     E E++   +E          V  +E D    F   MEFP WL+  ++ L    N          +P L C++IGEIE +DG DWLA
Subjt:  FWLNNVPTGESEKEMQGAVDEEAMAEIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDG-DWLA


Sequences
CDS sequence
ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGCACAAATCCCAGAGCCACAGCCAAACCCACATGGAGAACAAAAGAGACAGGTTAGAAGGAGGCGCCAAAGCCG
GCGGCTTTACAAGGAAATGCCTCTCAATATGGCTGAGGCCAGAAGAGAGATTGTAACTGCACTCAAACTCCACAGAGCATCAACCAAAGAAGCCAAGAAACAGCAACAGC
AACAGGACCAACAGATTAAAAAATCACTTCCTTTGTTCTCTCAATTGTTGCCATGTTTTGAACCTGAAGGAAGAATGAAATCCAGGAGAAATCCCAGGATATACCCAGGT
TGCTCATTTTATTTGGAAAATGGGTCTGATTTTTCTCATGTTACTCCTTCCTGCCCAATTATTACTCCTCCTCCTGTTGCACAGAGTCTCAATATGGATATTCCTATACA
AACCTTAGGTCTGAATCTGAATTTTGGTGATGATTTCAAAACTGTGGACACTTGTCCAGTTGTCTGTGACAACAGCCATTCATTTTGTTCAGTTCCATTGTTGCCCTCAT
CTTCATATATTTGTCCCCCTGTTTCTTATGGTGCTACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCATCTGATGTGTTTTGG
TTGAATAATGTCCCAACTGGAGAGAGTGAAAAAGAGATGCAGGGGGCAGTGGACGAGGAGGCCATGGCTGAGATCAGGTCGATGGATGTGAAGGCTTTGGAGATTGATGG
TCAGTGTAGTTTTGATAAAGCCATGGAATTTCCAGATTGGTTGAGTATCAATGATGATTTTTTGCAGCAGCATTCGAATTATCACTGCGCAGAGAAGGATTACCTTCAAG
ATCCTGACCTATTTTGCTTGGATATTGGGGAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
mRNA sequence
ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGCACAAATCCCAGAGCCACAGCCAAACCCACATGGAGAACAAAAGAGACAGGTTAGAAGGAGGCGCCAAAGCCG
GCGGCTTTACAAGGAAATGCCTCTCAATATGGCTGAGGCCAGAAGAGAGATTGTAACTGCACTCAAACTCCACAGAGCATCAACCAAAGAAGCCAAGAAACAGCAACAGC
AACAGGACCAACAGATTAAAAAATCACTTCCTTTGTTCTCTCAATTGTTGCCATGTTTTGAACCTGAAGGAAGAATGAAATCCAGGAGAAATCCCAGGATATACCCAGGT
TGCTCATTTTATTTGGAAAATGGGTCTGATTTTTCTCATGTTACTCCTTCCTGCCCAATTATTACTCCTCCTCCTGTTGCACAGAGTCTCAATATGGATATTCCTATACA
AACCTTAGGTCTGAATCTGAATTTTGGTGATGATTTCAAAACTGTGGACACTTGTCCAGTTGTCTGTGACAACAGCCATTCATTTTGTTCAGTTCCATTGTTGCCCTCAT
CTTCATATATTTGTCCCCCTGTTTCTTATGGTGCTACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCATCTGATGTGTTTTGG
TTGAATAATGTCCCAACTGGAGAGAGTGAAAAAGAGATGCAGGGGGCAGTGGACGAGGAGGCCATGGCTGAGATCAGGTCGATGGATGTGAAGGCTTTGGAGATTGATGG
TCAGTGTAGTTTTGATAAAGCCATGGAATTTCCAGATTGGTTGAGTATCAATGATGATTTTTTGCAGCAGCATTCGAATTATCACTGCGCAGAGAAGGATTACCTTCAAG
ATCCTGACCTATTTTGCTTGGATATTGGGGAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
Protein sequence
MNSTDQLCNFEAAQIPEPQPNPHGEQKRQVRRRRQSRRLYKEMPLNMAEARREIVTALKLHRASTKEAKKQQQQQDQQIKKSLPLFSQLLPCFEPEGRMKSRRNPRIYPG
CSFYLENGSDFSHVTPSCPIITPPPVAQSLNMDIPIQTLGLNLNFGDDFKTVDTCPVVCDNSHSFCSVPLLPSSSYICPPVSYGATHQEVPKSISLSEEEGKLMASDVFW
LNNVPTGESEKEMQGAVDEEAMAEIRSMDVKALEIDGQCSFDKAMEFPDWLSINDDFLQQHSNYHCAEKDYLQDPDLFCLDIGEIEDVDGDWLA