CuGenDBv2

Sed0022603 (gene) of Chayote v1 genome

Gene ID: Sed0022603
Organism: Sechium edule (Chayote v1)
Description: Chloroplast, nucleus, chloroplast envelope, putative
Genome location: LG01:17080513..17082189
RNA-Seq Expression: Sed0022603
Synteny: Sed0022603
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
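
The genome location in this record, LG01:17080513..17082189, uses a `sequence:start..end` layout. As a minimal sketch, the field can be split into its parts and the locus span computed as below. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'LG01:17080513..17082189' into its fields."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("LG01:17080513..17082189")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under that coordinate assumption the locus spans 1677 bp on LG01.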


Homology
GenBank top hits (e value, % identity, alignment)
KGN65868.1 hypothetical protein Csa_023343 [Cucumis sativus] (e value: 3.0e-84; % identity: 64.29)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSIC+NKS+C VSK IYPSFH +QSRRA VNL ANAS FKQ LP+ +Y++ R GLK+Q TPIVS +GSKGK  + DGGSPWK  DKVVE+
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLF
        FK G+SVEDVLRQQIE KEFYDGGDGGKRPP GGGG GGG       +SS GS D +L G+M E +QV+LAT+G +F+YIYI+SGEEL+RLAKDYIKYLF
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLF

Query:  GGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYR---------------HTVTDYIDSKYPKL
        GGSKS RLKR M+ WG FYQ L  KK+YD++W+EKAIL+TPT  D+PD Y                +  TDY+DS Y ++
Subjt:  GGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYR---------------HTVTDYIDSKYPKL

XP_008444591.1 PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo] (e value: 1.5e-88; % identity: 65.05)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSIC+NKS+C VSK IYPSFH +QS RA VNL ANAS FKQ LPI +YK+ R GLKHQ TPIVS FGSKGK  + DGGSPWKA+DKVVE+
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG--------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKY
        FK G SVEDVLR+QIE KEFYDGGDGG+RPPSGGGGGGGGG        +SS G+ D +L   + ET+QVVLAT+GFIFMY Y+++GEE+ RL KDYIKY
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG--------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKY

Query:  LFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE
         FGGSKS RL+R M++WG FYQRLT KK+YD+ W+EKAI+NTPT  DHPDNYRH    Y        K  +    + +D D    DDEE
Subjt:  LFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE

XP_022140099.1 uncharacterized protein LOC111010834 [Momordica charantia] (e value: 1.5e-96; % identity: 70.46)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSICS++S+C  SK IYPSF  ++SR A VNL ANAS FKQ LP+ +YK+ RAGL HQ TPIVS FGSKGKE +GDGGSPWK +DKVVEN
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA
        FK G+SVEDVLRQQIE KEFYDGGDGGKRPPS   GGGG G+SS GS DD+LGG++ ET+QV+LATIGFIF+YIYIISGEEL RLAKDYIK++FGGSKS 
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA

Query:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPY-DADSDESHSDDE
        RLKR M++WG FYQ+LT KKQYD++W+EKAI+NTPT  DHPD YR  V DY++S+Y     HS+S+   DA+ D S+SDDE
Subjt:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPY-DADSDESHSDDE

XP_022994855.1 uncharacterized protein LOC111490456 [Cucurbita maxima] (e value: 2.8e-82; % identity: 62.28)
Query:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF
        S MQITA  NS+C NKSLC VSK  YPSF  SQ+R AFVN  AN S  K+ LP+ +Y + R GLKH+ TPI S FGSKGK DNGDGGSPWKA+DKVVENF
Subjt:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF

Query:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSAR
        K G+SVED+LRQQIENK+FYDGGDGG+ PP GGGGG  GG+SS  S D N+ G++ ET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +SAR
Subjt:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSAR

Query:  LKRTMHQWGSFYQRLTGKKQY-DKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADS-DESHSDDEE
        LK  M+ WG FY+R T KKQ  D++W+EKAILNTPT  DHPD YR+ + +Y++S+  +  P        SSSS YD +  +ES+SDDE+
Subjt:  LKRTMHQWGSFYQRLTGKKQY-DKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADS-DESHSDDEE

XP_038895689.1 uncharacterized protein LOC120083861 [Benincasa hispida] (e value: 1.2e-98; % identity: 72.24)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSICSNKS+C VSK IYPSFH SQSR   VNL AN SSFKQ LP+ +YK+ R GLKHQ TPIVS FGSKGK D GDGGSPWKA+D+VVEN
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA
        FK G+SVEDVLRQQIE KEFYDGG+GGKRPPS GGGG G G+SS GS DD+L G++ ET+QVVLAT+GFIF+YIYII+GEELARLAKDYIKYLFGGSKS 
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA

Query:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE
        RL+R+M+QWG FYQ+LT KKQYD++W+EKAILNTPT  DHPDNYR TV  +I+S++   K + +S  Y  + D+ +SDDEE
Subjt:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVP5 Uncharacterized protein (e value: 1.4e-84; % identity: 64.29)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSIC+NKS+C VSK IYPSFH +QSRRA VNL ANAS FKQ LP+ +Y++ R GLK+Q TPIVS +GSKGK  + DGGSPWK  DKVVE+
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLF
        FK G+SVEDVLRQQIE KEFYDGGDGGKRPP GGGG GGG       +SS GS D +L G+M E +QV+LAT+G +F+YIYI+SGEEL+RLAKDYIKYLF
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLF

Query:  GGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYR---------------HTVTDYIDSKYPKL
        GGSKS RLKR M+ WG FYQ L  KK+YD++W+EKAIL+TPT  D+PD Y                +  TDY+DS Y ++
Subjt:  GGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYR---------------HTVTDYIDSKYPKL

A0A1S3BA69 uncharacterized protein LOC103487859 (e value: 7.3e-89; % identity: 65.05)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSIC+NKS+C VSK IYPSFH +QS RA VNL ANAS FKQ LPI +YK+ R GLKHQ TPIVS FGSKGK  + DGGSPWKA+DKVVE+
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG--------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKY
        FK G SVEDVLR+QIE KEFYDGGDGG+RPPSGGGGGGGGG        +SS G+ D +L   + ET+QVVLAT+GFIFMY Y+++GEE+ RL KDYIKY
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGG--------NSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKY

Query:  LFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE
         FGGSKS RL+R M++WG FYQRLT KK+YD+ W+EKAI+NTPT  DHPDNYRH    Y        K  +    + +D D    DDEE
Subjt:  LFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEE

A0A6J1CES8 uncharacterized protein LOC111010834 (e value: 7.3e-97; % identity: 70.46)
Query:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN
        MSSMQITA  NSICS++S+C  SK IYPSF  ++SR A VNL ANAS FKQ LP+ +YK+ RAGL HQ TPIVS FGSKGKE +GDGGSPWK +DKVVEN
Subjt:  MSSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVEN

Query:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA
        FK G+SVEDVLRQQIE KEFYDGGDGGKRPPS   GGGG G+SS GS DD+LGG++ ET+QV+LATIGFIF+YIYIISGEEL RLAKDYIK++FGGSKS 
Subjt:  FKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA

Query:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPY-DADSDESHSDDE
        RLKR M++WG FYQ+LT KKQYD++W+EKAI+NTPT  DHPD YR  V DY++S+Y     HS+S+   DA+ D S+SDDE
Subjt:  RLKRTMHQWGSFYQRLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPY-DADSDESHSDDE

A0A6J1GSZ4 uncharacterized protein LOC111457207 (e value: 6.7e-82; % identity: 61.46)
Query:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF
        S MQITA  NS+C NKS+C VSK  YPSF  SQ+R AFVN  ANAS  K+ LP+ +Y + R GLKH+ TPI S FGSKGK DN DGGSPWKA+DKVVENF
Subjt:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF

Query:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGG-GGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA
        K G+SVED+LRQQIENK+FYDGGDGG+ PP GGGGGG  GG+SS  S D ++ G++ ET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +SA
Subjt:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGG-GGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSA

Query:  RLKRTMHQWGSFYQRLTGKK-QYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADSDESHSDDE
        RLK  M+ WG FY+R T KK + D++W+EKAILNTPT  DHPD YR+ + +Y++S+     P        SSSS YDA S  S+ D+E
Subjt:  RLKRTMHQWGSFYQRLTGKK-QYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADSDESHSDDE

A0A6J1K2H4 uncharacterized protein LOC111490456 (e value: 1.3e-82; % identity: 62.28)
Query:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF
        S MQITA  NS+C NKSLC VSK  YPSF  SQ+R AFVN  AN S  K+ LP+ +Y + R GLKH+ TPI S FGSKGK DNGDGGSPWKA+DKVVENF
Subjt:  SSMQITA--NSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENF

Query:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSAR
        K G+SVED+LRQQIENK+FYDGGDGG+ PP GGGGG  GG+SS  S D N+ G++ ET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +SAR
Subjt:  KNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSAR

Query:  LKRTMHQWGSFYQRLTGKKQY-DKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADS-DESHSDDEE
        LK  M+ WG FY+R T KKQ  D++W+EKAILNTPT  DHPD YR+ + +Y++S+  +  P        SSSS YD +  +ES+SDDE+
Subjt:  LKRTMHQWGSFYQRLTGKKQY-DKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKP-------HSSSSPYDADS-DESHSDDEE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G43630.1 FUNCTIONS IN: molecular_function unknown (e value: 6.4e-45; % identity: 41.92)
Query:  CPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKE
        C  S  I  S       R    L A A+   Q  P+  ++      K +++  V  FG K K D  D  SPWKA +K +      +SVED+LR+QI+ K+
Subjt:  CPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKE

Query:  FYDGGDGGKRPPSGGGGGGGGGN-----SSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQ
        FYD   GG  PP GGG GGGGGN        G  D  L G+  ET+QVVLAT+GFIF+Y YII+GEEL +LA+DYI++L G  K+ RL R M  W  F +
Subjt:  FYDGGDGGKRPPSGGGGGGGGGN-----SSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQ

Query:  RLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDE
        +++ ++ YD++W+EKAI+NTPT  D P+ YR  +  Y+DS       +S  +  +++SDE
Subjt:  RLTGKKQYDKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDE

AT3G59640.1 glycine-rich protein (e value: 2.7e-27; % identity: 42.42)
Query:  VNLCANASSFKQVLPIFQY--KNPRAGLKHQRTPIVSAFGSKGKED-NGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGG
        +   A++S   Q  P+  +  +N R G      P+V   G K K + + +  S W+A +K +      +SVED+LR+QI+ K      D G  PP G GG
Subjt:  VNLCANASSFKQVLPIFQY--KNPRAGLKHQRTPIVSAFGSKGKED-NGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGG

Query:  GGGGGN-----SSGGSGDD-NLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWME
        GGGG N     S G SG+D  L     ET+QVVLAT+GFIF+Y YII+GEEL RLA+DYI+YL G  KS RL R M  W  F+++++ KK Y+++W++
Subjt:  GGGGGN-----SSGGSGDD-NLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWME

AT3G59640.2 glycine-rich protein (e value: 2.7e-27; % identity: 42.42)
Query:  VNLCANASSFKQVLPIFQY--KNPRAGLKHQRTPIVSAFGSKGKED-NGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGG
        +   A++S   Q  P+  +  +N R G      P+V   G K K + + +  S W+A +K +      +SVED+LR+QI+ K      D G  PP G GG
Subjt:  VNLCANASSFKQVLPIFQY--KNPRAGLKHQRTPIVSAFGSKGKED-NGDGGSPWKAYDKVVENFKNGQSVEDVLRQQIENKEFYDGGDGGKRPPSGGGG

Query:  GGGGGN-----SSGGSGDD-NLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWME
        GGGG N     S G SG+D  L     ET+QVVLAT+GFIF+Y YII+GEEL RLA+DYI+YL G  KS RL R M  W  F+++++ KK Y+++W++
Subjt:  GGGGGN-----SSGGSGDD-NLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQRLTGKKQYDKHWME


Sequences
CDS sequence
ATGAGCAGCATGCAGATAACTGCAAATTCTATTTGTTCCAACAAATCATTGTGTCCTGTTTCTAAGTTAATATATCCATCATTTCATGTTAGTCAGTCACGAAGAGCTTT
TGTGAACCTTTGTGCCAATGCATCTTCTTTCAAGCAAGTTCTACCAATATTTCAGTATAAAAATCCGAGAGCTGGCTTAAAACATCAGCGTACCCCAATCGTTTCCGCAT
TTGGTAGCAAGGGAAAGGAGGACAATGGCGATGGGGGTTCTCCCTGGAAAGCTTACGACAAAGTTGTTGAAAATTTTAAGAATGGACAGTCAGTAGAAGATGTATTGCGA
CAACAAATTGAAAACAAAGAGTTCTATGATGGTGGAGACGGTGGAAAAAGACCTCCAAGTGGTGGTGGTGGCGGCGGCGGCGGTGGGAATAGCTCAGGTGGATCTGGAGA
TGATAACCTTGGAGGAGTTATGCACGAAACAGTGCAAGTAGTTTTAGCAACCATTGGCTTTATTTTCATGTACATCTACATCATCAGTGGGGAAGAGCTGGCAAGATTAG
CGAAGGACTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGCTCGTTTAAAGCGAACAATGCACCAATGGGGAAGCTTTTACCAAAGACTCACTGGAAAGAAGCAATAT
GATAAACACTGGATGGAGAAAGCTATTCTCAACACCCCAACTGGGGGGGACCATCCTGATAATTACAGGCATACCGTAACGGATTATATCGATTCCAAGTATCCGAAACT
GAAACCACATTCTTCGTCGTCACCATACGATGCTGATAGTGATGAGTCTCATTCGGATGATGAGGAATCCTAA
mRNA sequence
ATGAGCAGCATGCAGATAACTGCAAATTCTATTTGTTCCAACAAATCATTGTGTCCTGTTTCTAAGTTAATATATCCATCATTTCATGTTAGTCAGTCACGAAGAGCTTT
TGTGAACCTTTGTGCCAATGCATCTTCTTTCAAGCAAGTTCTACCAATATTTCAGTATAAAAATCCGAGAGCTGGCTTAAAACATCAGCGTACCCCAATCGTTTCCGCAT
TTGGTAGCAAGGGAAAGGAGGACAATGGCGATGGGGGTTCTCCCTGGAAAGCTTACGACAAAGTTGTTGAAAATTTTAAGAATGGACAGTCAGTAGAAGATGTATTGCGA
CAACAAATTGAAAACAAAGAGTTCTATGATGGTGGAGACGGTGGAAAAAGACCTCCAAGTGGTGGTGGTGGCGGCGGCGGCGGTGGGAATAGCTCAGGTGGATCTGGAGA
TGATAACCTTGGAGGAGTTATGCACGAAACAGTGCAAGTAGTTTTAGCAACCATTGGCTTTATTTTCATGTACATCTACATCATCAGTGGGGAAGAGCTGGCAAGATTAG
CGAAGGACTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGCTCGTTTAAAGCGAACAATGCACCAATGGGGAAGCTTTTACCAAAGACTCACTGGAAAGAAGCAATAT
GATAAACACTGGATGGAGAAAGCTATTCTCAACACCCCAACTGGGGGGGACCATCCTGATAATTACAGGCATACCGTAACGGATTATATCGATTCCAAGTATCCGAAACT
GAAACCACATTCTTCGTCGTCACCATACGATGCTGATAGTGATGAGTCTCATTCGGATGATGAGGAATCCTAA
Protein sequence
MSSMQITANSICSNKSLCPVSKLIYPSFHVSQSRRAFVNLCANASSFKQVLPIFQYKNPRAGLKHQRTPIVSAFGSKGKEDNGDGGSPWKAYDKVVENFKNGQSVEDVLR
QQIENKEFYDGGDGGKRPPSGGGGGGGGGNSSGGSGDDNLGGVMHETVQVVLATIGFIFMYIYIISGEELARLAKDYIKYLFGGSKSARLKRTMHQWGSFYQRLTGKKQY
DKHWMEKAILNTPTGGDHPDNYRHTVTDYIDSKYPKLKPHSSSSPYDADSDESHSDDEES
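
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the helper name `translate` is hypothetical. Only the first 30 and last 15 nucleotides of the CDS are copied in, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this is the table that
# applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of the Sed0022603 CDS, copied from the record.
cds_start = "ATGAGCAGCATGCAGATAACTGCAAATTCT"
cds_end = "GATGAGGAATCCTAA"

print(translate(cds_start))  # MSSMQITANS
print(translate(cds_end))    # DEES*
```

Both fragments agree with the start (MSSMQITANS) and end (DEES) of the listed protein sequence, with the final TAA codon read as the stop.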