CuGenDBv2

Sgr028137 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028137
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: NHL domain-containing protein
Genome location: tig00153056:3870786..3879312
RNA-Seq Expression: Sgr028137
Synteny: Sgr028137
Gene Ontology terms: NA
InterPro domains: IPR011042 - Six-bladed beta-propeller, TolB-like
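
The genome location above is written as `contig:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the string can be parsed and the span length computed as follows. `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location (hypothetical helper)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153056:3870786..3879312")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene region spans 8,527 bp on contig tig00153056.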


Homology
GenBank top hits | e value | % identity
XP_004146020.1 uncharacterized protein LOC101206392 isoform X1 [Cucumis sativus] | 1.5e-85 | 65.7
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV IV+ Q D+A            SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS F+ M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

XP_011654037.1 uncharacterized protein LOC101206392 isoform X2 [Cucumis sativus] | 1.5e-85 | 65.7
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV IV+ Q D+A            SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS F+ M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

XP_022142855.1 uncharacterized protein LOC111012863 [Momordica charantia] | 3.4e-85 | 65.8
Query:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN
        NPL+ LA +VAIV+ QAD+AP           SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVD ++
Subjt:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN

Query:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV
        SNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEAKFSNDFD+
Subjt:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV

Query:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        IYVRR+CSLLV+DRGNAALRQISLNKEDCD QYGS ST+DVAMF GAL +GYVTYMLQHGFGLS F+LM
Subjt:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

XP_038900080.1 uncharacterized protein LOC120087236 isoform X1 [Benincasa hispida] | 1.5e-85 | 66.17
Query:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN
        NPLL L  IV+++ LQAD+AP           SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVDS+N
Subjt:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN

Query:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV
        SNVVKVSPPLSR                                                          VTTIAGGKTN+PGYSDGPGEEAKFSNDFD+
Subjt:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV

Query:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        IYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS FS M
Subjt:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

XP_038900081.1 uncharacterized protein LOC120087236 isoform X2 [Benincasa hispida] | 1.5e-85 | 66.17
Query:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN
        NPLL L  IV+++ LQAD+AP           SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVDS+N
Subjt:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN

Query:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV
        SNVVKVSPPLSR                                                          VTTIAGGKTN+PGYSDGPGEEAKFSNDFD+
Subjt:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV

Query:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        IYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS FS M
Subjt:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

TrEMBL top hits | e value | % identity
A0A0A0L212 Uncharacterized protein | 7.4e-86 | 65.7
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV IV+ Q D+A            SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS F+ M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

A0A1S3CJZ4 uncharacterized protein LOC103501822 isoform X1 | 8.2e-85 | 65.34
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV I + Q D+A            SG LARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS FS M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

A0A1S3CKG1 uncharacterized protein LOC103501822 isoform X2 | 8.2e-85 | 65.34
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV I + Q D+A            SG LARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS FS M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

A0A5A7VG70 NHL domain-containing protein | 8.2e-85 | 65.34
Query:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE
        +GCS    NPLL LA IV I + Q D+A            SG LARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGE
Subjt:  LGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGE

Query:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA
        LFAVDS+NSNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEA
Subjt:  LFAVDSINSNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEA

Query:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        KFSNDFDVIYVRRTCSLLV+DRGNAALRQISLNKEDCDYQYGS ST+DVAMF GALLIGY TYMLQHGF LS FS M
Subjt:  KFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

A0A6J1CM34 uncharacterized protein LOC111012863 | 1.7e-85 | 65.8
Query:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN
        NPL+ LA +VAIV+ QAD+AP           SGPLARHLSSLLKWTGSS KTPQPDGNA+QFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVD ++
Subjt:  NPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN

Query:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV
        SNVVKVSPPLSR                                                          VTTIAGGKTNVPGYSDGPGEEAKFSNDFD+
Subjt:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDV

Query:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM
        IYVRR+CSLLV+DRGNAALRQISLNKEDCD QYGS ST+DVAMF GAL +GYVTYMLQHGFGLS F+LM
Subjt:  IYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLM

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G23880.1 NHL domain-containing protein | 1.4e-31 | 34.42
Query:  LSSTLGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGS-----SYKTPQPDGNALQFESGYLVETIVEGNEIGMVPH
        LSSTL  S       L L  I+ + +    +APS      I+  +  ++ H +SLLKW  S     + KT  P  + ++FE+GY VET+++G+++G+ P+
Subjt:  LSSTLGCSASHENPLLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGS-----SYKTPQPDGNALQFESGYLVETIVEGNEIGMVPH

Query:  KIRVSEDGELFAVDSINSNVVKVSPPLS---------------------------------------------------------RCVTTIAGGK-TNVP
         I+V  +GEL  +DS NSN+ ++S  LS                                                           VTTIAGGK     
Subjt:  KIRVSEDGELFAVDSINSNVVKVSPPLS---------------------------------------------------------RCVTTIAGGK-TNVP

Query:  GYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQ
        G+ DGP E+AKFSNDFDV+Y+  +CSLLVIDRGN A+R+I L+ +DC  QYGS     +A+   A+  GY+  +LQ
Subjt:  GYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQ

AT1G70280.1 NHL domain-containing protein | 5.7e-30 | 37.98
Query:  LQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-----------------------------------------------
        ++FE+GY VET+ +G+++G+ P+ I V  +GEL  +DS NSN+ K+S  LS                                                 
Subjt:  LQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-----------------------------------------------

Query:  ----------VTTIAGGKT-NVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQH
                  VTTIAGGKT    G+ DGP E+AKFSNDFDV+YV  +CSLLVIDRGN A+R+I L+ +DC YQYGS     +A+   A   GY+  +LQ 
Subjt:  ----------VTTIAGGKT-NVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQH

Query:  GFGLSVFS
          G  V S
Subjt:  GFGLSVFS

AT1G70280.2 NHL domain-containing protein | 2.9e-34 | 36.19
Query:  LVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGS---SYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN
        LVL+ ++ +++    +APS      IL  +G ++ H SSL+KW  S   + KT     + ++FE+GY VET+ +G+++G+ P+ I V  +GEL  +DS N
Subjt:  LVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGS---SYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSIN

Query:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKT-NVPGYSDGPGEEAKFSNDFD
        SN+ K+S  LS                                                           VTTIAGGKT    G+ DGP E+AKFSNDFD
Subjt:  SNVVKVSPPLSRC---------------------------------------------------------VTTIAGGKT-NVPGYSDGPGEEAKFSNDFD

Query:  VIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFS
        V+YV  +CSLLVIDRGN A+R+I L+ +DC YQYGS     +A+   A   GY+  +LQ   G  V S
Subjt:  VIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFS

AT3G14860.1 NHL domain-containing protein | 8.2e-61 | 51.57
Query:  QADAAPSGMWVLLILVCSGPLARHLSSLLKW-TGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-
        QA AAP           SG L +H+SS+LKW TGSS K  Q D N LQFE+GYLVET+VEGN+IG+VP+KIRVS+DGEL+AVD +NSN++K++PPLS+  
Subjt:  QADAAPSGMWVLLILVCSGPLARHLSSLLKW-TGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-

Query:  --------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDR
                                                                VTTIAGGK+N+ GY DGP E+AKFSNDFDV+YVR TCSLLVIDR
Subjt:  --------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDR

Query:  GNAALRQISLNKEDCDYQYGSA-STADVAMFFGALLIGYVTYMLQHGFGLSVFS
        GNAALRQISL++EDCDYQ  S+ S  D+ +  GA+LIGY T MLQ GFG S FS
Subjt:  GNAALRQISLNKEDCDYQYGSA-STADVAMFFGALLIGYVTYMLQHGFGLSVFS

AT3G14860.2 NHL domain-containing protein | 4.8e-61 | 50.97
Query:  QADAAPSGMWVLLILVCSGPLARHLSSLLKW-TGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-
        QA AAP           SG L +H+SS+LKW TGSS K  Q D N LQFE+GYLVET+VEGN+IG+VP+KIRVS+DGEL+AVD +NSN++K++PPLS+  
Subjt:  QADAAPSGMWVLLILVCSGPLARHLSSLLKW-TGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSRC-

Query:  --------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDR
                                                                VTTIAGGK+N+ GY DGP E+AKFSNDFDV+YVR TCSLLVIDR
Subjt:  --------------------------------------------------------VTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDR

Query:  GNAALRQISLNKEDCDYQYGSA-STADVAMFFGALLIGYVTYMLQHGFGLSVFSLMKFG
        GNAALRQISL++EDCDYQ  S+ S  D+ +  GA+LIGY T MLQ GFG S FS  + G
Subjt:  GNAALRQISLNKEDCDYQYGSA-STADVAMFFGALLIGYVTYMLQHGFGLSVFSLMKFG


Sequences
CDS sequence
ATGATTGTGACTGCAAGCAGAATTCCTGTCCCTGAACCAATTGCGCCCATGAAATCAGCCAACACGGTGAGTGCCCCAATGCACATGCCTCCAAATGCTGCCGCAGTGGT
TAAAAGTACCGCCATTAAAGACGAACTGCTTCAGAAACATACCGGAATATCTCCAGAGTCGGCAAAGTGGACTGGATCTAAGTTCCCATGGAGGAATAAGAATAACAAAA
GAAAGGAAGCTGGAACGGAACTCTCTTTCCCATTACATTCTGAATCCTTGCCCAGCTGCAAATCTCTCTTATCTTCAACATTGGGTTGTTCAGCGAGCCATGAAAATCCC
CTGCTCGTCCTCGCTTCCATAGTAGCTATTGTCAATCTCCAAGCTGATGCTGCTCCTTCTGGTATGTGGGTTCTTCTTATATTGGTCTGTTCAGGACCATTGGCAAGACA
TTTGTCTTCTCTTCTTAAATGGACTGGGTCTTCTTACAAAACTCCTCAGCCAGATGGGAATGCGCTTCAGTTTGAGAGTGGTTACTTAGTTGAGACTATTGTGGAGGGAA
ATGAAATTGGAATGGTTCCTCACAAGATACGCGTCTCCGAGGATGGCGAACTCTTCGCTGTTGATTCGATTAATAGCAATGTTGTCAAGGTTTCTCCGCCATTATCTCGA
TGTGTGACAACAATTGCTGGTGGCAAGACAAATGTTCCAGGCTATAGTGATGGGCCGGGCGAGGAAGCAAAATTTTCGAATGATTTTGATGTCATATATGTCCGGCGTAC
CTGTTCGTTGTTGGTCATTGATAGAGGAAATGCTGCACTTCGCCAAATATCTCTTAACAAGGAGGATTGTGATTATCAATATGGTTCAGCTTCTACCGCAGATGTTGCGA
TGTTCTTTGGCGCTCTTCTCATAGGATATGTTACATATATGCTTCAACATGGATTCGGGCTGTCAGTCTTCTCTCTTATGAAATTTGGTGGACGCCCAATGCCTCTCACC
ATTACCTTCTTAGATCTTGTGTTTAAGTTATTTGCTTTTGTCTATGAGACCCAAAAACATTGTGTTAGCCAAATGTTCGAAGTGTAA
mRNA sequence
ATGATTGTGACTGCAAGCAGAATTCCTGTCCCTGAACCAATTGCGCCCATGAAATCAGCCAACACGGTGAGTGCCCCAATGCACATGCCTCCAAATGCTGCCGCAGTGGT
TAAAAGTACCGCCATTAAAGACGAACTGCTTCAGAAACATACCGGAATATCTCCAGAGTCGGCAAAGTGGACTGGATCTAAGTTCCCATGGAGGAATAAGAATAACAAAA
GAAAGGAAGCTGGAACGGAACTCTCTTTCCCATTACATTCTGAATCCTTGCCCAGCTGCAAATCTCTCTTATCTTCAACATTGGGTTGTTCAGCGAGCCATGAAAATCCC
CTGCTCGTCCTCGCTTCCATAGTAGCTATTGTCAATCTCCAAGCTGATGCTGCTCCTTCTGGTATGTGGGTTCTTCTTATATTGGTCTGTTCAGGACCATTGGCAAGACA
TTTGTCTTCTCTTCTTAAATGGACTGGGTCTTCTTACAAAACTCCTCAGCCAGATGGGAATGCGCTTCAGTTTGAGAGTGGTTACTTAGTTGAGACTATTGTGGAGGGAA
ATGAAATTGGAATGGTTCCTCACAAGATACGCGTCTCCGAGGATGGCGAACTCTTCGCTGTTGATTCGATTAATAGCAATGTTGTCAAGGTTTCTCCGCCATTATCTCGA
TGTGTGACAACAATTGCTGGTGGCAAGACAAATGTTCCAGGCTATAGTGATGGGCCGGGCGAGGAAGCAAAATTTTCGAATGATTTTGATGTCATATATGTCCGGCGTAC
CTGTTCGTTGTTGGTCATTGATAGAGGAAATGCTGCACTTCGCCAAATATCTCTTAACAAGGAGGATTGTGATTATCAATATGGTTCAGCTTCTACCGCAGATGTTGCGA
TGTTCTTTGGCGCTCTTCTCATAGGATATGTTACATATATGCTTCAACATGGATTCGGGCTGTCAGTCTTCTCTCTTATGAAATTTGGTGGACGCCCAATGCCTCTCACC
ATTACCTTCTTAGATCTTGTGTTTAAGTTATTTGCTTTTGTCTATGAGACCCAAAAACATTGTGTTAGCCAAATGTTCGAAGTGTAA
Protein sequence
MIVTASRIPVPEPIAPMKSANTVSAPMHMPPNAAAVVKSTAIKDELLQKHTGISPESAKWTGSKFPWRNKNNKRKEAGTELSFPLHSESLPSCKSLLSSTLGCSASHENP
LLVLASIVAIVNLQADAAPSGMWVLLILVCSGPLARHLSSLLKWTGSSYKTPQPDGNALQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDSINSNVVKVSPPLSR
CVTTIAGGKTNVPGYSDGPGEEAKFSNDFDVIYVRRTCSLLVIDRGNAALRQISLNKEDCDYQYGSASTADVAMFFGALLIGYVTYMLQHGFGLSVFSLMKFGGRPMPLT
ITFLDLVFKLFAFVYETQKHCVSQMFEV
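
As a quick consistency check between the CDS and protein sequences listed above, a CDS can be translated codon by codon. The sketch below is plain Python with no external libraries. `translate` is a hypothetical helper, the use of the standard genetic code is an assumption (the record does not name a translation table), and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumed to apply to this nuclear gene.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon (hypothetical helper)."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATTGTGACTGCAAGCAGAATTCCTGTC"  # first 30 nt of the CDS above
protein_start = "MIVTASRIPV"                  # first 10 residues of the protein above
print(translate(cds_start) == protein_start)
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.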