; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g12140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g12140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:10325350..10327466
RNA-Seq ExpressionMoc09g12140
SyntenyMoc09g12140
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.3e-9279.58Show/hide
Query:  MCVRKGADG------------RSFFDVPTRF---GNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKL
        MC RKGA G            R +F     +      V+I+PVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRP ESSRPNS+L
Subjt:  MCVRKGADG------------RSFFDVPTRF---GNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKL

Query:  AMVCGFASNVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGAR
        AMVCGFASNVKRKSKG+AHALEAAQS +P   AV G ASEDPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRRKKKKTTSPLEVGAR
Subjt:  AMVCGFASNVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGAR

Query:  GALPASFVDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        G LPASF DRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  GALPASFVDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]4.8e-10879.12Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA---------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA---------------

Query:  --------------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFAS
                       GRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP E SRPNS LAMVC FAS
Subjt:  --------------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE
         VKRKSKGRAHALEAAQS +P   AV G ASEDPAPVIELESS GPSREKRPR        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.4e-8380.89Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAELDWREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLD CLRRASKFV+    +L       ++AFVASIQSALAVKAELD RE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAELDWREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEESFRQHPDFN
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQA+EAK++EL+HATAELET KE LSNG LLEE+FRQHPDF+
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEESFRQHPDFN

Query:  GFAKDFSDAGFKFLMKGIASDMPNL
        GFAKDFSDAGFKFLMKGIASDMP+L
Subjt:  GFAKDFSDAGFKFLMKGIASDMPNL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]8.1e-15683.1Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIP+NILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA--------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMC RKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA--------------------

Query:  ---------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRK
                  GRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP ESSRPNS+LAMVCGFAS VKRK
Subjt:  ---------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRK

Query:  SKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAAD
        SKGRAHALEAAQS +PA  AV G ASEDPA VIELESS GPSREKRPR QTEA D
Subjt:  SKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.4e-13766.28Show/hide
Query:  KGADGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRKSKGRAH
        K   GR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR  E+SRPNS+LAMVCGF  +VKRKSKGRAH
Subjt:  KGADGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRKSKGRAH

Query:  ALEAAQSLEPAASAV--------AGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFVDRV
        AL+     EP    V        +G +S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+RRRKKKKT+S  E GARG LP S  D V
Subjt:  ALEAAQSLEPAASAV--------AGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFVDRV

Query:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLL----CTFSQAFVASIQSALAVKAELDWREALAAREKEEFSAALE
        DDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LD  LRRASKFV+    +L       ++AF+ASI  A+ VKAELD REALAA+E+E   AALE
Subjt:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLL----CTFSQAFVASIQSALAVKAELDWREALAAREKEEFSAALE

Query:  AASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEES
        AA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q +E K+  +   T EL+ +KE L+NG LLEES
Subjt:  AASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEES

Query:  FRQHPDFNGFAKDFSDAGFKFLMKGIASDMPNL
        FRQHPDF+GFAKDFSDAGFKFLMKGIA+DMP+L
Subjt:  FRQHPDFNGFAKDFSDAGFKFLMKGIASDMPNL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092986.2e-9379.58Show/hide
Query:  MCVRKGADG------------RSFFDVPTRF---GNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKL
        MC RKGA G            R +F     +      V+I+PVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRP ESSRPNS+L
Subjt:  MCVRKGADG------------RSFFDVPTRF---GNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKL

Query:  AMVCGFASNVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGAR
        AMVCGFASNVKRKSKG+AHALEAAQS +P   AV G ASEDPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRRKKKKTTSPLEVGAR
Subjt:  AMVCGFASNVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGAR

Query:  GALPASFVDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        G LPASF DRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  GALPASFVDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138262.3e-10879.12Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA---------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMC RKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA---------------

Query:  --------------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFAS
                       GRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP E SRPNS LAMVC FAS
Subjt:  --------------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE
         VKRKSKGRAHALEAAQS +P   AV G ASEDPAPVIELESS GPSREKRPR        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE

A0A6J1D971 uncharacterized protein LOC1110185381.2e-8380.89Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAELDWREALAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLD CLRRASKFV+    +L       ++AFVASIQSALAVKAELD RE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAELDWREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEESFRQHPDFN
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQA+EAK++EL+HATAELET KE LSNG LLEE+FRQHPDF+
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEESFRQHPDFN

Query:  GFAKDFSDAGFKFLMKGIASDMPNL
        GFAKDFSDAGFKFLMKGIASDMP+L
Subjt:  GFAKDFSDAGFKFLMKGIASDMPNL

A0A6J1DXS5 uncharacterized protein LOC1110255023.9e-15683.1Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIP+NILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA--------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMC RKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGA--------------------

Query:  ---------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRK
                  GRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRP ESSRPNS+LAMVCGFAS VKRK
Subjt:  ---------DGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRK

Query:  SKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAAD
        SKGRAHALEAAQS +PA  AV G ASEDPA VIELESS GPSREKRPR QTEA D
Subjt:  SKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAAD

A0A6J1DZB3 uncharacterized protein LOC1110256654.1e-13766.28Show/hide
Query:  KGADGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRKSKGRAH
        K   GR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR  E+SRPNS+LAMVCGF  +VKRKSKGRAH
Subjt:  KGADGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRKSKGRAH

Query:  ALEAAQSLEPAASAV--------AGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFVDRV
        AL+     EP    V        +G +S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+RRRKKKKT+S  E GARG LP S  D V
Subjt:  ALEAAQSLEPAASAV--------AGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFVDRV

Query:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLL----CTFSQAFVASIQSALAVKAELDWREALAAREKEEFSAALE
        DDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LD  LRRASKFV+    +L       ++AF+ASI  A+ VKAELD REALAA+E+E   AALE
Subjt:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLL----CTFSQAFVASIQSALAVKAELDWREALAAREKEEFSAALE

Query:  AASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEES
        AA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q +E K+  +   T EL+ +KE L+NG LLEES
Subjt:  AASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELETVKECLSNGALLEES

Query:  FRQHPDFNGFAKDFSDAGFKFLMKGIASDMPNL
        FRQHPDF+GFAKDFSDAGFKFLMKGIA+DMP+L
Subjt:  FRQHPDFNGFAKDFSDAGFKFLMKGIASDMPNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTAAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGAACTGGATTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTACTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGTAAGGAAAGGCGCAGACGGTCGTT
CCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCG
AGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTACTAGATTACAACCCTGCAGTTCGTCCCACTGAATCCTCAAGGCCGAACTCCAA
ACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTTGGAACCTGCCGCTTCTGCCGTGGCAG
GGTCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCGGATGTCTCGTCCTTG
GGTGAGGAGGTGAGGGAGGAGGCCCCTTTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGTAGA
TCGGGTGGATGATCCTGAGGCCAGGATGGGCGGAACGTCTGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGG
CTGCAAGTTTAGACTGCTGCCTCAGAAGAGCGTCCAAATTTGTAGCTCGGTCTAACTTTCTTCTTTGTACCTTTTCTCAAGCGTTTGTTGCTTCCATTCAATCGGCTCTG
GCCGTGAAGGCCGAGCTAGATTGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAA
AGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCGAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCA
CCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGATTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAG
ACGGTGAAGGAGTGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCT
CATGAAGGGCATTGCTTCCGACATGCCTAACCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTAAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGAACTGGATTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTACTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGTAAGGAAAGGCGCAGACGGTCGTT
CCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCG
AGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTACTAGATTACAACCCTGCAGTTCGTCCCACTGAATCCTCAAGGCCGAACTCCAA
ACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTTGGAACCTGCCGCTTCTGCCGTGGCAG
GGTCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCGGATGTCTCGTCCTTG
GGTGAGGAGGTGAGGGAGGAGGCCCCTTTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGTAGA
TCGGGTGGATGATCCTGAGGCCAGGATGGGCGGAACGTCTGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGG
CTGCAAGTTTAGACTGCTGCCTCAGAAGAGCGTCCAAATTTGTAGCTCGGTCTAACTTTCTTCTTTGTACCTTTTCTCAAGCGTTTGTTGCTTCCATTCAATCGGCTCTG
GCCGTGAAGGCCGAGCTAGATTGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAA
AGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCGAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCA
CCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGATTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAG
ACGGTGAAGGAGTGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCT
CATGAAGGGCATTGCTTCCGACATGCCTAACCTTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRIPEEGERADNPLEGWVTLYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCVRKGADGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFP
RGRKVGTLVTDKLLLESGLLDYNPAVRPTESSRPNSKLAMVCGFASNVKRKSKGRAHALEAAQSLEPAASAVAGSASEDPAPVIELESSEGPSREKRPRGQTEAADVSSL
GEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFVDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDCCLRRASKFVARSNFLLCTFSQAFVASIQSAL
AVKAELDWREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQAIEAKEEELKHATAELE
TVKECLSNGALLEESFRQHPDFNGFAKDFSDAGFKFLMKGIASDMPNL