CuGenDBv2

Moc04g25310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g25310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Translocase subunit seca
Genome location: chr4:18376946..18394642
RNA-Seq Expression: Moc04g25310
Synteny: Moc04g25310
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
KAG6601783.1 hypothetical protein SDJN03_07016, partial [Cucurbita argyrosperma subsp. sororia]    1.6e-60    69.4
Query:  REKETKRKKMNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFL
        R+KE+KRKKMNQ A Q++ FS        A  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS+A ND LDLLF KGGC L+NFNS+ +QLASSPP+L
Subjt:  REKETKRKKMNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        CG+PPSRVANPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  CGSPPSRVANPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

KAG7032498.1 hypothetical protein SDJN02_06547 [Cucurbita argyrosperma subsp. argyrosperma]    2.8e-57    69.54
Query:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA
        MNQ A Q++ FS        A  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS+A ND LDLLF KGGC L+NFNS+S+QLASSPP+LCG+PPSRVA
Subjt:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA

Query:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        NPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

XP_022923501.1 uncharacterized protein LOC111431174 [Cucurbita moschata]    5.7e-58    70.11
Query:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA
        MNQ A Q++ FS        A  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS+A ND LDLLFFKGGC L+NFNS+S+QLASSPP+LCG+PPSRVA
Subjt:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA

Query:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        NPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

XP_022971782.1 uncharacterized protein LOC111470460 [Cucurbita maxima]    3.3e-58    70.69
Query:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA
        MNQ A Q++ FS        A  DRR+++VCPKPRRLGL+NATAN+ +PFR RFDS+A ND LDLLFFKGGC L+NF S+S+QLASSPP+LCG+PPSRVA
Subjt:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA

Query:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        NPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

XP_038906747.1 uncharacterized protein LOC120092671 [Benincasa hispida]    3.3e-58    78.95
Query:  ADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLL---SP
        ADRR+++VCPKPRRLGL+N   N+ +PFR RFDS    D LDLLFFKGGCGL+NFNS+S+QLASSPP+ CGSPPSRVANPLIQDARFGDEKPKLL   SP
Subjt:  ADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLL---SP

Query:  AAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        AA+ PS SPPPS+GRKGGC R+NFGNNPAVRIEGFDCRLDRERRNRS+P LA
Subjt:  AAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KRF1 Uncharacterized protein    1.4e-57    77.42
Query:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLL--
        +A  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS    D LDLLF KGGCGL+NFNS+++QLASSPP+ CGSPPSRVANPLIQDARFGDEKPKLL  
Subjt:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLL--

Query:  -SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
         SP AASP  SPPPS+GRKGGC RVNFG NPAVRIEGFDCRLDRERRNRSIP LA
Subjt:  -SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

A0A1S3CR46 uncharacterized protein LOC103503385    8.9e-57    75.97
Query:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLLS-
        SA  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS    D +DLLF KGGCGL+ FNS+++QLASS P+ CGSPPSRVANPLIQDARFGDEKPKLL  
Subjt:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLLS-

Query:  -PAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
           AASP  SPPPS+GRKGGC RVNFGNNPAVRIEGFDCRLDRERRNRSIP LA
Subjt:  -PAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

A0A5D3C147 Uncharacterized protein    8.9e-57    75.97
Query:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLLS-
        SA  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS    D +DLLF KGGCGL+ FNS+++QLASS P+ CGSPPSRVANPLIQDARFGDEKPKLL  
Subjt:  SATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLLS-

Query:  -PAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
           AASP  SPPPS+GRKGGC RVNFGNNPAVRIEGFDCRLDRERRNRSIP LA
Subjt:  -PAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

A0A6J1E6Z2 uncharacterized protein LOC111431174    2.8e-58    70.11
Query:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA
        MNQ A Q++ FS        A  DRR+++VCPKPRRLGL+N T N+ +PFR RFDS+A ND LDLLFFKGGC L+NFNS+S+QLASSPP+LCG+PPSRVA
Subjt:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA

Query:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        NPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

A0A6J1I2X1 uncharacterized protein LOC111470460    1.6e-58    70.69
Query:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA
        MNQ A Q++ FS        A  DRR+++VCPKPRRLGL+NATAN+ +PFR RFDS+A ND LDLLFFKGGC L+NF S+S+QLASSPP+LCG+PPSRVA
Subjt:  MNQAAFQQSAFS--------ATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVA

Query:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        NPLIQDARFG+EKPK+L   SP  A P  SPPPS+ R GG  R NFGN PAVRIEGFDCRLDRERRNRSIP LA
Subjt:  NPLIQDARFGDEKPKLL---SPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT1G13390.1 unknown protein    4.0e-25    41.3
Query:  MNQAAFQQSAF------SATADRRDQVVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPP-FL
        MN    QQ+AF      +A +DRRD V+CPKPRR+G +N     HH  R             +S + ++ LD +  KGG G      + ++   +PP F 
Subjt:  MNQAAFQQSAF------SATADRRDQVVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPP-FL

Query:  CGSPPSRVANPLIQDARFGDEKPKLLSPAAASP--SPSPPPSSGRKGGC--GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
         GSPPSRV+NPL +D+ F +E   + SP+ ++P  +   PPSS R G C     +FGNNP VR+ GFDC  DR   NRSI TLA
Subjt:  CGSPPSRVANPLIQDARFGDEKPKLLSPAAASP--SPSPPPSSGRKGGC--GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

AT1G13390.2 unknown protein    4.0e-25    41.3
Query:  MNQAAFQQSAF------SATADRRDQVVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPP-FL
        MN    QQ+AF      +A +DRRD V+CPKPRR+G +N     HH  R             +S + ++ LD +  KGG G      + ++   +PP F 
Subjt:  MNQAAFQQSAF------SATADRRDQVVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPP-FL

Query:  CGSPPSRVANPLIQDARFGDEKPKLLSPAAASP--SPSPPPSSGRKGGC--GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
         GSPPSRV+NPL +D+ F +E   + SP+ ++P  +   PPSS R G C     +FGNNP VR+ GFDC  DR   NRSI TLA
Subjt:  CGSPPSRVANPLIQDARFGDEKPKLLSPAAASP--SPSPPPSSGRKGGC--GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

AT1G68490.1 unknown protein    3.0e-33    51.05
Query:  MNQAAFQQSAFSATAD---------RRDQ--VVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASS
        MN  A Q +AF+A  D          RDQ  VVCPKPRR+GL N   N HHP R             +S A  D LD++  K G G +  N    Q+  S
Subjt:  MNQAAFQQSAFSATAD---------RRDQ--VVCPKPRRLGLVNATANEHHPFR----------PRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASS

Query:  P-PFLCGSPPSRVANPLIQDARFGDEKPKLLS----PAAASPSPSPPPSSGRKGGC-GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
        P PFLCGSPPSRVANPL QDARF DE   + S         PS SP  SSGRKGGC  R NFGN+P VR+EGFDC LDR+ RN SIP LA
Subjt:  P-PFLCGSPPSRVANPLIQDARFGDEKPKLLS----PAAASPSPSPPPSSGRKGGC-GRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA

AT3G02555.1 unknown protein    3.3e-27    47.13
Query:  MNQAAFQQSAFSATADRR----------DQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSR
        MN  + QQ+AF +  + R          D VVCPKPRR        N   PFR  F     +D  D     G   LD F    S  + SPPF  GSPPSR
Subjt:  MNQAAFQQSAFSATADRR----------DQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSR

Query:  VANPLIQDARFGDEKPKLLSPAAASPSPSPPPSSGRKGGCGRVNFGNNPA-VRIEGFDCRLDRERRNRSIPTLA
         ANPL QDARFGDEK   +SP   S SP  P +S  K GCGR+ FG  PA VR+EGFDC L+R+R N SIP +A
Subjt:  VANPLIQDARFGDEKPKLLSPAAASPSPSPPPSSGRKGGCGRVNFGNNPA-VRIEGFDCRLDRERRNRSIPTLA

AT5G16110.1 unknown protein    3.9e-28    45.45
Query:  KKMNQAAFQQSAFSATA-----DRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGL-------DNFNSNSSQLASSPPFLCGS
        KKMN    QQ+AF +       DR+D VVCPKPRR+GL+    N   P R      A     DL   K G  L       ++  +    L+SSPP+  GS
Subjt:  KKMNQAAFQQSAFSATA-----DRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDPLDLLFFKGGCGL-------DNFNSNSSQLASSPPFLCGS

Query:  PPSRVANPLIQDARFGDEKPKLLSP---------AAASPSPSPPPSSGRKGGCGRVNFG-NNPAVRIEGFDCRLDRERRNRSIPTLA
        PPSR ANPL QDARF DEK   +SP         A   PSPS   SS    GC R+ FG N+PAVR+EGFDC L+R+R+N SIP +A
Subjt:  PPSRVANPLIQDARFGDEKPKLLSP---------AAASPSPSPPPSSGRKGGCGRVNFG-NNPAVRIEGFDCRLDRERRNRSIPTLA


Sequences
CDS sequence
ATGCGCCTGCACGGGAACCGAAATTCCACGAAGGTCACACTACTCAGTCTCCGATCTAATCTTAAAGGTAAAGCCCGGGTCCCCAGTGGAGTCGCCAAGCTGTCGCAACG
TCATCGCGACGCGGAGCGGCCGACGTCATCCCTTGATGGGTCTCTTTCGAGATTAGCTGAGGAAATTCTAGGTGTGCCCGCGGGTCTGAAAACTGTGAAGGAGCACTCGA
GGTCGAAGTCAGGTGGTCGTCTGTGTATCGTCGGGGTCGCCGGCCTGGGTGAAGAAGAAGAGAAGGTGATGGTAGAGAGAAAGAAGGAAGTGAATTCTTCTTTTGAGTTT
GAAAATATATTCGAGGAAGATGAGAATTCACCGATTACAGAGTTTAAAGAAATGGTGTTAAGTAGCCTTGATAACATGAATTGCAAAATCAATACACTCTTTTCTAAGAT
TGAGAGTGTTAAACGACTGGTGCATGAGAGATTAACTGATATCAGAAAGGAGGATGGGGACGGTGATGGGAATGGGGTGGAAGGGATGATAACAACGGTGGACAAGGAGA
ACATCGAGCAAAATGATATTATATCGGAGAAGGGAGACATGTACATCGATGCTATAAGTATTGGATTTATTCCCAAGGATATTTGGCGTATGTCCCTCTTGCTTAAGGTT
GCACCTCCTCAACTCGTCCTTATTGTCATTCGACAGAATCCTGCGTCAATAGCCCCTAATTCCATCCTCCGCTCTTTCTGTCAAATAGCAGCATCAGTGTTCATCTTCAG
AATATCAGAGGGAGATCTTGTTCTTCCAGGCAGAGAAAAAGAAACGAAGAGGAAGAAGATGAACCAGGCGGCGTTTCAACAGAGTGCGTTTTCCGCCACCGCCGACCGCC
GTGACCAGGTGGTTTGCCCGAAACCCCGCCGTCTCGGACTCGTTAACGCCACCGCCAACGAACACCACCCATTTAGACCCCGTTTTGACTCCATGGCACCCAATGATCCT
CTGGATCTCCTCTTCTTCAAGGGTGGTTGTGGATTGGACAATTTTAACAGTAACAGTTCACAATTAGCCTCGTCGCCGCCGTTTTTATGTGGGTCGCCGCCGAGCAGAGT
AGCTAACCCATTAATCCAGGACGCCCGATTTGGGGATGAGAAGCCGAAGCTCCTCTCGCCGGCAGCGGCGTCTCCGTCGCCGTCGCCGCCGCCGTCGTCCGGTCGGAAAG
GCGGCTGTGGTCGGGTGAATTTCGGTAACAATCCAGCGGTGAGGATCGAGGGGTTCGATTGCCGCCTCGACCGGGAGAGGCGAAACCGCAGCATCCCTACTCTGGCTTAG
mRNA sequence
ATGCGCCTGCACGGGAACCGAAATTCCACGAAGGTCACACTACTCAGTCTCCGATCTAATCTTAAAGGTAAAGCCCGGGTCCCCAGTGGAGTCGCCAAGCTGTCGCAACG
TCATCGCGACGCGGAGCGGCCGACGTCATCCCTTGATGGGTCTCTTTCGAGATTAGCTGAGGAAATTCTAGGTGTGCCCGCGGGTCTGAAAACTGTGAAGGAGCACTCGA
GGTCGAAGTCAGGTGGTCGTCTGTGTATCGTCGGGGTCGCCGGCCTGGGTGAAGAAGAAGAGAAGGTGATGGTAGAGAGAAAGAAGGAAGTGAATTCTTCTTTTGAGTTT
GAAAATATATTCGAGGAAGATGAGAATTCACCGATTACAGAGTTTAAAGAAATGGTGTTAAGTAGCCTTGATAACATGAATTGCAAAATCAATACACTCTTTTCTAAGAT
TGAGAGTGTTAAACGACTGGTGCATGAGAGATTAACTGATATCAGAAAGGAGGATGGGGACGGTGATGGGAATGGGGTGGAAGGGATGATAACAACGGTGGACAAGGAGA
ACATCGAGCAAAATGATATTATATCGGAGAAGGGAGACATGTACATCGATGCTATAAGTATTGGATTTATTCCCAAGGATATTTGGCGTATGTCCCTCTTGCTTAAGGTT
GCACCTCCTCAACTCGTCCTTATTGTCATTCGACAGAATCCTGCGTCAATAGCCCCTAATTCCATCCTCCGCTCTTTCTGTCAAATAGCAGCATCAGTGTTCATCTTCAG
AATATCAGAGGGAGATCTTGTTCTTCCAGGCAGAGAAAAAGAAACGAAGAGGAAGAAGATGAACCAGGCGGCGTTTCAACAGAGTGCGTTTTCCGCCACCGCCGACCGCC
GTGACCAGGTGGTTTGCCCGAAACCCCGCCGTCTCGGACTCGTTAACGCCACCGCCAACGAACACCACCCATTTAGACCCCGTTTTGACTCCATGGCACCCAATGATCCT
CTGGATCTCCTCTTCTTCAAGGGTGGTTGTGGATTGGACAATTTTAACAGTAACAGTTCACAATTAGCCTCGTCGCCGCCGTTTTTATGTGGGTCGCCGCCGAGCAGAGT
AGCTAACCCATTAATCCAGGACGCCCGATTTGGGGATGAGAAGCCGAAGCTCCTCTCGCCGGCAGCGGCGTCTCCGTCGCCGTCGCCGCCGCCGTCGTCCGGTCGGAAAG
GCGGCTGTGGTCGGGTGAATTTCGGTAACAATCCAGCGGTGAGGATCGAGGGGTTCGATTGCCGCCTCGACCGGGAGAGGCGAAACCGCAGCATCCCTACTCTGGCTTAG
Protein sequence
MRLHGNRNSTKVTLLSLRSNLKGKARVPSGVAKLSQRHRDAERPTSSLDGSLSRLAEEILGVPAGLKTVKEHSRSKSGGRLCIVGVAGLGEEEEKVMVERKKEVNSSFEF
ENIFEEDENSPITEFKEMVLSSLDNMNCKINTLFSKIESVKRLVHERLTDIRKEDGDGDGNGVEGMITTVDKENIEQNDIISEKGDMYIDAISIGFIPKDIWRMSLLLKV
APPQLVLIVIRQNPASIAPNSILRSFCQIAASVFIFRISEGDLVLPGREKETKRKKMNQAAFQQSAFSATADRRDQVVCPKPRRLGLVNATANEHHPFRPRFDSMAPNDP
LDLLFFKGGCGLDNFNSNSSQLASSPPFLCGSPPSRVANPLIQDARFGDEKPKLLSPAAASPSPSPPPSSGRKGGCGRVNFGNNPAVRIEGFDCRLDRERRNRSIPTLA
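
The protein sequence above appears to be the frame-0 translation of the CDS. The following is a minimal Python sketch for spot-checking that relationship, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the variable names are illustrative and not part of CuGenDB; the two test fragments are the first 30 and last 12 nucleotides of the CDS listed above.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table is appropriate for the CDS; the record does not state it.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    block can be pasted in directly. Ambiguous bases (e.g. N) are not
    handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the CDS sequence in this record.
cds_start = "ATGCGCCTGCACGGGAACCGAAATTCCACG"  # first 30 nt
cds_end = "ACTCTGGCTTAG"                      # last 12 nt, including the stop codon

print(translate(cds_start))  # MRLHGNRNST
print(translate(cds_end))    # TLA
```

Both fragments agree with the start (MRLHGNRNST) and end (TLA) of the protein sequence in this record. Passing the full CDS block to `translate` would allow a complete comparison.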