CuGenDBv2

Moc04g32400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g32400
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: UPF0496 protein 3-like
Genome location: chr4:24365041..24366177
RNA-Seq Expression: Moc04g32400
Synteny: Moc04g32400
Gene Ontology terms: NA
InterPro domains: IPR007749 - Protein of unknown function DUF677
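The genome location string in the record can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr4:24365041..24366177")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span works out to 1137 bp, which can be compared against the length of the CDS listed under Sequences.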


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602186.1 UPF0496 protein 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.5e-118; % identity: 68.75)
Query:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK
        S+ + IS FAD K QIRNKPQKSFNVN+EY+CALR++S+VEFF KAQ +++E SPPSTSSS      RK S TILL+  Q EA PSILESP L+MLP+L+
Subjt:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK

Query:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI
         LLID+FNVSAEAS  C RLL +++L RSNSR +QK LDSIE CSSP+ IETI S+ LA R P SD DK DFA IHDDY AVSR LN TRKKVARKI+SI
Subjt:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI

Query:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV
        +II+  +CGLVAIT+R LT L            F  +S RRKLLR+QM+ NGGL  VGEQLEAAAKGSYILNREFDTTSRLV RL DA+DHGKAM RLFV
Subjt:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV

Query:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ERKEDKFAV VAMDE+K+SN+ +R QVE+VEEHLYLC VTINR+R  VINQ+
Subjt:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

KAG7032870.1 UPF0496 protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.2e-118; % identity: 68.75)
Query:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK
        S+ + IS FAD K QIRNKPQKSFNVN+EY+CALR++S+VEFF KAQ +++E SPPSTSSS      RK S TILL+  Q EA PSILESP L+MLP+L+
Subjt:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK

Query:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI
         LLID+FNVSAEAS  C RLL +++L RSNSR +QK LDSIE CSSP+ IETI S+ LA R P SD DK DFA IHDDY AVSR LN TRKKVARKI+SI
Subjt:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI

Query:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV
        +II+  +CGLVAIT+R LT L            F  +S RRKLLR+QM+ NGGL  VGEQLEAAAKGSYILNREFDTTSRLV RL DA+DHGKAM RLFV
Subjt:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV

Query:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ERKEDKFAV VAMDE+K+SN+ +R QVE+VEEHLYLC VTINR+R  VINQ+
Subjt:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

XP_022133537.1 UPF0496 protein 3-like [Momordica charantia] (e value: 6.0e-205; % identity: 100)
Query:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK
        MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK
Subjt:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK

Query:  LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH
        LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH
Subjt:  LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH

Query:  DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR
        DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR
Subjt:  DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR

Query:  LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
        LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
Subjt:  LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP

XP_022990396.1 UPF0496 protein 3-like [Cucurbita maxima] (e value: 3.2e-118; % identity: 69.32)
Query:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK
        S+ + IS FAD K QIRNKPQKSFNVN+EY+CALR++S+VEFF KAQ +++E SPPST SSTG    RK S TILL+  Q EA PSILESP L+MLP+L+
Subjt:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK

Query:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI
         LLID+FNVSAEAS  C RLL +++L RSNSR +QK LDSIE CSSP+ IETI S+ LA R P SD DK DFA IHDDY AVSR LN TRKKVARKI+SI
Subjt:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI

Query:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV
        +II   +CGLVAIT+R LT L            F  +S RRKLLR+QM+ NGGL  VGEQLEAAAKGSYILNREFDTTSRLV RL DA+DHGKAM RLFV
Subjt:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV

Query:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ERKEDKFAV VAMDE+K+SN+ +R QVE+VEEHLYLCIVTINR+R  VINQ+
Subjt:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

XP_038884256.1 UPF0496 protein At3g49070-like [Benincasa hispida] (e value: 1.0e-124; % identity: 64.94)
Query:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFS--QPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRR
        MW K R  +IRNST +S SI AC  A L NFS  QPV IS FAD +Q+I  KPQKSFNVN+EY+C LRTKS+ EFF KA+ +++ES P  TSSS    R 
Subjt:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFS--QPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRR

Query:  RKLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDP
        RKLS TILL+P + EA+PSILES FLLMLP+LK L IDY NVSA+AS  CTRLL +V+ TRS SR IQ+SLDSIE C S ETIE+IAS+ L+LR PFSD 
Subjt:  RKLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDP

Query:  DKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTE----------LFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKG
        DK DF LIH DY A+SRRLNCTRKKV RKI++I+II+ I+CGL +IT RTLT+          +F +  +S  RKL+R++M+RNGGL +VGE++EAAAKG
Subjt:  DKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTE----------LFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKG

Query:  SYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        SYI+ RE DTTSRLV RL DA+DHGKAM RLF  RKEDKFAV VAMDE+KK+N+ +R QVE+VEEHLYLCIVTINR+R  VI Q+
Subjt:  SYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B3L4 UPF0496 protein 3-like (e value: 9.2e-111; % identity: 60.64)
Query:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQ-IRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRR
        MW K    KI NST +S SI AC              S FAD + + IR K ++SFNVN+EY+C LRT+S+ EFF+K +S + ES P +TSSS+      
Subjt:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQ-IRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRR

Query:  KLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPD
        + S TILL+P Q EA+PSILES FLLMLP+LK L +DYFN+SA+AS  CTRLL + +LTRS SR IQ+SLDSIE C S ET+E+IAS  LALR PFSD +
Subjt:  KLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPD

Query:  KHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFP---PESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREF
        K DFALIHDDY  +S RLNCTRKKVARKI+S++I++ I+CGL AIT RTLT+L +     P  F RKLLR++M+RNGGL +VGE+LEAAAKGSYIL RE 
Subjt:  KHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFP---PESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREF

Query:  DTTSRLVARLDDAMDHGKAMARLFVER-KEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN
        +TTSRLV RL DA+D+GKAM RLF  R KEDKF V VAMDE+KK+N  +R +VE+VEEHL LCIV INR++  +IN
Subjt:  DTTSRLVARLDDAMDHGKAMARLFVER-KEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN

A0A5D3DGL5 UPF0496 protein 3-like (e value: 9.2e-111; % identity: 60.64)
Query:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQ-IRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRR
        MW K    KI NST +S SI AC              S FAD + + IR K ++SFNVN+EY+C LRT+S+ EFF+K +S + ES P +TSSS+      
Subjt:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQ-IRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRR

Query:  KLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPD
        + S TILL+P Q EA+PSILES FLLMLP+LK L +DYFN+SA+AS  CTRLL + +LTRS SR IQ+SLDSIE C S ET+E+IAS  LALR PFSD +
Subjt:  KLSGTILLEPSQ-EAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPD

Query:  KHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFP---PESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREF
        K DFALIHDDY  +S RLNCTRKKVARKI+S++I++ I+CGL AIT RTLT+L +     P  F RKLLR++M+RNGGL +VGE+LEAAAKGSYIL RE 
Subjt:  KHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFP---PESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREF

Query:  DTTSRLVARLDDAMDHGKAMARLFVER-KEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN
        +TTSRLV RL DA+D+GKAM RLF  R KEDKF V VAMDE+KK+N  +R +VE+VEEHL LCIV INR++  +IN
Subjt:  DTTSRLVARLDDAMDHGKAMARLFVER-KEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN

A0A6J1BWZ5 UPF0496 protein 3-like (e value: 2.9e-205; % identity: 100)
Query:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK
        MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK
Subjt:  MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRK

Query:  LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH
        LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH
Subjt:  LSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKH

Query:  DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR
        DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR
Subjt:  DFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSR

Query:  LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
        LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
Subjt:  LVARLDDAMDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP

A0A6J1H6T3 UPF0496 protein 3-like (e value: 1.6e-118; % identity: 68.75)
Query:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK
        S+ + IS FAD K QIRNKPQKSFNVN+EY+CALR++S+VEFF KAQ +++E SPPSTSSS      RK S TILL+  Q EA PSILESP L+MLP+L+
Subjt:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK

Query:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI
         LLID+FNVSAEAS  C RLL +++L RSNSR +QK LDSIE CSSP+ IETI S+ LA R P SD DK DFA IHDDY AVSR LN TRKKVARKI+SI
Subjt:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI

Query:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV
        +II+  +CGLVAIT+R LT L            F  +S RRKLLR+QM+ NGGL  VGEQLEAAAKGSYILNREFDTTSRLV RL DA+DHGKAM RLFV
Subjt:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV

Query:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ERKE+KFAV VAMDE+K+SN+ +R QVE+VEEHLYLCIVTINR+R  VINQ+
Subjt:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

A0A6J1JMU4 UPF0496 protein 3-like (e value: 1.6e-118; % identity: 69.32)
Query:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK
        S+ + IS FAD K QIRNKPQKSFNVN+EY+CALR++S+VEFF KAQ +++E SPPST SSTG    RK S TILL+  Q EA PSILESP L+MLP+L+
Subjt:  SQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQ-EAIPSILESPFLLMLPDLK

Query:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI
         LLID+FNVSAEAS  C RLL +++L RSNSR +QK LDSIE CSSP+ IETI S+ LA R P SD DK DFA IHDDY AVSR LN TRKKVARKI+SI
Subjt:  SLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSI

Query:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV
        +II   +CGLVAIT+R LT L            F  +S RRKLLR+QM+ NGGL  VGEQLEAAAKGSYILNREFDTTSRLV RL DA+DHGKAM RLFV
Subjt:  EIIETISCGLVAITARTLTELFR----------FPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFV

Query:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ERKEDKFAV VAMDE+K+SN+ +R QVE+VEEHLYLCIVTINR+R  VINQ+
Subjt:  ERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

SwissProt top hits (e value, % identity, alignment)
A2XCJ1 UPF0496 protein 3 (e value: 2.1e-27; % identity: 31.23)
Query:  PQKSFNVNDEYICALRTKSYVEFFIK--------AQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLML-PDLKSLLIDYFNVSA
        P  SF+  +EY  A RT+SY +F+ +          +++         +++ R    +L    LLEP Q A+ + L SP    L PD++ LL  Y+  +A
Subjt:  PQKSFNVNDEYICALRTKSYVEFFIK--------AQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLML-PDLKSLLIDYFNVSA

Query:  EASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFAL--IHDDYEAVSRRLNCTRKKVARKIKSI-EIIETISC
         AS  C+ LL+D+E  R   R ++ +L  +   +S   +  +A    AL +PF+        L  +      + R L+  RKK   +I+S+  +   +S 
Subjt:  EASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFAL--IHDDYEAVSRRLNCTRKKVARKIKSI-EIIETISC

Query:  GLVAITA-----------RTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKF
          V   A             L     FP  S     L  +            QLEAAAKG+YILNR+ +T SRLVAR+ D  +H  A+ RL VE +    
Subjt:  GLVAITA-----------RTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKF

Query:  A------VQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN
        A      VQ  + +L K+    R Q++E+EEHL+LC +TIN+AR +V+N
Subjt:  A------VQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN

A2YH25 Putative UPF0496 protein 2 (e value: 1.4e-18; % identity: 26.14)
Query:  LKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIK
        +++LLI+YF+V+ EA + C+ LL  +   R +  ++++ L  ++     +  + +A   + L  P S     +F  +H     ++ RL   ++++ R  +
Subjt:  LKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKIK

Query:  SIEIIE-TISCGLVAITARTL-----------------TELFRFPPESFRRKLLR--YQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDA
        ++ I   T +  LV   A  +                    F   P    R   R   + + +      G  L+AAA+G+YI+ R+ DT SR+V R  D 
Subjt:  SIEIIE-TISCGLVAITARTL-----------------TELFRFPPESFRRKLLR--YQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDA

Query:  MDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI
        ++HG+ +AR+ +    ++  +Q    E ++    LR Q+ E+EEH+ LC++TINR R LV +++
Subjt:  MDHGKAMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQI

Q10RR9 UPF0496 protein 3 (e value: 8.0e-27; % identity: 30.95)
Query:  PQKSFNVNDEYICALRTKSYVEFFIK--------AQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLML-PDLKSLLIDYFNVSA
        P  SF+  +EY  A RT+SY +F+ +          +++         +++ R    +L    LLEP Q A+ + L SP    L PD++ LL  Y+  +A
Subjt:  PQKSFNVNDEYICALRTKSYVEFFIK--------AQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLML-PDLKSLLIDYFNVSA

Query:  EASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFAL--IHDDYEAVSRRLNCTRKKVARKIKSI-EIIETISC
         AS  C+ LL+D+E  R   R ++ +L  +   +S   +  +A    AL +PF+        L  +      + R L+  RKK   +I+S+  +   +S 
Subjt:  EASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFAL--IHDDYEAVSRRLNCTRKKVARKIKSI-EIIETISC

Query:  GLVAITA-----------RTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKF
          V   A             L     FP  S     L  +            QLEAAAKG+YILNR+ +T SRLVAR+ D  +H  A+ RL VE +    
Subjt:  GLVAITA-----------RTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKF

Query:  A------VQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN
        A      VQ  + +L K+    R Q++E+EEHL+LC +T N+AR +V+N
Subjt:  A------VQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVIN

Q6DYE5 UPF0496 protein At1g20180 (e value: 3.7e-24; % identity: 27.82)
Query:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT
        +VN+EY  A RT SY+E   KA+  +         SS PS SSS+        +   LL+P QE + ++++         L +L++ +F++S+EA   C 
Subjt:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT

Query:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIE-
         LL+ ++  + N   I++ +   +  C+  +T+E              S F AL+ P     ++  F ++HD    +  +L   ++++ RKI+  +  + 
Subjt:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIE-

Query:  -----------TISCGLVAITARTLTELFRFPPE----SF---RRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGK
                    I   L+ I   ++  +F  P      SF   R+K  + +M ++     L ++G Q++ AAKG +IL  + DT SRL  RL D ++H K
Subjt:  -----------TISCGLVAITARTLTELFRFPPE----SF---RRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGK

Query:  AMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS
         +A +  + ++ +  ++ A+ E      +  +Q++E+EEHLYLC  TINR+R LV+ QI  +S
Subjt:  AMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS

Q9SMU4 UPF0496 protein At3g49070 (e value: 2.3e-26; % identity: 30.83)
Query:  ISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEE--------SSPPSTSSSTGRR-RRRKLSGTILLEPSQEAIPSILESPFLLMLP
        +S+   A      K     +V +EY  A RT+SY  F+ +   +  +        SSPP  SSST  R    +L    LL+P    I  IL+     +  
Subjt:  ISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEE--------SSPPSTSSSTGRR-RRRKLSGTILLEPSQEAIPSILESPFLLMLP

Query:  DLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKI
          ++LL DYF  +A A   CT+LL+++   RS   S++    S EN +S   I+   +E     +PF         LI      + +RL   R K   K+
Subjt:  DLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKI

Query:  KSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRN--------GGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLF
        K I  + T S GL+ + A T T +      +F   L    ++ +          L +   +L+ AAKG+YIL+R+ DT SRLV R++D ++H +AMA  +
Subjt:  KSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRN--------GGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLF

Query:  VERKEDKF-AVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
        V R   +    +    ELK+       +++E+EEH+YLC +TINRAR L++ +I     P
Subjt:  VERKEDKF-AVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP

Arabidopsis top hits (e value, % identity, alignment)
AT1G20180.1 Protein of unknown function (DUF677) (e value: 2.6e-25; % identity: 27.82)
Query:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT
        +VN+EY  A RT SY+E   KA+  +         SS PS SSS+        +   LL+P QE + ++++         L +L++ +F++S+EA   C 
Subjt:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT

Query:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIE-
         LL+ ++  + N   I++ +   +  C+  +T+E              S F AL+ P     ++  F ++HD    +  +L   ++++ RKI+  +  + 
Subjt:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIE-

Query:  -----------TISCGLVAITARTLTELFRFPPE----SF---RRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGK
                    I   L+ I   ++  +F  P      SF   R+K  + +M ++     L ++G Q++ AAKG +IL  + DT SRL  RL D ++H K
Subjt:  -----------TISCGLVAITARTLTELFRFPPE----SF---RRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGK

Query:  AMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS
         +A +  + ++ +  ++ A+ E      +  +Q++E+EEHLYLC  TINR+R LV+ QI  +S
Subjt:  AMARLFVERKEDKFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS

AT1G20180.2 Protein of unknown function (DUF677) (e value: 2.6e-25; % identity: 28.49)
Query:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT
        +VN+EY  A RT SY+E   KA+  +         SS PS SSS+        +   LL+P QE + ++++         L +L++ +F++S+EA   C 
Subjt:  NVNDEYICALRTKSYVEFFIKAQSVIE-------ESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCT

Query:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIET
         LL+ ++  + N   I++ +   +  C+  +T+E              S F AL+ P     ++  F ++HD    +  +L   ++++ RKI  + +   
Subjt:  RLLEDVELTRSNSRSIQKSLDSIEN-CSSPETIETI-----------ASEFLALREPFSD-PDKHDFALIHDDYEAVSRRLNCTRKKVARKIKSIEIIET

Query:  ISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKFAVQVA
                 A  L  L  F     R+K  + +M ++     L ++G Q++ AAKG +IL  + DT SRL  RL D ++H K +A +  + ++ +  ++ A
Subjt:  ISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNG---GLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKFAVQVA

Query:  MDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS
        + E      +  +Q++E+EEHLYLC  TINR+R LV+ QI  +S
Subjt:  MDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKS

AT3G19330.2 Protein of unknown function (DUF677) (e value: 4.5e-17; % identity: 24.36)
Query:  PVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLL
        P L S  A   +   +    +FN++ E   A +T SY +   +   V++ +               +L  + +L+P++E +   +     +    L +L+
Subjt:  PVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLL

Query:  IDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSI-ENCSSPETIETIA----SEFLAL---REPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVAR
          YF  S +A++ C  L ++V   R +  +    L +I    S P   E++       FL L     PFS P+ + F    D     S+  +   +++ +
Subjt:  IDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSI-ENCSSPETIETIA----SEFLAL---REPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVAR

Query:  KIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKED
            + +I   + G +            + P SF+RK L               QL AA+KG+++LN++ DT  RLV+RL   +++ K + RL +ER  D
Subjt:  KIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKED

Query:  KFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQM
          ++Q  +  L+KS+L L +Q++++E+H+ L    +N+AR L++ +I +
Subjt:  KFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQM

AT3G19330.3 Protein of unknown function (DUF677) (e value: 4.5e-17; % identity: 24.36)
Query:  PVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLL
        P L S  A   +   +    +FN++ E   A +T SY +   +   V++ +               +L  + +L+P++E +   +     +    L +L+
Subjt:  PVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPSQEAIPSILESPFLLMLPDLKSLL

Query:  IDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSI-ENCSSPETIETIA----SEFLAL---REPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVAR
          YF  S +A++ C  L ++V   R +  +    L +I    S P   E++       FL L     PFS P+ + F    D     S+  +   +++ +
Subjt:  IDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSI-ENCSSPETIETIA----SEFLAL---REPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVAR

Query:  KIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKED
            + +I   + G +            + P SF+RK L               QL AA+KG+++LN++ DT  RLV+RL   +++ K + RL +ER  D
Subjt:  KIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKED

Query:  KFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQM
          ++Q  +  L+KS+L L +Q++++E+H+ L    +N+AR L++ +I +
Subjt:  KFAVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQM

AT3G49070.1 Protein of unknown function (DUF677) (e value: 1.6e-27; % identity: 30.83)
Query:  ISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEE--------SSPPSTSSSTGRR-RRRKLSGTILLEPSQEAIPSILESPFLLMLP
        +S+   A      K     +V +EY  A RT+SY  F+ +   +  +        SSPP  SSST  R    +L    LL+P    I  IL+     +  
Subjt:  ISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEE--------SSPPSTSSSTGRR-RRRKLSGTILLEPSQEAIPSILESPFLLMLP

Query:  DLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKI
          ++LL DYF  +A A   CT+LL+++   RS   S++    S EN +S   I+   +E     +PF         LI      + +RL   R K   K+
Subjt:  DLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTRKKVARKI

Query:  KSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRN--------GGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLF
        K I  + T S GL+ + A T T +      +F   L    ++ +          L +   +L+ AAKG+YIL+R+ DT SRLV R++D ++H +AMA  +
Subjt:  KSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRN--------GGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLF

Query:  VERKEDKF-AVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
        V R   +    +    ELK+       +++E+EEH+YLC +TINRAR L++ +I     P
Subjt:  VERKEDKF-AVQVAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
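The alignments above follow the usual BLAST layout, in which the unlabeled middle line shows a letter for an identical residue, '+' for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how identity over a stretch of alignment could be tallied from that middle line, assuming the line has already been stripped of its leading padding (the helper name midline_stats is hypothetical):

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for a BLAST-style midline.

    Letters mark identical residues, '+' marks a conservative
    substitution, and a space marks a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 15 alignment columns of the KAG6602186.1 hit (query: SQPVLISAFADAKQQ).
midline = "S+ + IS FAD K Q"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
print(f"{100 * identities / columns:.2f}% identity over this window")
```

This only covers a short window for illustration; the % identity values reported on the page are computed over each full alignment.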


Sequences
CDS sequence
ATGTGGCCGAAAATTAGGGATTTGAAAATCAGAAATAGTACCCATATTTCTTCATCGATCTTTGCTTGCTCAGCTGCACAGTTGCTTAATTTTTCTCAACCCGTTTTGAT
TTCAGCTTTTGCAGATGCGAAGCAGCAAATCAGAAACAAGCCGCAGAAAAGCTTCAATGTAAACGACGAGTATATTTGTGCTCTGAGGACCAAATCCTACGTTGAATTCT
TCATAAAAGCTCAATCGGTCATCGAAGAATCATCGCCGCCGTCTACATCCTCCTCCACCGGCCGCCGCCGTCGCCGCAAATTATCGGGAACGATTTTGCTCGAGCCTAGT
CAAGAAGCCATTCCTTCAATTCTTGAATCGCCATTTCTTCTGATGTTGCCCGATCTTAAAAGCCTCTTAATCGATTACTTCAATGTCAGTGCGGAGGCTTCAAAATTCTG
CACTCGTCTCCTTGAGGACGTCGAATTAACCAGATCTAACTCCCGCTCCATCCAAAAATCGCTCGATTCGATCGAGAATTGCTCTTCTCCGGAGACAATAGAAACAATCG
CCTCTGAATTTCTCGCGCTGCGGGAGCCATTTTCCGATCCTGACAAACACGATTTCGCGCTGATCCACGACGATTACGAGGCAGTTTCGCGTCGCCTGAACTGCACGAGG
AAGAAGGTAGCCAGAAAGATCAAATCGATCGAAATTATCGAGACGATTTCGTGTGGATTGGTCGCCATTACAGCTCGCACTCTCACAGAACTATTCCGATTCCCCCCGGA
ATCCTTCCGCAGAAAGCTTCTCAGATATCAGATGATTAGAAATGGCGGCCTCGGCGAAGTAGGCGAACAACTGGAGGCGGCAGCGAAGGGAAGTTACATACTGAACAGAG
AGTTCGACACGACGAGCCGACTCGTGGCGCGACTGGACGACGCCATGGATCACGGCAAGGCGATGGCGCGATTGTTTGTGGAAAGGAAGGAGGATAAATTTGCAGTTCAG
GTAGCCATGGATGAGCTGAAGAAGAGCAATCTGAGGCTGAGAAATCAAGTTGAAGAGGTTGAAGAACACCTGTATTTGTGCATTGTGACGATTAACAGAGCCAGAGGGCT
GGTGATTAACCAGATCCAGATGAAATCGCATCCTTAA
mRNA sequence
ATGTGGCCGAAAATTAGGGATTTGAAAATCAGAAATAGTACCCATATTTCTTCATCGATCTTTGCTTGCTCAGCTGCACAGTTGCTTAATTTTTCTCAACCCGTTTTGAT
TTCAGCTTTTGCAGATGCGAAGCAGCAAATCAGAAACAAGCCGCAGAAAAGCTTCAATGTAAACGACGAGTATATTTGTGCTCTGAGGACCAAATCCTACGTTGAATTCT
TCATAAAAGCTCAATCGGTCATCGAAGAATCATCGCCGCCGTCTACATCCTCCTCCACCGGCCGCCGCCGTCGCCGCAAATTATCGGGAACGATTTTGCTCGAGCCTAGT
CAAGAAGCCATTCCTTCAATTCTTGAATCGCCATTTCTTCTGATGTTGCCCGATCTTAAAAGCCTCTTAATCGATTACTTCAATGTCAGTGCGGAGGCTTCAAAATTCTG
CACTCGTCTCCTTGAGGACGTCGAATTAACCAGATCTAACTCCCGCTCCATCCAAAAATCGCTCGATTCGATCGAGAATTGCTCTTCTCCGGAGACAATAGAAACAATCG
CCTCTGAATTTCTCGCGCTGCGGGAGCCATTTTCCGATCCTGACAAACACGATTTCGCGCTGATCCACGACGATTACGAGGCAGTTTCGCGTCGCCTGAACTGCACGAGG
AAGAAGGTAGCCAGAAAGATCAAATCGATCGAAATTATCGAGACGATTTCGTGTGGATTGGTCGCCATTACAGCTCGCACTCTCACAGAACTATTCCGATTCCCCCCGGA
ATCCTTCCGCAGAAAGCTTCTCAGATATCAGATGATTAGAAATGGCGGCCTCGGCGAAGTAGGCGAACAACTGGAGGCGGCAGCGAAGGGAAGTTACATACTGAACAGAG
AGTTCGACACGACGAGCCGACTCGTGGCGCGACTGGACGACGCCATGGATCACGGCAAGGCGATGGCGCGATTGTTTGTGGAAAGGAAGGAGGATAAATTTGCAGTTCAG
GTAGCCATGGATGAGCTGAAGAAGAGCAATCTGAGGCTGAGAAATCAAGTTGAAGAGGTTGAAGAACACCTGTATTTGTGCATTGTGACGATTAACAGAGCCAGAGGGCT
GGTGATTAACCAGATCCAGATGAAATCGCATCCTTAA
Protein sequence
MWPKIRDLKIRNSTHISSSIFACSAAQLLNFSQPVLISAFADAKQQIRNKPQKSFNVNDEYICALRTKSYVEFFIKAQSVIEESSPPSTSSSTGRRRRRKLSGTILLEPS
QEAIPSILESPFLLMLPDLKSLLIDYFNVSAEASKFCTRLLEDVELTRSNSRSIQKSLDSIENCSSPETIETIASEFLALREPFSDPDKHDFALIHDDYEAVSRRLNCTR
KKVARKIKSIEIIETISCGLVAITARTLTELFRFPPESFRRKLLRYQMIRNGGLGEVGEQLEAAAKGSYILNREFDTTSRLVARLDDAMDHGKAMARLFVERKEDKFAVQ
VAMDELKKSNLRLRNQVEEVEEHLYLCIVTINRARGLVINQIQMKSHP
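The relationship between the CDS and the protein sequence can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (the page does not state which translation table was used); the names CODON_TABLE and translate are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first position varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 and last 15 nucleotides of the CDS listed above.
print(translate("ATGTGGCCGAAAATTAGG"))  # MWPKIR
print(translate("AAATCGCATCCTTAA"))     # KSHP
```

The two fragments reproduce the start (MWPKIR) and end (KSHP) of the protein sequence; passing the full CDS, with its line breaks, should work the same way.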