In May 1996, the International Telecommunications Union (ITU) ratified the H.323 specification, which defines how voice, data, and video traffic will be transported over IP–based local area networks; it also incorporates the T.120 data-conferencing standard (see Figure 10). The recommendation is based on the real-time protocol/real-time control protocol (RTP/RTCP) for managing audio and video signals.

Figure 10. H.323 Call Sequence
As such, H.323 addresses the core Internet-telephony applications by defining how delay-sensitive traffic, (i.e., voice and video), gets priority transport to ensure real-time communications service over the Internet. (The H.324 specification defines the transport of voice, data, and video over regular telephony networks, while H.320 defines the protocols for transporting voice, data, and video over integrated services digital network (ISDN).
H.323 is a set of recommendations, one of which is G.729 for audio codecs, which the ITU ratified in November 1995. Despite the ITU recommendation, however, the Voice over IP (VoIP) Forum in March 1997 voted to recommend the G.723.1 specification over the G.729 standard. The industry consortium, which is led by Intel and Microsoft, agreed to sacrifice some sound quality for the sake of greater bandwidth efficiency—G.723.1 requires 6.3 kbps, while G.729 requires 7.9 kbps. Adoption of the audio codec standard, while an important step, is expected to improve reliability and sound quality mostly for intranet traffic and point-to-point IP connections. To achieve PSTN–like quality, standards are required to guarantee Internet connections.
The transport protocol RTP, on which the H.323 recommendation is based, essentially is a new protocol layer for real-time applications; RTP–compliant equipment will include control mechanisms for synchronizing different traffic streams. However, RTP does not have any mechanisms for ensuring the on-time delivery of traffic signals or for recovering lost packets. RTP also does not address the so-called quality of service (QoS) issue related to guaranteed bandwidth availability for specific applications. Currently, there is a draft signaling-protocol standard aimed at strengthening the Internet's ability to handle real-time traffic reliably (i.e., to dedicate end-to-end transport paths for specific sessions much like the circuit-switched PSTN does). If adopted, the resource reservation protocol (RSVP), will be implemented in routers to establish and maintain requested transmission paths and quality-of-service levels.
Finally, there is a need for industry standards in the area of Internet-telephony directory services. Directories are required to ensure interoperability between the Internet and the PSTN, and most current Internet-telephony applications involve proprietary implementations. However, the lightweight directory access protocol (LDAP v3.0) seems to be emerging as the basis for a new standard.


